Using R with ArcGIS

With a successful collaboration between DSSC and ESRI, a hands-on workshop on ESRI R plugin was presented by Shaun Walbridge, a senior developer from ESRI, on Wednesday, April 20. Shaun provided an in-depth tutorial on how to use R in ESRI, and answered questions from students and librarians.

Our audiences were from a broad background: librarians from Columbia and other institutions in NYC, PhD students, Master’s students, people from administrations, etc.

IMG_20160420_101218892

Presentation Highlights:

SP data types in R
  • 0D: SpatialPoints
  • 1D: SpatialLines
  • 2D: SpatialPolygons
  • 3D: Solid
  • 4D: Space-time
R — ArcGIS Bridge
  • Store your data in ArcGIS, access it quickly in R, return R objects back to ArcGIS native data types (e.g. geodatabase feature classes).
  • Knows how to convert spatial data to sp objects.
  • Package Documentation
  • Upcoming: Conda for managing R environments

For more information about this workshop, please visit http://github.com/scw/r-columbia-2016-talk

Thank you again for joining us!

 

 

Extended Hours and Study Hall

Can you believe it is already getting close to finals? We can’t either! To help you in your mad rush to the end of the year the library will begin extended hours and study hall starting April 17, 2016. The hours will be as follows:

Sunday: 11am-Midnight, Study Hall Midnight-4am
Monday-Thursday: 9am-Midnight, Study Hall 8am-9am, Midnight-4am
Friday:9am-10pm, Study Hall 8am-9am, 10pm-4am
Saturday: 10am-10pm, Study Hall 10pm-4am

Service desks will be closed Sunday-Thursday at 11:45PM and Friday & Saturday at 9:45PM from 4/18-5/12 during study hall hours. It will remain open as a study hall space from 4/18 midnight until 4AM and 4/18-4/11 8am-9am, midnight until 4am. We will close at Midnight on 5/12 and begin winter intersession hours on 5/13.

Happy Studying!

R Open Labs: mi

Last week, Professor Ben Goodrich stopped by R Open Labs to show us how to use the mi package, which he worked on with Professor Andrew Gelman.

As the name ‘mi’ suggests, this package does multiple imputations to help you predict missing values in your data sets.

missingdata

Screen capture of mi demo by Benjamin Goodrich

 

 

 

 

Check out the demo if you’d like to give the mi package a try!

R Open Labs continues every Weds, 10 am -12pm, through April 20th.  Drop in anytime! 

 

 

R Open Labs: QQPlot

Post by Ellie Ransom, Research Services Coordinator, Science and Engineering Libraries

At last week’s R Open Lab, we visualized our data to test our assumptions of normality using two of of R’s native commands, qqnorm and qqline.

We practiced using the built in dataset, trees, and looked at the variables Height and Volume separately.

Notice how the Height variable is basically normal, but the Volume variable appears to be skewed.

qqnorm(trees$Height)
qqline(trees$Height)

Inline image 2

qqnorm(trees$Volume)
qqline(trees$Volume)

Inline image 3

It might make sense to transform the skewed data before analyzing it!

qqnorm(log(trees$Volume))
qqline(log(trees$Volume))

Inline image 4

Share your thoughts and suggestions with us here, and we’ll see you next week!

R Open Labs: Loading Data

At last week’s R Open Lab, we explored two packages (Memisc and foreign) as well as some of R’s native commands for loading data files into our R Studio workspace.

View Slides

I <3 R

 

 

 

 

 

 

 

 

We practiced with ASCII, .TSV, .SAV, and .DTA files, all available from ICPSR, a data archive with a rich collection of social science data sets.

We’re taking a break from  R Open Labs this week, but we’ll be back on February 17 at 10 am in the DSSC.

Share your thoughts and suggestions with us here, and we’ll see you next week!

 

R Open Lab: Week 1

I <3 R

Last week, we kicked off R Open Labs with a demo on Base Graphics, or how to make graphics using basic commands. We also gave a brief intro to Swirl, a great package for learning R.

You can catch up here with these helpful slides: Base Graphics System

Check back next week, Wednesday, Feb. 3 at 10 am, for a quick demo on how to load different data files into R, a free I <3 R button, and plenty of time to practice your code and ask questions.

Got feedback or want to suggest a package to demo? Leave a comment or take our short survey!

Lehman Library Spring 2016 Hours

26777r Welcome back! We hope you had a relaxing winter recces! Lehman Library will be open from 9am-5pm on January 18th, 2016.Starting January 19th we will begin our normal semester hours which are as follows:

Monday-Thursday: 9am-Midnight
Friday: 9am-7pm
Saturday: 10am-6pm
Sunday: 11am-11pm

We look forward to seeing you soon!

R Open Labs this Spring in the DSSC

By WOCinTech Chat [CC BY-SA 2.0], via Flickr

By WOCinTech Chat [CC BY-SA 2.0], via Flickr

10am – 12pm
Every Wednesday
Digital Social Science Center

Stop by any Wednesday this Spring for R Open Labs. We start off every Open Lab with a 10 minute exploration of an R package or dataset; the rest of the time is yours to work on homework, research or building your coding skills in good company.

This Spring we’ll be digging into:

  • dplyr
  • ggplot2
  • Shiny
  • mySQL
  • your choice!

Beginners welcome!  We want to hear from you which datasets and packages you want to learn.

Continue reading

Lehman Library Winter Intersession Hours

c
Lehman Library will begin winter intersession hours starting Wednesday, December 23rd, 2015. We will be open from 9am-5pm that day then CLOSED December 24th-Decemeber 27th, 2015. We will also be closed December 31st-January 3rd, 2016. Winter intersession hours are as follows:

Monday-Friday: 9am-5pm
Saturday & Sunday:CLOSED

We will also be closed January 18th in observance of MKL day and begin Spring semester hours on January 19th.

Have a great break!

NHGIS Releases Standardized Tables for 2000 & 2010 Census

Comparing Census data from one decade the next can be difficult since boundaries  and variables can change. Trying to figure out those differences can be time consuming and sometimes frustrating. To help make this easier, NHGIS recently released 65 time series tables standardizing 2000 & 2010 100% count variables to the 2010 Census boundaries for ten geographic levels including tracts, block groups and 5 digit zip code tabulation areas (ZCTAs).

The tables are marked with a TS from within the Topic filter

NHGIS topic filter

Available standardized tables are listed in the Time Series Tables tab, and labeled Standardized to 2010 under the Geographic Integration column.

nhgis_tables

More information on time tables produced by the NHGIS can be found in the Data Documentation section, including both the geographically standardized tables as well as nominally integrated tables. The page includes methodology, and a PDF containing the complete list of variables within the time series datasets.