Category Archives: Statistics & Data

Survey Documentation and Analysis (SDA)

Survey Documentation and Analysis (SDA) is a web based interface that allows access and analysis of data. The data can be accessed from IPUMS or from  the Inter-university Consortium for Political and Social Research (ICPSR).

SDA allows you to:

  • Browse the codebook describing a datasetsda
  • Calculate frequencies or crosstabulation (with charts)
  • Do comparison of means
  • Calculate a correlation matrix
  • Compare correlations
  • Perform multiple regression
  • Perform logit/probit regression
  • List values of individual cases
  • Recode variables (into public work area)
  • Compute a new variable
  • List/delete derived variables
  • Download a customized subset

SDA allows you to analyze data at a level appropriate to your level of experience. SDA can only analyze datasets that reside on an SDA server. If you would like to test drive SDA, or to see if SDA is useful for your research check out their General Information page.

R Open Labs – Basic Syntax

Hi all, last Wednesday we kicked off the first session of R Open Lab in the DSSC( based in Lehman Library). We started with basic syntax and briefly discussed how to explore the features of our datasets. We used data from Wal-Mart and we will continue exploring this dataset for the next few sessions. Beginners are welcome to join!


See you this Wednesday  10/12/2016 at 10:00 AM!

Python Open Labs Session – 2

In the second session of Python Open Labs, the focus was on conditional statements. Topics covered include : Conditional Operators, Conditional Statements, Boolean Expressions, Python’s obsession with indentation (and the idea of scope!), Two Way Decisions, Multi-way Decisions (using if-else, and elif).

The code on slides is in Python 2.7 – So if you use anything > 2.7, please remember to replace raw_input with input and add parenthesis  to your print statements!

The course material from all Python Open Labs Session is available on Lion Mail Drive link here.
PS: Google drive link requires Columbia UNI login, in case you don’t have one or you prefer using another email ID,  please request on the drive link to grant access and Kunal will do so.

R Open Lab: Week 1

I <3 R

Last week, we kicked off R Open Labs with a demo on Base Graphics, or how to make graphics using basic commands. We also gave a brief intro to Swirl, a great package for learning R.

You can catch up here with these helpful slides: Base Graphics System

Check back next week, Wednesday, Feb. 3 at 10 am, for a quick demo on how to load different data files into R, a free I <3 R button, and plenty of time to practice your code and ask questions.

Got feedback or want to suggest a package to demo? Leave a comment or take our short survey!