Author Archives: Yue Jin

R Open Lab – Merge and Filter Data

During the first 20-30 minutes of yesterday’s open lab, we talked about how to merge datasets and filter data using base R and dplyr package. The rest of the open lab were free discussions between participants and instructors.

Thank you to all who showed up!

Welcome to explore the materials I used for the open lab:

https://drive.google.com/open?id=0B4uO8w_uGtsCWjRId0p6QllKZlE

Enjoy the spring!

R Open Lab – ggplot

Data visualization is an integral part of data exploration and presentation. Yesterday, we talked about ggplot2, a package which provides a mature and consistent system for plotting in R.

We explored the advantage and disadvantages of ggplot2, the syntax and usage of the package.

As always, thank you to everyone who showed up.

Materials I used for the open lab can be found here.

https://drive.google.com/open?id=0B4uO8w_uGtsCMUVoLXl1Z0JsSlU

Enjoy spring! ❤️

R Open Labs – Apply Family

This Wednesday we talked about apply function family in base R. We covered apply(), tapply(), lapply(), sapply() and vapply(). We also briefly introduced the concept of factors in R.

As always, thank you to all who showed up! Next two weeks’ R Open Lab will be cancelled due to midterm week and spring break.

Good luck on your midterms and have a wonderful spring break! 😄

R Open Labs – readr Package

Today we introduced readr package. It is a package used for reading csv/xls/txt etc. data. It is designed to flexibly parse many types of data found in the wild, while still cleanly failing when data unexpectedly changes. 

We covered the functionality of the package and the difference between this package and base R.

Next week we will talk about apply family.

See you next Wednesday from 10 am – 12 pm at DSSC (Lehman Social Science Library Room 215)!

R Open Labs – Linear Regression

Thank you to all who showed up last week!

Last Wednesday we went over doing linear regression in R, including: how to fit and interpret linear models, evaluate model validity and select among competing models.

See you this Wednesday from 10 am – 12 pm at DSSC (Lehman Social Science Library Room 215)!

Materials I used for this open lab is posted here. https://drive.google.com/open?id=0B4uO8w_uGtsCQTJ6aUNFMUJpZVE

R Open Labs – Shiny

Hope you guys enjoyed the snow. ❄️

This Wednesday we talked about Shiny. We discussed why we use R Shiny and how to create a Shiny application. Thank you to all who showed up! Hoping to see you next Wednesday 02/15/2017 10:00-11:00 at DSSC. The R Open Lab for next week will be shorter than usual because we will be hosting an R workshop about Data Wrangling right after the open lab 12:00 pm -1:00 pm at the Science & Engineering Library!

Looking forward to seeing you guys! 😊

R Open Lab – Writing Loops

The topic for today is writing loops in R. Looping is equivalent to iterating or just replicating instructions. By letting the computer take over the repetitive work, we make our life much easier. We also talked a bit about how to avoid writing loops and make our code more efficient. 

Thank you to everyone who showed up today. Today’s session is the last session for this semester. Good luck with the finals and see you guys next year! 😀

R Open Lab – dplyr package

This Wednesday we talked about how to use the dplyr package in R. The dplyr package makes data manipulation faster, easier and more straightforward. We covered the syntax and functions of the package, and tried it out on our Walmart Store Sales dataset. We also briefly introduced R Notebook and R presentation.

R open lab will be suspended next Wednesday due to the Thanksgiving holiday.

See you guys on Wednesday 11/30/2016 10:00-12:00 at DSSC!

Also, we will host a R workshop about Simple Linear Regression this Friday 11/19/2016 10:00-11:00 at the Science & Engineering LIbrary. Beginners are welcome!