R Open Lab Reflections

Sharing is always good! I am really happy to have had a great opportunity to work for the Digital Center in my last semester at Columbia as an R Open Lab Intern in the Digital Center Internship Program. R, which I love most, is a really powerful tool for statistics and data science nowadays. Holding weekly R Open Lab and […]

Mid-Semester Reflection (Python Open Labs – Spring 2018)

Stuart Walesh, an author and consultant, once said: “The computer is incredibly fast, accurate, and stupid. Man is unbelievably slow, inaccurate, and brilliant. The marriage of the two is a challenge and opportunity beyond imagination.” Many of us use computers. Sometimes, the time we spend on them consumes the majority of our day. Whether or […]

Computationally Detecting Similar Books in Project Gutenberg

As one of the first digital libraries, Project Gutenberg has lived through a few generations of computers, digitization techniques, and textual infrastructures. It’s not surprising, then, that the corpus is fairly messy. Early transcriptions of some electronic texts, hand-keyed using only uppercase letters, were succeeded by better transcriptions, but without replacing the early versions. As […]
