Nelson Liu's Blog

Discovering the World of Academic Podcasts

I was never a big TV-watcher or radio-listener, so I assumed podcasts weren't for me. However, I decided to give it a go after AI2 (specifically Waleed Ammar and Matt...

Paraphrase Identification Models in Tensorflow

I've been loosely hacking on the Quora Question Pairs dataset in my free time to get some more experience working with vanilla Tensorflow for NLP in a practical setting. Yesterday,...

Research Update: Extracting Phonesthemes From Word Vectors

In the fall, I was awarded a Mary Gates Research Scholarship for a project seeking to extract phonesthemes (submorphemic sound clusters that seem to have meaning, more on what this...

Installing and Updating GTX 1080 Ti Drivers / CUDA on Ubuntu

I recently had to figure out how to set up a new machine with NVIDIA's new GTX 1080 Ti graphics card for use with CUDA-enabled machine learning libraries, e.g....

Making autoenv + conda faster

I've recently switched over to using the fantastic autoenv to automatically activate my anaconda environments and set necessary environment variables when I enter a directory on my terminal. You basically...

AI2 Research Internship Recap

I took Winter quarter 2017 off at UW to do a research internship at the Allen Institute for Artificial Intelligence (AI2), where I worked on neural reading comprehension models for...

scikit-learn test_size and train_size pitfalls and coming changes

I recently authored a scikit-learn PR to edit the behavior of train_size and test_size in most of the classes that use it; I thought that their interaction was...

My Journey in Open Source / How to Get Started Contributing

I just finished the Google Summer of Code Program, wherein I worked on the Python machine learning package scikit-learn. Since I began working with the project in November 2015, I've...

scikit-learn GSoC Summary, Lessons Learned, and Future Work

This summer, I was quite fortunate to work on the scikit-learn project with my mentors Jacob Schreiber and Raghav RV as part of the Google Summer of Code Program. I...

(GSoC Week 10) scikit-learn PR #6954: Adding pre-pruning to decision trees

The scikit-learn pull request I opened to add impurity-based pre-pruning to DecisionTrees and the classes that use them (e.g. the RandomForest, ExtraTrees, and GradientBoosting ensemble regressors and classifiers) was...