Nelson Liu's Blog

machine learning

Paraphrase Identification Models in Tensorflow

I've been loosely hacking on the Quora Question Pairs dataset in my free time to get some more experience working with vanilla Tensorflow for NLP in a practical setting. Yesterday,...

Installing and Updating GTX 1080 Ti Drivers / CUDA on Ubuntu

I recently had to figure out how to set up a new Ubuntu 16.04 machine with NVIDIA's new GTX 1080 Ti graphics card for use with CUDA-enabled machine learning...

AI2 Research Internship Recap

I took Winter quarter 2017 off at UW to do a research internship at the Allen Institute for Artificial Intelligence (AI2), where I worked on neural reading comprehension models for...

scikit-learn test_size and train_size pitfalls and coming changes

I recently authored a scikit-learn PR to edit the behavior of train_size and test_size in most of the classes that use it; I thought that their interaction was...

scikit-learn GSoC Summary, Lessons Learned, and Future Work

This summer, I was quite fortunate to work on the scikit-learn project with my mentors Jacob Schreiber and Raghav RV as part of the Google Summer of Code Program. I...

(GSoC Week 10) scikit-learn PR #6954: Adding pre-pruning to decision trees

The scikit-learn pull request I opened to add impurity-based pre-pruning to DecisionTrees and the classes that use them (e.g. the RandomForest, ExtraTrees, and GradientBoosting ensemble regressors and classifiers) was...

(GSoC Week 4) MAE and Median Calculation

In the first part of my project, I am implementing the Mean Absolute Error criterion for the scikit-learn DecisionTreeRegressor. In this blog post, I'll talk about what the criterion does,...

(GSoC Week 2) Intro to decision trees

Apologies for the late post, I had this sitting in my drafts and forgot to publish it! Decision Trees (DTs) are a non-parametric supervised learning method used for classification and...