Nelson Liu's Blog

Flattening the Gigaword Corpus

Code for flattening the Gigaword corpus and associated usage instructions are at nelson-liu/flatten_gigaword The English Gigaword Corpus is a massive collection of newswire text; the unzipped corpus is...

Self-hosted CI for Research, Part 1: Running Jenkins builds in Docker Containers

This post is part of my series on setting up a self-hosted continuous integration server. Part 0 has a table of contents. Using Docker containers for Jenkins builds is attractive...

Self-hosted CI for Research, Part 0: Introduction and Motivation

This month, I'll be writing about how I set up my self-hosted continuous integration setup (powered by Jenkins and Docker). In this initial post, I wanted to provide some motivation...

Some work I liked at ACL 2017

I was fortunate to attend ACL 2017 last week in Vancouver. There was a lot of great work on a variety of topics, and I'll quickly mention some papers/talks/...

Discovering the World of Academic Podcasts

I was never a big TV-watcher or radio-listener, so I assumed podcasts weren't for me. However, I decided to give it a go after AI2 (specifically Waleed Ammar and Matt...