Nelson Liu's Blog

tooling

Software Archaeology: Re-generating the CoNLL 2000 Chunking Data

I've been using the data from the CoNLL 2000 shared task on syntactic chunking for some ongoing work, but the original dataset is tiny by modern standards. The train set...

Extracting last timestep outputs from PyTorch RNNs

Here's some code I've been using to extract the last hidden states from an RNN with variable length input....

Making autoenv + conda faster

I've recently switched over to using the fantastic autoenv to automatically activate my anaconda environments and set necessary environment variables when I enter a directory on my terminal. You basically...