Nelson Liu's Blog

tutorial

Software Archaeology: Re-generating the CoNLL 2000 Chunking Data

I've been using the data from the CoNLL 2000 shared task on syntactic chunking for some ongoing work, but the original dataset is tiny by modern standards. The train set...

Extracting last timestep outputs from PyTorch RNNs

Here's some code I've been using to extract the last hidden states from an RNN with variable length input....