Software Archaeology: Re-generating the CoNLL 2000 Chunking Data October 27, 2018 research, tooling, tutorial, open source, software archaeology, nlp I've been using the data from the CoNLL 2000 shared task on syntactic chunking for some ongoing work, but the original dataset is tiny by modern standards. The train set...