Software - Sabine Buchholz

The chunklink script

The Perl script chunklink.pl converts Penn Treebank II files into a one-word-per-line format that contains at least the same information as the original files.
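
To make the idea of a one-word-per-line format concrete, here is a minimal Python sketch that flattens a bracketed Penn Treebank-style parse into one "word POS" line per token. It is not chunklink.pl and does not reproduce its output columns, which are not described here. The function name flatten_tree and the two-column layout are assumptions made for illustration only. Unlike the script, this sketch discards the tree structure and keeps only the words and their part-of-speech tags.

```python
import re

# Hypothetical helper, not part of chunklink.pl.
# A leaf in a bracketed parse looks like "(POS word)": an opening bracket,
# a tag, whitespace, a token, and a closing bracket, with no nested brackets.
LEAF = re.compile(r"\(([^\s()]+)\s+([^\s()]+)\)")


def flatten_tree(bracketed):
    """Return a list of (word, pos) pairs, one per leaf, in sentence order."""
    return [(word, pos) for pos, word in LEAF.findall(bracketed)]


if __name__ == "__main__":
    tree = "(S (NP (DT The) (NN cat)) (VP (VBD sat)) (. .))"
    # Print one token per line: the word, then its part-of-speech tag.
    for word, pos in flatten_tree(tree):
        print(word, pos)
```

Each printed line holds one token followed by its tag. A format that keeps all of the treebank's information would need further fields per line to encode the constituent structure that this sketch throws away.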

This script was used to generate the data for the CoNLL-2000 Shared Task.

Earlier, slightly different versions of the script were used to produce the data for the experiments described in:
Daelemans, Buchholz, Veenstra. 1999. Memory-Based Shallow Parsing.
Buchholz, Veenstra, Daelemans. 1999. Cascaded Grammatical Relation Assignment.

DRAFT of a readme file.