Obtain data
For this shared task, we have collected and converted treebanks
for a number of languages. Although we would like to offer you this
data for free, the reality is that most of these resources are
protected and require you to sign a license first. We therefore have
four kinds of data:
- Free data
This is data that you can download for free ("open source").
- Data requiring hard-copy license
This is data that you can download from this site once you have sent us a
signed hard copy (i.e. a letter, not a fax) of the Software License Agreement.
This is required for only one data set.
- Data requiring a faxed license
This is data that you can download from this site, once you have
signed and faxed the Software License Agreement for use of the
original treebank. For convenience, we, the CoNLL Shared Task organizers,
act as intermediaries in this process.
- Data available from LDC
This is data that you can download from the Linguistic Data Consortium
(LDC), once you have signed and faxed them the appropriate Software
License Agreement.
We have tried our best to minimize the effort required from
participants, and we apologize for the remaining inconvenience.