Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revision Both sides next revision | ||
user:zeman:treebanks:hu [2011/12/13 13:20] zeman Sample. |
user:zeman:treebanks:hu [2011/12/13 13:32] zeman Inside. |
||
---|---|---|---|
Line 54: | Line 54: | ||
==== Inside ==== | ==== Inside ==== | ||
- | Both versions (CoNLL 2007 and BDT-II) are in the CoNLL 2006/2007 format. | + | The original Szeged Treebank is a phrase-based treebank and it is distributed in XML-based, TEI-compliant format. The CoNLL 2007 version is dependency-based (i.e. the head of each phrase was identified), distributed |
- | The syntactic guidelines (structure and labels) are described in Spanish | + | Morphological annotation includes lemmas. Morphosyntactic tags were probably disambiguated manually. |
- | + | ||
- | Multi-word expressions have been collapsed | + | |
==== Sample ==== | ==== Sample ==== |