Differences
This shows you the differences between two versions of the page.
user:zeman:treebanks:it [2012/01/03 15:38] zeman Sample. |
user:zeman:treebanks:it [2012/01/03 15:43] zeman Inside. |
||
---|---|---|---|
Line 42: | Line 42: | ||
==== Inside ==== | ==== Inside ==== | ||
- | The original | + | The original |
- | Morphological annotation includes lemmas. Morphosyntactic tags were probably disambiguated manually. The tagset used in SzTB seems to be the same as or similar to [[http:// | + | Morphological annotation includes lemmas. Morphosyntactic tags were probably disambiguated manually. In the CoNLL version, tags were decomposed into the CPOS column, the POS column, and a list of feature-value pairs in the FEAT column. |
- | Personal names have been collapsed into one token, using underscore as the joining character (e.g. Torgyán_József). | + | Multi-word expressions |
==== Sample ==== | ==== Sample ==== |
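
To illustrate the column decomposition and the underscore-joined names described under Inside, here is a minimal Python sketch of reading one token line. It assumes the ten-column, tab-separated CoNLL-X layout (ID, FORM, LEMMA, CPOSTAG, POSTAG, FEATS, HEAD, DEPREL, PHEAD, PDEPREL) with `|` between feature-value pairs and `=` inside each pair. The function name `parse_conll_token` is hypothetical, and the tag, feature, head and relation values in the example line are invented for illustration; they are not taken from the treebank.

```python
# Column names assumed from the CoNLL-X layout; not confirmed by this page.
CONLL_COLUMNS = [
    "id", "form", "lemma", "cpos", "pos",
    "feats", "head", "deprel", "phead", "pdeprel",
]


def parse_conll_token(line):
    """Split one tab-separated token line into a dict keyed by column name."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != len(CONLL_COLUMNS):
        raise ValueError("expected %d columns, got %d"
                         % (len(CONLL_COLUMNS), len(fields)))
    token = dict(zip(CONLL_COLUMNS, fields))

    # FEAT column: "_" means no features, otherwise "Name=value|Name=value".
    raw_feats = token["feats"]
    feats = {}
    if raw_feats != "_":
        for pair in raw_feats.split("|"):
            name, _, value = pair.partition("=")
            feats[name] = value
    token["feats"] = feats

    # Collapsed names use "_" as the joining character; a bare "_" form
    # is kept as a single part so it is not split into empty strings.
    form = token["form"]
    token["parts"] = [form] if form == "_" else form.split("_")
    return token


# Invented example line: only the word form comes from the page text.
example = "1\tTorgyán_József\tTorgyán_József\tN\tN\tSubPOS=p|Num=s\t2\tSUBJ\t_\t_"
token = parse_conll_token(example)
```

Keeping the features as a dictionary makes it easy to look up a single attribute without re-parsing the tag string, while `parts` recovers the individual words of a collapsed name.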