[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
user:zeman:treebanks:hu [2011/12/13 13:42]
zeman Parsing results.
user:zeman:treebanks:hu [2011/12/13 22:52] (current)
zeman Personal names.
Line 57: Line 57:
  
 Morphological annotation includes lemmas. Morphosyntactic tags were probably disambiguated manually. The tagset used in SzTB seems to be same or similar to [[http://nl.ijs.si/ME/V4/msd/html/msd-hu.html|Multext-East]]. In the CoNLL version, tags were decomposed into CPOS column, POS column and the list of feature-value pairs in the FEAT column. Morphological annotation includes lemmas. Morphosyntactic tags were probably disambiguated manually. The tagset used in SzTB seems to be same or similar to [[http://nl.ijs.si/ME/V4/msd/html/msd-hu.html|Multext-East]]. In the CoNLL version, tags were decomposed into CPOS column, POS column and the list of feature-value pairs in the FEAT column.
 +
 +Personal names have been collapsed into one token, using underscore as the joining character (e.g. Torgyán_József).
  
 ==== Sample ==== ==== Sample ====

[ Back to the navigation ] [ Back to the content ]