Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
user:zeman:treebanks:hr [2014/07/17 20:59] zeman Size and Inside. |
user:zeman:treebanks:hr [2014/07/17 21:16] zeman |
||
---|---|---|---|
Line 42: | Line 42: | ||
All sentences in the improved pre-release version are manually annotated on morphological and syntactic levels. The officially available version 1 is a mixture of manual and automatic annotation, see the section on sizes above. | All sentences in the improved pre-release version are manually annotated on morphological and syntactic levels. The officially available version 1 is a mixture of manual and automatic annotation, see the section on sizes above. | ||
+ | |||
+ | The treebank is distributed in the [[: | ||
+ | |||
+ | In Version 1, if there is a token that has empty (" | ||
+ | |||
+ | All sentences in the improved pre-release contain dependency information; | ||
+ | |||
+ | The syntactic tags (DEPREL) are simplistic but somewhat inspired by the Prague Dependency Treebank, there are only 15 of them: | ||
+ | |||
+ | ^ Tag ^ Percent ^ Example ^ Description ^ | ||
+ | | Adv | 5% | Kosovu | adverbial modifier | | ||
+ | | Ap | 3% | Esat | appositional modifier, incl. first name attached to last name | | ||
+ | | Atr | 26% | privatizacije | attribute modifying a noun phrase | | ||
+ | | Atv | 2% | iskoristiti | ? | | ||
+ | | Aux | 7% | se | ? | | ||
+ | | Co | 3% | a | conjunction as coordination head (Prague-style coordinations) | | ||
+ | | Elp | 0.6% | Proces | ellipsis | | ||
+ | | Obj | 7% | privatizacije | object of a verb | | ||
+ | | Oth | 2% | Barem | other | | ||
+ | | Pnom | 2% | složen | nominal predicate attached to copula | | ||
+ | | Pred | 10% | analizira | predicate (verbal) | | ||
+ | | Prep | 10% | na | preposition | | ||
+ | | Punc | 13% | . | punctuation | | ||
+ | | Sb | 7% | Kosovo | subject | | ||
+ | | Sub | 4% | da | subordinating conjunction | | ||
+ | |||
+ | (The sum of the percentages exceeds 100% because of rounding.) | ||
==== XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX ==== | ==== XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX ==== |