Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Last revision Both sides next revision | ||
user:zeman:treebanks:sv [2012/01/17 14:11] zeman Sample. |
user:zeman:treebanks:sv [2012/01/17 14:23] zeman Parsing. |
||
---|---|---|---|
Line 47: | Line 47: | ||
==== Inside ==== | ==== Inside ==== | ||
- | The original morphosyntactic | + | The morphological analysis in the CoNLL 2006 version does not include lemmas. The part-of-speech |
- | + | ||
- | The morphological analysis in the CoNLL 2006 version does not include lemmas (the original DTAG version does contain them). The morphosyntactic tags have been assigned (probably) manually. | + | |
- | + | ||
- | Some multi-word expressions have been collapsed into one token, using underscore as the joining character. This includes adverbially used prepositional phrases (e.g. i_lørdags = on Saturdays) but not named entities. | + | |
==== Sample ==== | ==== Sample ==== | ||
Line 92: | Line 88: | ||
==== Parsing ==== | ==== Parsing ==== | ||
- | Nonprojectivities in DDT are not frequent. Only 988 of the 100,238 tokens in the CoNLL 2006 version are attached nonprojectively (0.99%). | + | Nonprojectivities in Talbanken |
- | The results of the CoNLL 2006 shared task are [[http:// | + | The results of the CoNLL 2006 shared task are [[http:// |
^ Parser (Authors) ^ LAS ^ UAS ^ | ^ Parser (Authors) ^ LAS ^ UAS ^ | ||
- | | MST (McDonald et al.) | 84.79 | 90.58 | | + | | Microsoft |
- | | Malt (Nivre et al.) | 84.77 | 89.80 | | + | | Malt (Nivre et al.) | 84.58 | 89.50 | |
- | | Riedel | + | | Illinois (Do and Chang) | 82.31 | 89.05 | |
+ | | MST (McDonald | ||
+ | | Kenji Sagae | 82.00 | 88.57 | | ||
+ | | Nara (Yuchang Cheng) | 81.08 | 88.57 | | ||
+ | | Basis (John O' | ||
+ | | Riedel et al. | 80.66 | 88.33 | | ||