Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
user:zeman:treebanks:ja [2012/01/04 09:34] zeman Sample. |
user:zeman:treebanks:ja [2014/04/22 16:49] (current) zeman Updated link. |
||
---|---|---|---|
Line 1: | Line 1: | ||
===== Japanese (ja) ===== | ===== Japanese (ja) ===== | ||
- | [[http:// | + | [[http:// |
==== Versions ==== | ==== Versions ==== | ||
Line 42: | Line 42: | ||
==== Inside ==== | ==== Inside ==== | ||
- | The original morphosyntactic tags have been converted to fit into the three columns | + | The text has been romanized and the original characters |
- | The morphological analysis | + | The morphological analysis does not include lemmas. The part-of-speech |
- | + | ||
- | Some multi-word expressions have been collapsed into one token, using underscore as the joining character. This includes adverbially | + | |
==== Sample ==== | ==== Sample ==== | ||
Line 104: | Line 102: | ||
==== Parsing ==== | ==== Parsing ==== | ||
- | Nonprojectivities in DDT are not frequent. Only 988 of the 100,238 tokens in the CoNLL 2006 version are attached nonprojectively (0.99%). | + | Nonprojectivities in TüBa-J/ |
- | The results of the CoNLL 2006 shared task are [[http:// | + | The results of the CoNLL 2006 shared task are [[http:// |
^ Parser (Authors) ^ LAS ^ UAS ^ | ^ Parser (Authors) ^ LAS ^ UAS ^ | ||
- | | MST (McDonald et al.) | 84.79 | 90.58 | | + | | Basis (John O'Neil) | 90.57 | 93.16 | |
- | | Malt (Nivre et al.) | 84.77 | 89.80 | | + | | Nara (Yuchang Cheng) | 89.91 | 93.12 | |
- | | Riedel | + | | Malt (Nivre |