Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revision Both sides next revision | ||
user:zeman:treebanks:ru [2012/01/13 18:04] zeman Sample. |
user:zeman:treebanks:ru [2012/01/13 18:13] zeman Inside. |
||
---|---|---|---|
Line 40: | Line 40: | ||
==== Size ==== | ==== Size ==== | ||
- | There are 497,465 tokens in 34895 sentences, yielding 14.26 tokens per sentence on average. The original data was not split to training and test. In our HamleDT experiments, | + | There are 497,465 tokens in 34895 sentences, yielding 14.26 tokens per sentence on average. The original data was not split to training and test. In our HamleDT experiments, |
==== Inside ==== | ==== Inside ==== | ||
- | We have a Treex reader for the Syntagrus | + | The native |
- | + | ||
- | Both versions | + | |
Part of speech tag description (obtained per e-mail from Koldo Gojenola, thanks!): | Part of speech tag description (obtained per e-mail from Koldo Gojenola, thanks!): |