Differences
This shows you the differences between two versions of the page.
user:zeman:treebanks:hi [2012/10/02 16:45] zeman HPST 2012 sample. |
user:zeman:treebanks:hi [2012/12/15 13:15] (current) zeman |
||
---|---|---|---|
Line 14: | Line 14: | ||
* Shakti Standard Format (SSF; native) | * Shakti Standard Format (SSF; native) | ||
* CoNLL format | * CoNLL format | ||
+ | * Hyderabad DT is what they call the older treebank with little data. This one is the Hindi treebank from a large NSF-sponsored project. | ||
+ | |||
There has been no official release of the treebank yet. There have been three as-is sample releases for the purposes of the NLP tools contests in parsing Indian languages, attached to the [[http:// | There has been no official release of the treebank yet. There have been three as-is sample releases for the purposes of the NLP tools contests in parsing Indian languages, attached to the [[http:// | ||
Line 67: | Line 69: | ||
^ Part ^ Sentences ^ Chunks ^ Ratio ^ Words ^ Ratio ^ | ^ Part ^ Sentences ^ Chunks ^ Ratio ^ Words ^ Ratio ^ | ||
- | | Training | | + | | Training | 12041 | | | 268093 | 22.27 | |
- | | Development | 1233 | | | 26416 | 21.42 | | + | | Development | 1233 | | | 26416 | 21.42 | |
- | | Test | | + | | Test | | | | | | |
- | | TOTAL | | | | | | | + | | TOTAL | |
==== Inside ==== | ==== Inside ==== |