Differences
This shows you the differences between two versions of the page.
user:zeman:treebanks:te [2012/03/22 11:46] zeman ICON 2009 Telugu data size. |
user:zeman:treebanks:te [2012/03/22 16:47] zeman ICON 2010 Telugu data size. |
||
---|---|---|---|
Line 51: | Line 51: | ||
| TOTAL | | TOTAL | ||
- | The data distributed | + | As for ICON 2010, the data description in [[http:// |
- | ^ Part ^ Sentences ^ Chunks ^ Ratio ^ Words ^ Ratio ^ | + | ^ Part ^ Sentences ^ Chunks ^ Ratio ^ PSentences |
- | | Training | 1400 | ? | ? | 7602 | 5.43 | | + | | Training |
- | | Development | 150 | ? | ? | 839 | 5.59 | | + | | Development | |
- | | Test | 150 | ? | ? | 836 | 5.57 | | + | | Test | |
- | | TOTAL | 1700 | ? | ? | 9277 | 5.46 | | + | | TOTAL |
- | + | ||
- | We drew our training and test data from the ICON 2010 datasets but we have fewer sentences – why? | + | |
- | + | ||
- | ^ Part ^ Sentences ^ Chunks ^ Ratio ^ | + | |
- | | Training | + | |
- | | Test | + | |
- | | TOTAL | + | |
==== Inside ==== | ==== Inside ==== |