Institute of Formal and Applied Linguistics Wiki


Differences

This shows you the differences between two versions of the page.

user:zeman:treebanks:hi [2011/12/06 22:46]
zeman Parsing results.
user:zeman:treebanks:hi [2011/12/08 08:38]
zeman Alignment of numbers in tables.
Line 46: Line 46:
  
 ^ Part ^ Sentences ^ Chunks ^ Ratio ^
-| Training | 1501 | 13779 | 9.18 |
-| Development | 150 | 1250 | 8.33 |
-| Test | 150 | 1156 | 7.71 |
-| TOTAL | 1801 | 16185 | 8.99 |
+| Training |    1501 |  13779 |  9.18 |
+| Development |  150 |   1250 |  8.33 |
+| Test |         150 |   1156 |  7.71 |
+| TOTAL |       1801 |  16185 |  8.99 |
  
 The ICON 2010 version came with the data split into three parts: training, development and test. The intra-chunk dependencies have been added:
  
 ^ Part ^ Sentences ^ Chunks ^ Ratio ^ Words ^ Ratio ^
-| Training | 2972 | | | 64452 | 21.69 |
-| Development | 543 | | | 12616 | 23.23 |
-| Test | 321 | | | 6588 | 20.52 |
-| TOTAL | 3836 | | | 83656 | 21.81 |
+| Training |    2972 | | |  64452 |  21.69 |
+| Development |  543 | | |  12616 |  23.23 |
+| Test |         321 | | |   6588 |  20.52 |
+| TOTAL |       3836 | | |  83656 |  21.81 |
  
 I have counted the sentences and tokens (words) in the ''.conll'' files; the counts differ slightly from the statistics presented in (Husain et al., 2010).
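
The counting described above could be reproduced roughly as follows. This is only a sketch, not the script actually used for the tables: it assumes the usual CoNLL layout (one token per non-blank line, sentences separated by blank lines), and the function name ''count_conll'' and the sample tokens are hypothetical.

```python
from typing import Iterable, Tuple


def count_conll(lines: Iterable[str]) -> Tuple[int, int]:
    """Return (sentences, tokens) for CoNLL-style input.

    Assumption: every non-blank line is one token and a blank line
    ends the current sentence.
    """
    sentences = 0
    tokens = 0
    in_sentence = False
    for line in lines:
        if line.strip():
            tokens += 1
            in_sentence = True
        elif in_sentence:
            # A blank line closes the sentence that was open.
            sentences += 1
            in_sentence = False
    if in_sentence:
        # The last sentence may lack a trailing blank line.
        sentences += 1
    return sentences, tokens


# Hypothetical two-sentence sample (placeholder tokens, not treebank data).
sample = "1\tw1\t_\n2\tw2\t_\n3\tw3\t_\n\n1\tw4\t_\n2\tw5\t_\n"
n_sentences, n_tokens = count_conll(sample.splitlines())
ratio = round(n_tokens / n_sentences, 2)  # tokens per sentence
print(n_sentences, n_tokens, ratio)
```

For a real file, the open file object can be passed directly, e.g. ''count_conll(open(path, encoding="utf-8"))''. The Ratio columns in the tables are the same quotient rounded to two decimals, e.g. 64452 / 2972 for the training part.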
