
Institute of Formal and Applied Linguistics Wiki



Differences

This shows you the differences between two versions of the page.

pub-company:icon2009 [2009/10/20 17:16]
stranak
pub-company:icon2009 [2009/10/20 23:08]
stranak
Line 33:

 ==== Out of Vocabulary ====
-| data | tokens | types | tokens in train | types in train |
-| Tides-train-en | 1226144 | 48048 | |
-| Tides-train-hi | 1312435 | 53451 | |
-| Tides+DP11-train-en | 1402536 | 52947 | |
-| Tides+DP11-train-hi | 1434543 | 57131 | ||
+No data have been lemmatised, so all counts are over word forms.
+^ data                    ^ tokens  ^ types ^
+| **Tides-train-en**      | 1226144 | 48048 |
+| **Tides-train-hi**      | 1312435 | 53451 |
+| **Tides+DP11-train-en** | 1402536 | 52947 |
+| **Tides+DP11-train-hi** | 1434543 | 57131 |
+| **Tides-dev**           | 1434543 | 57131 |
+| **Tides-test**          | 1434543 | 57131 |
+
+
+^ Coverage ^^^
+|                | **tokens in train** | **types in train** |
+| **Tides-test** |                     |                    |
+| **Tides-dev**  |                     |                    |
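
The still-empty coverage cells could be filled in along the following lines. This is only a sketch, not the script actually used for the page. It assumes "tokens in train" and "types in train" mean the share of test/dev tokens and types (word forms, not lemmas) that also occur in the training data. The function name `coverage` and the toy sentences are hypothetical, and whitespace tokenisation is an assumption.

```python
from collections import Counter


def coverage(train_tokens, eval_tokens):
    """Return (token_coverage, type_coverage) of eval_tokens
    with respect to the vocabulary of train_tokens.

    Both arguments are iterables of word forms (no lemmatisation).
    """
    train_vocab = set(train_tokens)
    eval_counts = Counter(eval_tokens)

    total_tokens = sum(eval_counts.values())
    total_types = len(eval_counts)
    if total_tokens == 0:
        # Nothing to evaluate; report zero coverage instead of dividing by zero.
        return 0.0, 0.0

    # Tokens: every occurrence of a form seen in training counts.
    seen_tokens = sum(c for w, c in eval_counts.items() if w in train_vocab)
    # Types: each distinct form seen in training counts once.
    seen_types = sum(1 for w in eval_counts if w in train_vocab)

    return seen_tokens / total_tokens, seen_types / total_types


# Toy data standing in for the real corpora (assumption: whitespace tokenisation).
train = "the cat sat on the mat".split()
test = "the dog sat on the rug".split()
tok_cov, typ_cov = coverage(train, test)
print(f"tokens in train: {tok_cov:.2%}, types in train: {typ_cov:.2%}")
```

The out-of-vocabulary rate is then simply one minus the respective coverage value.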
