Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
pub-company:icon2009 [2009/10/20 17:16] stranak |
pub-company:icon2009 [2009/10/21 17:22] stranak |
||
---|---|---|---|
Line 33: | Line 33: | ||
==== Out of Vocabulary ==== | ==== Out of Vocabulary ==== | ||
- | | data | tokens | + | No data have been lemmatised, so all the numbers mean forms. |
- | | Tides-train-en | 1226144 | 48048 | || | + | ^ |
- | | Tides-train-hi | 1312435 | 53451 | || | + | ^ data ^ tokens |
- | | Tides+DP11-train-en | 1402536 | 52947 | || | + | | **Tides-train-en** |
- | | Tides+DP11-train-hi | 1434543 | 57131 | || | + | | **Tides-train-hi** |
+ | | **Tides+DP11-train-en** | 1402536 | 52947 | | ||
+ | | **Tides+DP11-train-hi** | 1434543 | 57131 | | ||
+ | | **Tides-dev-en** | ||
+ | | **Tides-dev-hi** | ||
+ | | **Tides-test-en** | ||
+ | | **Tides-test-hi** | ||
+ | |||
+ | |||
+ | ^ | ||
+ | | | **tokens seen in train** | ||
+ | | | ||
+ | | | abs | OOV | abs | OOV | abs | OOV | abs | OOV | | ||
+ | | **Tides-test-en** | | ||
+ | | **Tides-test-hi** | | ||
+ | | **Tides-dev-en** | ||
+ | | **Tides-dev-hi** |