Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
user:zeman:interset:drivers [2008/04/04 09:09] zeman 1st person in Portuguese. |
user:zeman:interset:drivers [2009/02/16 15:57] zeman Český Multext. |
||
---|---|---|---|
Line 58: | Line 58: | ||
More than half of the time was consumed during testing for tuning tags containing the Sem feature. | More than half of the time was consumed during testing for tuning tags containing the Sem feature. | ||
+ | |||
+ | ==== Multext ==== | ||
+ | |||
+ | The tagset of the MULTEXT-EAST project and corpora. The file '' | ||
+ | |||
+ | Work started: 16.2.2009 | ||
===== Danish (da) ===== | ===== Danish (da) ===== | ||
Line 98: | Line 104: | ||
Work finished: 31.3.2008 | Work finished: 31.3.2008 | ||
Total work time: 10 min | Total work time: 10 min | ||
- | |||
- | |||
- | |||
- | |||
- | |||
===== Portuguese (pt) ===== | ===== Portuguese (pt) ===== | ||
Line 110: | Line 111: | ||
http:// | http:// | ||
http:// | http:// | ||
+ | |||
+ | Work started: 2.4.2008 | ||
+ | Work finished: 24.4.2008 | ||
+ | Total work time: 28:18 h | ||
+ | |||
+ | The CoNLL version of the Floresta tagset was a real pain. Not only is the tagset complex with many features, some of them strangely overlapping, | ||
| **Feature** | **Explanation** | **Examples** | | | **Feature** | **Explanation** | **Examples** | | ||
Line 249: | Line 256: | ||
| < | | < | ||
| < | | < | ||
- | | R | noise | 2 occurrences | | + | | R | noise; should be PR | 2 occurrences | |
| recohidas> | | recohidas> | ||
| < | | < |