[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
user:zeman:interset:drivers [2009/03/25 21:52]
zeman en::conll2009
user:zeman:interset:drivers [2009/09/08 18:15]
zeman pl::ipipan finished.
Line 93: Line 93:
 Total work time: about 3 hours Total work time: about 3 hours
  
-==== CoNLL Tagset ====+==== CoNLL 2006 ====
  
 The driver is just an envelope around the ''en::penn'' driver. The driver is just an envelope around the ''en::penn'' driver.
Line 99: Line 99:
 Total work time: 48 minutes Total work time: 48 minutes
  
-==== CoNLL 2009 Tagset ====+==== CoNLL 2009 ====
  
 Another envelope around the ''en::penn'' driver. However, three new tags required changes even in the older drivers: ''HYPH'', ''AFX'' (''PRF'') and ''NIL''. Another envelope around the ''en::penn'' driver. However, three new tags required changes even in the older drivers: ''HYPH'', ''AFX'' (''PRF'') and ''NIL''.
  
-Work started: 25.3.2008 +Work started: 25.3.2009 
-Work finished: 25.3.2008+Work finished: 25.3.2009
 Total work time: 2:57 h Total work time: 2:57 h
  
Line 119: Line 119:
 Total work time: 4:00 h Total work time: 4:00 h
  
-==== CoNLL (derived from STTS) ====+==== CoNLL 2006 ====
  
 Only simple envelope around the STTS driver needed. Only simple envelope around the STTS driver needed.
Line 126: Line 126:
 Work finished: 31.3.2008 Work finished: 31.3.2008
 Total work time: 10 min Total work time: 10 min
 +
 +
 +==== CoNLL 2009 ====
 +
 +This tagset is derived from the STTS, too. Unlike CoNLL 2006, there are also morphological features this time, which required additional processing effort.
 +
 +Work started: 5.4.2009
 +Work finished: 6.4.2009
 +Total work time: 9:39 h
 +
 +
 +===== Polish (pl) =====
 +
 +Based on the [[http://korpus.pl/index.php|Korpus Języka Polskiego IPI PAN]]. (Saša tyhle značky potřebuje zpracovat v Intercorpu.)
 +
 +Work started: 4.9.2009
 +Work finished: 8.9.2009
 +Total work time: 9:54 hours
  
 ===== Portuguese (pt) ===== ===== Portuguese (pt) =====

[ Back to the navigation ] [ Back to the content ]