[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
user:zeman:interset:tagsets:conll-2006-sl [2010/04/23 07:44]
zeman vytvořeno
user:zeman:interset:tagsets:conll-2006-sl [2010/04/23 08:12] (current)
zeman Odkazy na dokumentaci Multext East.
Line 1: Line 1:
 ====== CoNLL 2006 Slovene ====== ====== CoNLL 2006 Slovene ======
  
-Data adapted from the Slovene Dependency Treebank. Tags have been reformatted. Statistics and examples below are taken from the training part.+Data adapted from the Slovene Dependency Treebank, which in turn is based on the //1984// novel from the Multext East V3 parallel corpus (http://nl.ijs.si/ME/V3/, http://nl.ijs.si/ME/V3/doc/index.html#mtev3-doc-div2-id2305296). Tags have been reformatted. Statistics and examples below are taken from the training part.
  
   * 1534 sentences   * 1534 sentences
   * 728 tags   * 728 tags
  
 +===== Documentation =====
 +
 +  * Tomaž Erjavec, Peter Holozan, Vojko Gorjanc, Marko Stabej: MULTEXT-East Morphosyntactic Specifications Version 3.0 for Slovene, Ljubljana, Slovenia, 2004.
 +    * http://nl.ijs.si/ME/V3/msd/html/
 +    * common http://nl.ijs.si/ME/V3/msd/html/msd.html#SECTION04000000000000000000
 +    * Slovene http://nl.ijs.si/ME/V3/msd/html/msd.html#SECTION05600000000000000000
 +  * http://localhost/cgi/tags/index.pl?corpus=conll-2006-sl

[ Back to the navigation ] [ Back to the content ]