[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision Both sides next revision
user:zeman:treebanks:eu [2011/11/29 09:24]
zeman Documentation of part of speech tags.
user:zeman:treebanks:eu [2011/11/29 09:34]
zeman Domain.
Line 28: Line 28:
 ==== Domain ==== ==== Domain ====
  
-Mixed (“GDT consists of randomly selected textual fragments and texts in three domains: politics (current affairsmanual transcripts and minutes of European parliamentary sessions), health, and travel.”)+Newswire + unknown (“25000 word forms from EPEC (Aduriz et al., 2003) and 25000 word forms coming from newspapers that can be considered equivalent to the other corpora in the project [3LBi.e. Catalan and Spanish]”; “EPECa corpus of written Basque tagged at morphological and syntactic levels for the automatic processing”).
  
 ==== Size ==== ==== Size ====

[ Back to the navigation ] [ Back to the content ]