[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
user:zeman:interset:drivers [2011/06/27 16:12]
zeman New drivers during the last year.
user:zeman:interset:drivers [2014/07/17 16:32]
zeman hr::multext
Line 45: Line 45:
  
 Most of the time was dedicated to extracting, transcribing and translating examples in an effort to understand the tag classes. Most of the time was dedicated to extracting, transcribing and translating examples in an effort to understand the tag classes.
 +
 +===== Croatian (hr) =====
 +
 +==== Multext ====
 +
 +The tagset of the MULTEXT-EAST project as used in the SETimes.HR corpus. Documentation lists 1291 tags, we removed one wrong tag and kept 1290.
 +
 +Work started: 16.7.2014
 +Work finished: 17.7.2014
 +Total work time: 5:45 h
 +
 +This is the second Multext-East tagset covered by DZ Interset. Adding it was not too difficult because much of the previous effort on ''​cs::​multext''​ could be reused.
  
 ===== Czech (cs) ===== ===== Czech (cs) =====
Line 324: Line 336:
 | <​vfin>​ | noise; should be <​co-vfin>​ | | | <​vfin>​ | noise; should be <​co-vfin>​ | |
 | VFIN | noise | há od haver | | VFIN | noise | há od haver |
 +
 +===== Slovak (sk) =====
 +
 +==== Slovenský národný korpus (SNK) ====
 +
 +1457 structured tags.
 +
 +Total work time: 5:32 hours.
  
 ===== Swedish (sv) ===== ===== Swedish (sv) =====

[ Back to the navigation ] [ Back to the content ]