[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
user:zeman:interset:how-to-use [2008/03/14 09:59]
zeman Note CSH.
user:zeman:interset:how-to-use [2009/02/20 14:57]
zeman Download is already available.
Line 1: Line 1:
 ===== Manual ===== ===== Manual =====
- 
  
 ==== Installation ==== ==== Installation ====
  
-If you exist on the ÚFAL network, you can use directly Dan's version here. Otherwise, you need to [[mailto:zeman@ufal.mff.cuni.cz|ask Dan]] for a zipped package of the currently existing drivers. (I intend to maintain it here for download some time later.) Unzip it to a convenient place; below, we assume it is in ''/home/zeman/interset''.+If you exist on the ÚFAL network, you can use directly Dan's version here. Otherwise, you need to [[download]] a zipped package of the currently existing drivers. Unzip it to a convenient place; below, we assume it is in ''/home/zeman/interset''.
  
 **Contributions welcome!** If you write your own driver, please share it with others! If you send it to me, I will add it to the package for download here. **Contributions welcome!** If you write your own driver, please share it with others! If you send it to me, I will add it to the package for download here.
Line 12: Line 11:
 Note: This list may not be up-to-date. To see what drivers are currently available on your system, call ''driver-test.pl'' without arguments. Note: This list may not be up-to-date. To see what drivers are currently available on your system, call ''driver-test.pl'' without arguments.
  
-  tagset::ar::conll - Arabic CoNLL treebank (coarse, fine and feat fields in one string, delimited by tabs) +  tagset::ar::conll - Arabic CoNLL treebank (coarse, fine and feat fields in one string, delimited by tabs) 
-  tagset::bg::conll - Bulgarian CoNLL treebank +  tagset::bg::conll - Bulgarian CoNLL treebank 
-  tagset::cs::pdt - Czech positional tags of the Prague Dependency Treebank +  - tagset::cs::conll - Czech CoNLL treebank, based on the Prague Dependency Treebank 
-  tagset::da::conll - Danish CoNLL treebank +  - tagset::cs::multext - Czech subset of the tagset from the Multext East project 
-  tagset::en::conll - English CoNLL treebank (one-to-one mapping to en::penn) +  - tagset::cs::pdt - Czech positional tags of the Prague Dependency Treebank 
-  tagset::en::penn - English Penn Treebank +  tagset::da::conll - Danish CoNLL treebank 
-  tagset::sv::conll - Swedish CoNLL treebank (one-to-one mapping to sv::mamba) +  - tagset::de::conll - German CoNLL treebank (one-to-one mapping to de::stts) 
-  tagset::sv::hajic - Tags output by Swedish tagger by Jan Hajič +  - tagset::de::stts - German: Stuttgart-Tübingen Tagset (Tiger treebank) 
-  tagset::sv::mamba - Swedish Mamba tags from Talbanken05 (CoNLL treebank) +  - tagset::en::conll - English CoNLL treebank (one-to-one mapping to en::penn) 
-  tagset::sv::svdahybrid - Dan's tagset, aiming at making distribution of tags from sv::hajic and da::conll as close as possible +  tagset::en::penn - English Penn Treebank 
-  tagset::zh::conll - Chinese CoNLL treebank+  - tagset::pt::conll - Portuguese CoNLL treebank (based on the Floresta treebank) 
 +  - tagset::sv::conll - Swedish CoNLL treebank (one-to-one mapping to sv::mamba) 
 +  tagset::sv::hajic - Tags output by Swedish tagger by Jan Hajič 
 +  tagset::sv::mamba - Swedish Mamba tags from Talbanken05 (CoNLL treebank) 
 +  tagset::sv::svdahybrid - Dan's tagset, aiming at making distribution of tags from sv::hajic and da::conll as close as possible 
 +  tagset::zh::conll - Chinese CoNLL treebank
  
 === Directory Structure === === Directory Structure ===

[ Back to the navigation ] [ Back to the content ]