user:zeman:interset:how-to-use [2008/03/14 09:59] zeman Note CSH. |
user:zeman:interset:how-to-use [2009/02/20 14:57] zeman Download is already available. |
===== Manual ===== | ===== Manual ===== |
| |
| |
==== Installation ==== | ==== Installation ==== |
| |
If you are on the ÚFAL network, you can directly use Dan's version here. Otherwise, you need to [[mailto:zeman@ufal.mff.cuni.cz|ask Dan]] for a zipped package of the currently available drivers. (I intend to make it available for download here some time later.) Unzip it to a convenient place; below, we assume it is in ''/home/zeman/interset''. | If you are on the ÚFAL network, you can directly use Dan's version here. Otherwise, you need to [[download]] a zipped package of the currently available drivers. Unzip it to a convenient place; below, we assume it is in ''/home/zeman/interset''. |
| |
**Contributions welcome!** If you write your own driver, please share it with others! If you send it to me, I will add it to the package for download here. | **Contributions welcome!** If you write your own driver, please share it with others! If you send it to me, I will add it to the package for download here. |
Note: This list may not be up-to-date. To see what drivers are currently available on your system, call ''driver-test.pl'' without arguments. | Note: This list may not be up-to-date. To see what drivers are currently available on your system, call ''driver-test.pl'' without arguments. |
| |
* tagset::ar::conll - Arabic CoNLL treebank (coarse, fine and feat fields in one string, delimited by tabs) | - tagset::ar::conll - Arabic CoNLL treebank (coarse, fine and feat fields in one string, delimited by tabs) |
* tagset::bg::conll - Bulgarian CoNLL treebank | - tagset::bg::conll - Bulgarian CoNLL treebank |
* tagset::cs::pdt - Czech positional tags of the Prague Dependency Treebank | - tagset::cs::conll - Czech CoNLL treebank, based on the Prague Dependency Treebank |
* tagset::da::conll - Danish CoNLL treebank | - tagset::cs::multext - Czech subset of the tagset from the Multext East project |
* tagset::en::conll - English CoNLL treebank (one-to-one mapping to en::penn) | - tagset::cs::pdt - Czech positional tags of the Prague Dependency Treebank |
* tagset::en::penn - English Penn Treebank | - tagset::da::conll - Danish CoNLL treebank |
* tagset::sv::conll - Swedish CoNLL treebank (one-to-one mapping to sv::mamba) | - tagset::de::conll - German CoNLL treebank (one-to-one mapping to de::stts) |
* tagset::sv::hajic - Tags output by Swedish tagger by Jan Hajič | - tagset::de::stts - German: Stuttgart-Tübingen Tagset (Tiger treebank) |
* tagset::sv::mamba - Swedish Mamba tags from Talbanken05 (CoNLL treebank) | - tagset::en::conll - English CoNLL treebank (one-to-one mapping to en::penn) |
* tagset::sv::svdahybrid - Dan's tagset, aiming at making distribution of tags from sv::hajic and da::conll as close as possible | - tagset::en::penn - English Penn Treebank |
* tagset::zh::conll - Chinese CoNLL treebank | - tagset::pt::conll - Portuguese CoNLL treebank (based on the Floresta treebank) |
| - tagset::sv::conll - Swedish CoNLL treebank (one-to-one mapping to sv::mamba) |
| - tagset::sv::hajic - Tags output by Swedish tagger by Jan Hajič |
| - tagset::sv::mamba - Swedish Mamba tags from Talbanken05 (CoNLL treebank) |
| - tagset::sv::svdahybrid - Dan's tagset, aiming at making distribution of tags from sv::hajic and da::conll as close as possible |
| - tagset::zh::conll - Chinese CoNLL treebank |
| |
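Judging only from the names in the driver list above, each driver identifier appears to follow the pattern ''tagset::<language code>::<tagset name>''. The following is a minimal Python sketch, not part of the package, that splits such identifiers and groups them by language. The helper names ''split_driver'' and ''group_by_language'' are hypothetical, and the three-part pattern is an assumption drawn from the list, not a documented rule.

```python
from collections import defaultdict

# Driver identifiers copied from the newer revision of the list above.
DRIVERS = [
    "tagset::ar::conll",
    "tagset::bg::conll",
    "tagset::cs::conll",
    "tagset::cs::multext",
    "tagset::cs::pdt",
    "tagset::da::conll",
    "tagset::de::conll",
    "tagset::de::stts",
    "tagset::en::conll",
    "tagset::en::penn",
    "tagset::pt::conll",
    "tagset::sv::conll",
    "tagset::sv::hajic",
    "tagset::sv::mamba",
    "tagset::sv::svdahybrid",
    "tagset::zh::conll",
]


def split_driver(name):
    """Split 'tagset::cs::pdt' into ('cs', 'pdt').

    Assumes the three-part 'tagset::<lang>::<tagset>' pattern seen in the list.
    """
    parts = name.split("::")
    if len(parts) != 3 or parts[0] != "tagset":
        raise ValueError(f"unexpected driver name: {name!r}")
    return parts[1], parts[2]


def group_by_language(names):
    """Map each language code to the tagset names listed for it, in list order."""
    groups = defaultdict(list)
    for name in names:
        lang, tagset = split_driver(name)
        groups[lang].append(tagset)
    return dict(groups)


if __name__ == "__main__":
    for lang, tagsets in sorted(group_by_language(DRIVERS).items()):
        print(lang, ", ".join(tagsets))
```

Because the list on this page may be out of date, the authoritative source for the drivers installed on your system remains the output of ''driver-test.pl''.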
=== Directory Structure === | === Directory Structure === |