[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
ufal:tasks [2012/01/19 12:13]
ufal
ufal:tasks [2012/01/23 11:07]
ufal
Line 54: Line 54:
  
 ===== Part-of-Speech Tagging ===== ===== Part-of-Speech Tagging =====
 +
 +=== POS Taggers integrated in Treex ===
 +  * Featurama
 +  * Morce
 +  * MxPost tagger
 +  * Tree tagger
 +  * TnT tagger
 +  * Jan Hajič's tagger
 +  * a number of toy tagger prototypes (students' assignments) for about ten languages
 +
 +=== Details on Czech Tagging ===
 +A Guide to Czech Language Tagging at UFAL  http://ufal.mff.cuni.cz/czech-tagging/
  
 ===== Lemmatization ===== ===== Lemmatization =====
 +=== Lemmatizers integrated in Treex ===
 +* Martin Popel's lemmatizer for English
 +* a number of toy lemmatizers for about ten langauges (students' homeworks)
 +* for Czech, lemmatization is traditionally treated as a part of POS disambiguations, so almost all Czech taggers are capable of lemmatization
  
 ===== Analytical Parsing ===== ===== Analytical Parsing =====
 +=== Analytical parsers integrated in Treex ===
 +* Ryan McDonald's MST parser
 +* Rudolf Rosa's MST parser
 +* MALT parser
 +* ZPar
 +* Stanford parser
 +
 +=== Details on Czech parsing ===
 +A Complete Guide to Czech Language Parsing http://ufal.mff.cuni.cz/czech-parsing/
 +
  
 ===== Tectogrammatical Parsing ===== ===== Tectogrammatical Parsing =====
 +=== Conversion of analytical trees to tectogrammatical trees integrated in Treex ===
 +* a scenario for rule-based tree transformation
 +* Ondřej Dušek's tools for functor assignment trained on PDT and PCEDT
  
 ===== Named Entity Recognition ===== ===== Named Entity Recognition =====
 +=== NE recognizers integrated in Treex ===
 +* Jana Straková's SVM based recognizer for Czech http://www.aclweb.org/anthology/W/W09/W09-3538.pdf
 +* Stanford Named Entity Recognizer for Czech
  
 ===== Machine Translation ===== ===== Machine Translation =====
 +
 +=== MT implemented in Treex ===
 +* elaborated English->Czech tecto-based translation
 +* prototype of Czech->English tecto-based translation
  
 ===== Coreference resolution ===== ===== Coreference resolution =====

[ Back to the navigation ] [ Back to the content ]