Differences
This shows you the differences between two versions of the page.
| ufal:tasks [2012/01/23 10:54] ufal | ufal:tasks [2012/01/23 11:15] (current) ufal | ||
|---|---|---|---|
| Line 44: | Line 44: | ||
| | **contact: | | **contact: | ||
| + | === Tokenizers integrated in Treex === | ||
| + | * rule-based (regular-expression) tokenizers | ||
| + | * trainable tokenizer TextSeg | ||
| ===== Language Identification ===== | ===== Language Identification ===== | ||
| + | Martin Majliš' | ||
| ===== Sentence Segmentation ===== | ===== Sentence Segmentation ===== | ||
| + | === Segmenters integrated in Treex === | ||
| + | * rule-based segmenters | ||
| + | * TextSeg (trainable) | ||
| ===== Morphological Segmentation ===== | ===== Morphological Segmentation ===== | ||
| ===== Morphological Analysis ===== | ===== Morphological Analysis ===== | ||
| + | === Morphological Analyzers integrated in Treex === | ||
| + | * Jan Hajič' | ||
| + | * toy analyzers for about ten languages (students' | ||
| ===== Part-of-Speech Tagging ===== | ===== Part-of-Speech Tagging ===== | ||
| Line 68: | Line 78: | ||
| ===== Lemmatization ===== | ===== Lemmatization ===== | ||
| - | |||
| === Lemmatizers integrated in Treex === | === Lemmatizers integrated in Treex === | ||
| - | |||
| * Martin Popel' | * Martin Popel' | ||
| * a number of toy lemmatizers for about ten languages (students' | * a number of toy lemmatizers for about ten languages (students' | ||
| Line 76: | Line 84: | ||
| ===== Analytical Parsing ===== | ===== Analytical Parsing ===== | ||
| + | === Analytical parsers integrated in Treex === | ||
| + | * Ryan McDonald' | ||
| + | * Rudolf Rosa's MST parser | ||
| + | * MALT parser | ||
| + | * ZPar | ||
| + | * Stanford parser | ||
| + | |||
| + | === Details on Czech parsing === | ||
| + | A Complete Guide to Czech Language Parsing http:// | ||
| + | |||
| ===== Tectogrammatical Parsing ===== | ===== Tectogrammatical Parsing ===== | ||
| + | === Conversion of analytical trees to tectogrammatical trees integrated in Treex === | ||
| + | * a scenario for rule-based tree transformation | ||
| + | * Ondřej Dušek' | ||
| ===== Named Entity Recognition ===== | ===== Named Entity Recognition ===== | ||
| + | === NE recognizers integrated in Treex === | ||
| + | * Jana Straková' | ||
| + | * Stanford Named Entity Recognizer for Czech | ||
| ===== Machine Translation ===== | ===== Machine Translation ===== | ||
| + | |||
| + | === MT implemented in Treex === | ||
| + | * elaborated English-> | ||
| + | * prototype of Czech-> | ||
| ===== Coreference Resolution ===== | ===== Coreference Resolution ===== | ||
| + | === Coreference resolvers integrated in Treex === | ||
| + | * simple rule-based baseline resolvers for Czech and English | ||
| + | * Michal Novák' | ||
| + | * Ngụy Giang Linh's trainable (perceptron-based) resolver | ||
| ===== Spell Checking ===== | ===== Spell Checking ===== | ||
