Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision Next revision Both sides next revision | ||
ufal:tasks [2012/01/18 12:59] ufal vytvořeno |
ufal:tasks [2012/01/18 14:49] ufal |
||
---|---|---|---|
Line 1: | Line 1: | ||
====== Overview of NLP/CL tools available at UFAL ====== | ====== Overview of NLP/CL tools available at UFAL ====== | ||
- | Tokenization | + | ===== Tokenization |
- | Language Identification | + | Segmentation of text into tokens (words, punctuation marks, etc.). For languages using space-separated words (English. Czech, etc), the taks is relatively easy. For other languages (Chinese, Japanese, etc.) the task is much more difficult. |
- | Sentence Segmentation | + | |
- | Morphological Segmentation | + | === Europarl tokenizer === |
- | Morphological Analysis | + | * **info:** A sample tokenizer, distributed as a part of the Europarl tools |
- | Part-of-Speech Tagging | + | * **version: |
- | Lemmatization | + | * **author:** Philipp Koehn and Josh Schroeder |
- | Analytical Parsing | + | * **licence: |
- | Tectogrammatical Parsing | + | * **url:** http:// |
- | Named Entity Recognition | + | * **languages: |
- | Machine Translation | + | * **efficiency**: |
- | Coreference resolution | + | * **contact: |
- | Spell Checking | + | |
- | Text Similarity | + | ===== Language Identification |
- | Recasing | + | |
- | Rekonstrukce diakritiky | + | ===== Sentence Segmentation |
+ | |||
+ | ===== Morphological Segmentation | ||
+ | |||
+ | ===== Morphological Analysis | ||
+ | |||
+ | ===== Part-of-Speech Tagging | ||
+ | |||
+ | ===== Lemmatization | ||
+ | |||
+ | ===== Analytical Parsing | ||
+ | |||
+ | ===== Tectogrammatical Parsing | ||
+ | |||
+ | ===== Named Entity Recognition | ||
+ | |||
+ | ===== Machine Translation | ||
+ | |||
+ | ===== Coreference resolution | ||
+ | |||
+ | ===== Spell Checking | ||
+ | |||
+ | ===== Text Similarity | ||
+ | |||
+ | ===== Recasing | ||
+ | |||
+ | ===== Rekonstrukce diakritiky | ||