[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision Both sides next revision
ufal:tasks [2012/01/18 15:38]
ufal
ufal:tasks [2012/01/19 12:01]
ufal
Line 5: Line 5:
  
 === Europarl tokenizer === === Europarl tokenizer ===
-  * **info:** A sample rule-based tokenizer, can use a list of prefixes which are usually followed by a dot but don't break a sentence. Distributed as a part of the Europarl tools.+  * **description:** A sample rule-based tokenizer, can use a list of prefixes which are usually followed by a dot but don't break a sentence. Distributed as a part of the Europarl tools.
   * **version:** v6 (Jan 2012)    * **version:** v6 (Jan 2012) 
   * **author:** Philipp Koehn and Josh Schroeder   * **author:** Philipp Koehn and Josh Schroeder
Line 12: Line 12:
   * **languages:** in principle applicable to all languages using space-separated words; nonbreaking prefixes available for DE, EL, EN, ES, FR, IT, PT, SV.   * **languages:** in principle applicable to all languages using space-separated words; nonbreaking prefixes available for DE, EL, EN, ES, FR, IT, PT, SV.
   * **efficiency**: NA    * **efficiency**: NA 
 +  * **reference**: 
   * **contact:**   * **contact:**
  

[ Back to the navigation ] [ Back to the content ]