[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
khresmoi:data_notes [2012/02/29 14:19]
hlavacova
khresmoi:data_notes [2012/02/29 14:25]
hlavacova
Line 31: Line 31:
  
 ===== ELDA ===== ===== ELDA =====
- 
 **ELRA-E0020, CESTA Evaluation Package**  **ELRA-E0020, CESTA Evaluation Package** 
  
Line 57: Line 56:
 en-GB → fr-FR: 13,033,584 slov en-GB → fr-FR: 13,033,584 slov
 fr-FR → en-GB 483,610 slov fr-FR → en-GB 483,610 slov
 +en-GB → de-DE: 412,406
 +de-DE → en-GB: 6,385,051 
  
 Staženo, TMX format, kvalita zatím neověřena (PP)  Staženo, TMX format, kvalita zatím neověřena (PP) 
Line 77: Line 78:
 Vyslán dotaz, zda už to někdo nestáhnul Vyslán dotaz, zda už to někdo nestáhnul
  
-===== korpus Europarl =====+===== Europarl =====
 8-) 8-)
 http://www.statmt.org/europarl/ http://www.statmt.org/europarl/
Line 84: Line 85:
   1825077  47667366 314658361 europarl-v6.fr-en.fr   1825077  47667366 314658361 europarl-v6.fr-en.fr
 Stažený nástroj na alignment. Stažený nástroj na alignment.
 +
 +===== much.more =====
 +8-)
 +Alignované abstrakty medicínských článů, staženo, >1 Mw
 +Volitelně anotace:
 +Automatic (!) annotation includes: Part-of-Speech; Morphology (inflection and decomposition); Chunks; Semantic Classes (UMLS: Unified Medical Language System, MeSH: Medical Subject Headings, EuroWordNet); Semantic Relations from UMLS.
 +
  
 ===== HON certified web sites ===== ===== HON certified web sites =====

[ Back to the navigation ] [ Back to the content ]