Differences
This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
|
user:ptacek:dbmt [2007/05/07 19:32] ptacek |
user:ptacek:dbmt [2007/05/07 19:50] (current) ptacek |
||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== DBMT ====== | ====== DBMT ====== | ||
| - | Czech-English Dependency-based Machine Translation | + | //Czech-English Dependency-based Machine Translation |
| + | //PCEDT BLEU dtest/ | ||
| - | je to Magenta pipeline, jen generovani je rule-based misto statistikeho tree-to-tree transducing a pak LM | + | je to Magenta pipeline, jen generovani je rule-based |
| + | |||
| + | na českém prekladu Penn Treebanku | ||
| + | |||
| + | - tokenizace a tagging | ||
| + | - parsing do a_trees [Hajic 98, Charniak 99] | ||
| + | - afun assigment [ZZ 02] | ||
| + | - a_tree -> t_tree [Bohmova 01] | ||
| + | - func assigment C4.5 [ZZ 02] | ||
| + | - slovnik pomoci GIZA++ [Och and Nay 02] // one most probable translation, | ||
| + | - generator | ||
| + | |||
| + | ====== Generator ====== | ||
| + | dostane TGTS bez tfa, a co koreference :?: | ||
| == 1. determining contextual boundness == | == 1. determining contextual boundness == | ||
| Line 24: | Line 38: | ||
| == 5. morphology == | == 5. morphology == | ||
| asi ne morpha :!: | asi ne morpha :!: | ||
| - | hledaji v tabulce ^ word form ^ morphological tag ^ lemma ^ | + | hledaji v tabulce |
| + | ^ word form ^ morphological tag ^ lemma ^ | ||
| + | kdyz nenajdou tak somple rules | ||
| + | taky vokalizace pro indefinite article | ||
