Differences
This shows you the differences between two versions of the page.
user:ptacek:dbmt [2007/05/07 19:32] ptacek |
user:ptacek:dbmt [2007/05/07 19:50] (current) ptacek |
||
---|---|---|---|
Line 1: | Line 1: | ||
====== DBMT ====== | ====== DBMT ====== | ||
- | Czech-English Dependency-based Machine Translation | + | //Czech-English Dependency-based Machine Translation |
+ | //PCEDT BLEU dtest/ | ||
- | it is the Magenta pipeline, except that generation is rule-based instead of statistical tree-to-tree transduction followed by an LM | + | it is the Magenta pipeline, except that generation is rule-based |
+ | |||
+ | on the Czech translation of the Penn Treebank | ||
+ | |||
+ | - tokenization and tagging | ||
+ | - parsing into a_trees [Hajic 98, Charniak 99] | ||
+ | - afun assignment [ZZ 02] | ||
+ | - a_tree -> t_tree [Bohmova 01] | ||
+ | - func assignment with C4.5 [ZZ 02] | ||
+ | - dictionary built with GIZA++ [Och and Ney 02] // one most probable translation, | ||
+ | - generator | ||
+ | |||
+ | ====== Generator ====== | ||
+ | receives TGTS without tfa; and what about coreference :?: | ||
== 1. determining contextual boundness == | == 1. determining contextual boundness == | ||
Line 24: | Line 38: | ||
== 5. morphology == | == 5. morphology == | ||
probably not morpha :!: | probably not morpha :!: | ||
- | they look it up in a table | + | they look it up in a table |
^ word form ^ morphological tag ^ lemma ^ | ^ word form ^ morphological tag ^ lemma ^ | ||
+ | if the form is not found, simple rules are applied | ||
+ | also vocalization for the indefinite article | ||
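
The morphology step above (table lookup, fallback to simple rules, a/an choice) could be sketched roughly as follows. This is only an illustration, not the generator's actual code: the names `FORM_TABLE`, `fallback_form`, `word_form` and `indefinite_article` are hypothetical, the Penn Treebank style tags are an assumption, the table entries are made up, and the article rule looks at the first letter only rather than at pronunciation.

```python
# Hypothetical lookup table: (lemma, morphological tag) -> word form.
# Entries are illustrative; the real table is not described in these notes.
FORM_TABLE = {
    ("go", "VBD"): "went",
    ("child", "NNS"): "children",
    ("be", "VBZ"): "is",
}

VOWELS = set("aeiou")


def fallback_form(lemma, tag):
    """Simple suffix rules, used only when the table has no entry."""
    if tag in ("NNS", "VBZ"):
        if lemma.endswith(("s", "x", "z", "ch", "sh")):
            return lemma + "es"
        if lemma.endswith("y") and len(lemma) > 1 and lemma[-2] not in VOWELS:
            return lemma[:-1] + "ies"
        return lemma + "s"
    if tag in ("VBD", "VBN"):
        return lemma + ("d" if lemma.endswith("e") else "ed")
    if tag == "VBG":
        if lemma.endswith("e") and not lemma.endswith("ee"):
            return lemma[:-1] + "ing"
        return lemma + "ing"
    # Unknown tag: return the lemma unchanged.
    return lemma


def word_form(lemma, tag):
    """Look the form up in the table first, then fall back to the rules."""
    form = FORM_TABLE.get((lemma, tag))
    return form if form is not None else fallback_form(lemma, tag)


def indefinite_article(next_word):
    """Choose 'a' or 'an' from the first letter of the following word.

    Orthographic approximation only: it gets 'an hour' and 'a university' wrong.
    """
    return "an" if next_word[:1].lower() in VOWELS else "a"
```

For example, `word_form("go", "VBD")` is answered from the table, while `word_form("walk", "VBD")` goes through the suffix rules.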