[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

This is an old revision of the document!


DBMT

Czech-English Dependency-based Machine Translation

je to Magenta pipeline, jen generovani je rule-based misto statistikeho tree-to-tree transducing a pak LM

1. determining contextual boundness

povazuji v CZ od slovesa vlevo jako CB → definite article
od slovesa vpravo nezapojene → indefinite article

2. reordering of constituents

podle CB se z ACT|PAT|ADDR vyberou Sb
declarative sentence: CB adjuncts + Sb + V + direct/indirect Obj + UB adjuncts

3. generation of verb forms

pasivum/aktivum se dela podle Sb funktoru

4. insertion of prepositions and articles

preps: podle české a podle EN nounu

articles: definite při postmodified NP, premodified by superlative or ordinal num
article prevented: uncountable + proper nouns nebo predetermination by possessive a demonstrative pronouns

5. morphology

asi ne morpha :!:
hledaji v tabulce

word form morphological tag lemma

[ Back to the navigation ] [ Back to the content ]