Differences
This shows you the differences between two versions of the page.
external:addicter [2011/05/16 07:08] mphi |
external:addicter [2011/05/22 09:27] mphi |
||
---|---|---|---|
Line 4: | Line 4: | ||
This page lies in the external name space and is intended for collaboration with people outside of ÚFAL. | This page lies in the external name space and is intended for collaboration with people outside of ÚFAL. | ||
- | |||
- | ==== Word Alignment -- Progress and Results ==== | ||
- | |||
- | === Alternative model comparison === | ||
- | | ^ Precision/ | ||
- | | ^ Lex ^ Order ^ Punct ^ Miss ^ | ||
- | ^ meteor | ||
- | ^ ter* ^0.106/ | ||
- | ^ hmm | ||
- | ^ lcs | ||
- | ^ gizadiag* ^0.183/ | ||
- | ^ gizainter ^0.170/ | ||
- | ^ berkeley* ^0.200/ | ||
- | |||
- | === Explicit wrong lex choice detection === | ||
- | * align input+czeng to reference+czeng and input+czeng to hypotheses+czeng | ||
- | * extract hypothesis-to-reference alignments from there | ||
- | | | Precision/ | ||
- | | | Lex | Order | Punct | Miss | ||
- | | czengdiag* |0.187/ | ||
- | | czenginter |0.197/ | ||
- | |||
- | === Alignment combinations === | ||
- | via weighted HMM | ||
- | |||
- | | | Precision/ | ||
- | | | Lex | Order | Punct | Miss | ||
- | | meteor+hmm |0.162/ | ||
- | | ter+hmm |0.116/ | ||
- | | gizadiag+hmm |0.186/ | ||
- | | gizainter+hmm |0.194/ | ||
- | | berkeley+hmm |0.203/ | ||
- | | czengdiag+hmm |0.190/ | ||
- | | czenginter+hmm |0.214/ | ||
- | | ||||| | ||
- | | berkeley+czenginter+hmm |0.219/ | ||
- | | berkeley+czenginter+gizainter+hmm |0.220/ | ||
- | | berkeley+czenginter+meteor+hmm |0.220/ | ||
==== TODOs ==== | ==== TODOs ==== | ||
- | | + | * test alignment with synonym detection (cz_wn required) = separating |
- | | + | |
* order evaluation | * order evaluation | ||
* a lot of background research | * a lot of background research | ||
Line 52: | Line 13: | ||
* if it (and only it) were to be moved in the original permutation, | * if it (and only it) were to be moved in the original permutation, | ||
* evaluate with nr. of intersections | * evaluate with nr. of intersections | ||
+ | * try domain adaptation for word alignment, EMNLP 2011 paper | ||
* comb and comment the code | * comb and comment the code | ||
* add help files | * add help files | ||
* integrate with the rest of Addicter | * integrate with the rest of Addicter | ||
- | * learner' | + | * approach applicable to learner' |
* see Anke Lüdeling, TLT9 | * see Anke Lüdeling, TLT9 | ||
- | * adapt to Sara's program | + | * adapt Addicter |
* alternative to reference-based evaluation: " | * alternative to reference-based evaluation: " | ||
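The order-evaluation item above mentions evaluating "with nr. of intersections". One possible reading, and it is only an assumption here, is the number of crossing links in a 1-to-1 hypothesis-to-reference alignment. A minimal sketch of that count follows; the function name `count_crossings` and the toy alignments are hypothetical and do not come from Addicter.

```python
from itertools import combinations


def count_crossings(links):
    """Count pairs of alignment links that cross each other.

    links: iterable of (hyp_index, ref_index) pairs, assumed to be 1-to-1.
    After sorting by hypothesis position, two links cross exactly when
    their reference positions appear in the opposite order.
    """
    return sum(
        1
        for (h1, r1), (h2, r2) in combinations(sorted(links), 2)
        if r1 > r2
    )


# Toy alignments (made up for illustration, not experimental data).
monotone = [(0, 0), (1, 1), (2, 2)]
one_moved = [(0, 1), (1, 2), (2, 0)]  # the last hypothesis token belongs first

print(count_crossings(monotone), count_crossings(one_moved))
```

Under this reading, a monotone alignment has no crossings, and a single displaced token crosses every link it jumps over.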
+ | |||
+ | ==== Word Alignment -- Progress and Results ==== | ||
+ | |||
+ | === Latest best results === | ||
+ | [[http:// | ||
+ | |||
+ | === Alternative model comparison === | ||
+ | |||
+ | hmm = lightweight direct alignment method (in our ACL/TSD article) | ||
+ | gizainter = GIZA++, intersection -- applied to hypotheses+references directly | ||
+ | gizadiag = GIZA++, grow-diag -- applied to hypotheses+references directly | ||
+ | czenginter = align source+CzEng to reference+CzEng, | ||
+ | czengdiag = same, but with GIZA++ grow-diag | ||
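The "intersection" variants above keep only those links that both directional alignment runs agree on. As a rough illustration of that symmetrization step (not the actual GIZA++ or Addicter code), the sketch below intersects two directional link sets; the function name `intersect_alignments` and the toy links are assumptions made for this example.

```python
def intersect_alignments(forward, backward):
    """Keep only links proposed by both directional alignments.

    forward:  set of (hyp_index, ref_index) links from the hyp->ref run
    backward: set of (ref_index, hyp_index) links from the ref->hyp run
    """
    # Flip the backward links so both sets use (hyp_index, ref_index).
    flipped = {(h, r) for (r, h) in backward}
    return sorted(forward & flipped)


# Toy example with made-up links (not data from the experiments).
hyp_to_ref = {(0, 0), (1, 2), (2, 1), (3, 3)}
ref_to_hyp = {(0, 0), (2, 1), (1, 2), (3, 2)}

links = intersect_alignments(hyp_to_ref, ref_to_hyp)
print(links)
```

If each directional run links every word to at most one word on the other side, the intersection is 1-to-1 by construction, which would fit the intersection variants being listed without the asterisk. Grow-diag starts from this intersection and adds neighbouring links from the union, so it can produce non-1-to-1 alignments.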
+ | |||
+ | | | ||
+ | | | ||
+ | ^ ter* |0.106/ | ||
+ | ^ meteor | ||
+ | ^ hmm | ||
+ | ^ lcs | ||
+ | ^ gizainter |0.170/ | ||
+ | ^ gizadiag* |0.183/ | ||
+ | ^ czengdiag* |0.187/ | ||
+ | ^ berkeley* |0.200/ | ||
+ | ^ czenginter |0.197/ | ||
+ | |||
+ | * non-1-to-1 alignments, converted to 1-to-1 via " | ||
+ | |||
+ | === Alignment combinations === | ||
+ | via weighted HMM | ||
+ | |||
+ | | | ||
+ | | | ||
+ | ^ ter+hmm | ||
+ | ^ meteor+hmm | ||
+ | ^ gizadiag+hmm | ||
+ | ^ gizainter+hmm | ||
+ | ^ berkeley+hmm | ||
+ | ^ czengdiag+hmm | ||
+ | ^ czenginter+hmm | ||
+ | | ||||| | ||
+ | ^ berk+czengint+hmm |0.219/ | ||
+ | ^ berk+czengint+gizaint+hmm |0.220/ | ||
+ | ^ berk+czengint+meteor+hmm |0.220/ | ||
+ | ^ berk+czengint+meteor+gizaint+hmm |0.221/ | ||
+ | |||