Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
external:addicter [2011/05/09 13:43] zeman created |
external:addicter [2011/05/22 09:31] (current) mphi |
||
---|---|---|---|
Line 4: | Line 4: | ||
This page lies in the external name space and is intended for collaboration with people outside of ÚFAL. | This page lies in the external name space and is intended for collaboration with people outside of ÚFAL. | ||
+ | |||
+ | ==== TODOs ==== | ||
+ | * test alignment with synonym detection (cz_wn required) = separating '' | ||
+ | * try misplaced phrase detection | ||
+ | * parse the reference, project the parse onto the hypothesis via word alignment, and extract phrases from there | ||
+ | * group adjacent words aligned to the same word? (imitation) | ||
+ | * order evaluation | ||
+ | * currently finds misplaced items, but their shift distances are off | ||
+ | * not important for '' | ||
+ | * to fix -- for every misplaced token | ||
+ | * if it (and only it) were to be moved in the original permutation, | ||
+ | * evaluate using the number of intersections | ||
+ | * try domain adaptation for word alignment with the "via source" | ||
+ | * technical | ||
+ | * clean up and comment the code | ||
+ | * add help files | ||
+ | * integrate with the rest of Addicter | ||
+ | * approach applicable to learner' | ||
+ | * see Anke Lüdeling, TLT9 | ||
+ | * try Blast (Sara' | ||
+ | * alternative to reference-based evaluation: " | ||
+ | |||
+ | ==== Word Alignment -- Progress and Results ==== | ||
+ | |||
+ | === Latest best results === | ||
+ | [[http:// | ||
+ | |||
+ | === Alternative model comparison === | ||
+ | |||
+ | hmm = lightweight direct alignment method (in our ACL/TSD article) | ||
+ | gizainter = GIZA++, intersection -- applied to hypotheses+references directly | ||
+ | gizadiag = GIZA++, grow-diag -- applied to hypotheses+references directly | ||
+ | czenginter = align source+CzEng to reference+CzEng, | ||
+ | czengdiag = same, but with GIZA++ grow-diag | ||
+ | |||
+ | | | ||
+ | | | ||
+ | ^ ter* |0.106/ | ||
+ | ^ meteor | ||
+ | ^ hmm | ||
+ | ^ lcs | ||
+ | ^ gizainter |0.170/ | ||
+ | ^ gizadiag* |0.183/ | ||
+ | ^ czengdiag* |0.187/ | ||
+ | ^ berkeley* |0.200/ | ||
+ | ^ czenginter |0.197/ | ||
+ | |||
+ | * non-1-to-1 alignments, converted to 1-to-1 via " | ||
+ | |||
+ | === Alignment combinations === | ||
+ | via weighted HMM | ||
+ | |||
+ | | | ||
+ | | | ||
+ | ^ ter+hmm | ||
+ | ^ meteor+hmm | ||
+ | ^ gizadiag+hmm | ||
+ | ^ gizainter+hmm | ||
+ | ^ berkeley+hmm | ||
+ | ^ czengdiag+hmm | ||
+ | ^ czenginter+hmm | ||
+ | | ||||| | ||
+ | ^ berk+czengint+hmm |0.219/ | ||
+ | ^ berk+czengint+gizaint+hmm |0.220/ | ||
+ | ^ berk+czengint+meteor+hmm |0.220/ | ||
+ | ^ berk+czengint+meteor+gizaint+hmm |0.221/ | ||
+ | |||
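
The order-evaluation TODO above (move one misplaced token at a time and evaluate by the number of intersections) could be sketched roughly as follows. This is only an illustration and not Addicter's code: it assumes a 1-to-1 alignment given as a permutation of reference positions in hypothesis order, it reads "intersections" as crossing alignment links (inversions), and the function names and the tie-breaking rule (prefer the shortest shift) are hypothetical.

```python
def count_intersections(perm):
    """Count crossing alignment links, i.e. inversions in the permutation."""
    return sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )


def crossings_per_token(perm):
    """For each hypothesis position, count the links that its own link crosses."""
    return [
        sum(
            1
            for j, other in enumerate(perm)
            if (j < i and other > value) or (j > i and other < value)
        )
        for i, value in enumerate(perm)
    ]


def best_single_move(perm, i):
    """Move only the token at position i and leave all others in place.

    Returns (shift, remaining_intersections) for the target position that
    leaves the fewest intersections; ties go to the shortest shift
    (an assumption, the original note does not specify this).
    """
    rest = perm[:i] + perm[i + 1:]
    best_score, best_shift = None, 0
    for target in range(len(perm)):
        candidate = rest[:target] + [perm[i]] + rest[target:]
        score = (count_intersections(candidate), abs(target - i))
        if best_score is None or score < best_score:
            best_score, best_shift = score, target - i
    return best_shift, best_score[0]


if __name__ == "__main__":
    # Hypothetical example: the token at hypothesis position 1 is aligned
    # to reference position 3, two places too early.
    perm = [0, 3, 1, 2, 4]
    print(count_intersections(perm))
    print(crossings_per_token(perm))
    print(best_single_move(perm, 1))
```

For the permutation `[0, 3, 1, 2, 4]` there are two intersections, both caused by the token aligned to reference position 3; moving only that token two places to the right removes them. Trying every target position is quadratic per token, which is acceptable for sentence-length input but is a simplification chosen for readability.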