Addicter

The introductory page on the Addicter project is here.

This page lies in the external name space and is intended for collaboration with people outside of ÚFAL.

via weighed HMM

	Precision/Recall/F-score:
	Lex	Order	Punct	Miss
ter+hmm	0.116/0.402/0.180	0.030/0.184/0.051	0.145/0.912/0.251	0.026/0.181/0.046
meteor+hmm	0.162/0.426/0.234	0.068/0.309/0.112	0.286/0.794/0.421	0.025/0.400/0.047
gizadiag+hmm	0.186/0.515/0.273	0.040/0.215/0.067	0.297/0.836/0.438	0.039/0.238/0.067
gizainter+hmm	0.194/0.505/0.281	0.062/0.282/0.101	0.299/0.806/0.436	0.033/0.382/0.061
berkeley+hmm	0.203/0.548/0.297	0.049/0.320/0.085	0.290/0.816/0.428	0.041/0.277/0.071
czengdiag+hmm	0.190/0.517/0.278	0.073/0.457/0.126	0.291/0.841/0.432	0.039/0.238/0.067
czenginter+hmm	0.214/0.545/0.307	0.093/0.525/0.158	0.304/0.818/0.443	0.038/0.363/0.068

berk+czengint+hmm	0.219/0.568/0.316	0.070/0.432/0.120	0.298/0.817/0.436	0.048/0.290/0.082
berk+czengint+gizaint+hmm	0.220/0.569/0.317	0.068/0.420/0.118	0.298/0.812/0.436	0.048/0.290/0.083
berk+czengint+meteor+hmm	0.220/0.569/0.317	0.070/0.440/0.121	0.295/0.810/0.433	0.048/0.290/0.083
berk+czengint+meteor+gizaint+hmm	0.221/0.571/0.318	0.068/0.424/0.118	0.298/0.808/0.436	0.049/0.292/0.084

test alignment with synonym detection (cz_wn required) = separating lex:: and disam::
order evaluation
- a lot of background research
- currently finds misplaced items, but their shift distances are off
  - to fix – for every misplaced token
    - if it (and only it) were to be moved in the original permutation, what would be the best place?
    - evaluate with nr. of intersections
try domain adaptation for word alignment, EMNLP 2011 paper
comb and comment the code
add help files
integrate with the rest of Addicter
approach applicable to learner's corpora
- see Anne Lüdelig, TLT9
adapt Addicter to Sara's program
alternative to reference-based evaluation: “Inconsistencies in Penn parsing”, M. Dickinson

Institute of Formal and Applied Linguistics Wiki