Addicter

The introductory page on the Addicter project is here.

This page lies in the external namespace and is intended for collaboration with people outside of ÚFAL.

via weighted HMM

Each cell gives precision/recall/F-score.
Alignment	Lex	Order	Punct	Miss
meteor+hmm	0.162/0.426/0.234	0.068/0.309/0.112	0.286/0.794/0.421	0.025/0.400/0.047
ter+hmm	0.116/0.402/0.180	0.030/0.184/0.051	0.145/0.912/0.251	0.026/0.181/0.046
gizadiag+hmm	0.186/0.515/0.273	0.040/0.215/0.067	0.297/0.836/0.438	0.039/0.238/0.067
gizainter+hmm	0.194/0.505/0.281	0.062/0.282/0.101	0.299/0.806/0.436	0.033/0.382/0.061
berkeley+hmm	0.203/0.548/0.297	0.049/0.320/0.085	0.290/0.816/0.428	0.041/0.277/0.071
czengdiag+hmm	0.190/0.517/0.278	0.073/0.457/0.126	0.291/0.841/0.432	0.039/0.238/0.067
czenginter+hmm	0.214/0.545/0.307	0.093/0.525/0.158	0.304/0.818/0.443	0.038/0.363/0.068

berkeley+czenginter+hmm	0.219/0.568/0.316	0.070/0.432/0.120	0.298/0.817/0.436	0.048/0.290/0.082
berkeley+czenginter+gizainter+hmm	0.220/0.569/0.317	0.068/0.420/0.118	0.298/0.812/0.436	0.048/0.290/0.083
berkeley+czenginter+meteor+hmm	0.220/0.569/0.317	0.070/0.440/0.121	0.295/0.810/0.433	0.048/0.290/0.083
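
The cells above can be read and cross-checked with a few lines of Python. This is only a minimal sketch: it assumes the reported F-score is the balanced F1 (harmonic mean of precision and recall), which the page does not state explicitly, and the helper names `f_score` and `parse_cell` are hypothetical. Because the table values are rounded to three decimals, a recomputed F-score can differ from the printed one in the last digit.

```python
def f_score(precision: float, recall: float) -> float:
    """Balanced F-score (harmonic mean); assumed, not confirmed by the page."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def parse_cell(cell: str):
    """Split a 'P/R/F' table cell into three floats."""
    p, r, f = (float(x) for x in cell.split("/"))
    return p, r, f


# czenginter+hmm, Lex column
p, r, f = parse_cell("0.214/0.545/0.307")
print(round(f_score(p, r), 3))  # prints 0.307
```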

try domain adaptation for word alignment, EMNLP 2011 paper
test alignment with synonym detection (requires cz_wn), i.e. separating @@lex@@ from @@disam@@
order evaluation
- a lot of background research
- currently finds misplaced items, but their shift distances are wrong
  - to fix: for every misplaced token
    - if it (and only it) were moved in the original permutation, what would be the best position?
    - evaluate by the number of intersections
comb through and comment the code
add help files
integrate with the rest of Addicter
learner's corpus
- see Anke Lüdeling, TLT9
adapt to Sara's program
alternative to reference-based evaluation: “Inconsistencies in Penn parsing”, M. Dickinson
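
The proposed fix for the shift distances under "order evaluation" can be sketched as follows. This is one possible reading of the note, not Addicter's actual code: the permutation is assumed to be a list of reference positions in hypothesis order, "intersections" is assumed to mean crossing alignment links (inversions), and all function names are hypothetical. Ties are broken in favour of the shorter move, which is also an assumption.

```python
from typing import Dict, List, Tuple


def count_intersections(perm: List[int]) -> int:
    """Number of crossing links, i.e. pairs that appear in inverted order."""
    return sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )


def best_single_move(perm: List[int], i: int) -> Tuple[int, int]:
    """Move only the token at index i; return (best_index, intersections)."""
    rest = perm[:i] + perm[i + 1:]
    best = (i, count_intersections(perm))  # leaving the token in place
    for j in range(len(perm)):
        candidate = rest[:j] + [perm[i]] + rest[j:]
        score = count_intersections(candidate)
        # Prefer fewer intersections, then the shorter move.
        if (score, abs(j - i)) < (best[1], abs(best[0] - i)):
            best = (j, score)
    return best


def shift_distances(perm: List[int]) -> Dict[int, int]:
    """Map each token index to its shift (best index minus current index)."""
    return {i: best_single_move(perm, i)[0] - i for i in range(len(perm))}


# The last token belongs at the front: moving it removes all 3 intersections.
print(best_single_move([1, 2, 3, 0], 3))  # prints (0, 0)
```

Each token is evaluated independently against the original permutation, so a token next to a badly misplaced one can also receive a small non-zero shift; the brute-force search is quadratic per token and cubic overall, which should be acceptable for sentence-length input.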
