Differences

This shows you the differences between two versions of the page.

--- external:addicter [2011/05/16 07:59]
mphi
+++ external:addicter [2011/05/22 09:27]
mphi
@@ Line 4: / Line 4: @@
 This page lies in the external name space and is intended for collaboration with people outside of ÚFAL.
+==== TODOs ====
+  * test alignment with synonym detection (cz_wn required) = separating ''lex::'' and ''disam::''
+  * order evaluation
+    * a lot of background research
+    * currently finds misplaced items, but their shift distances are off
+      * to fix -- for every misplaced token
+        * if it (and only it) were to be moved in the original permutation, what would be the best place?
+        * evaluate with nr. of intersections
+  * try domain adaptation for word alignment, EMNLP 2011 paper
+  * comb and comment the code
+  * add help files
+  * integrate with the rest of Addicter
+  * approach applicable to learner's corpora
+    * see Anne Lüdelig, TLT9
+  * adapt Addicter to Sara's program
+  * alternative to reference-based evaluation: "Inconsistencies in Penn parsing", M. Dickinson
 ==== Word Alignment -- Progress and Results ====
@@ Line 11: / Line 28: @@
 === Alternative model comparison ===
+hmm = lightweight direct alignment method (in our ACL/TSD article)
+gizainter = GIZA++, intersection -- applied to hypotheses+references directly
+gizadiag = GIZA++, grow-diag -- applied to hypotheses+references directly
+czenginter = align source+CzEng to reference+CzEng, and source+CzEng to hypotheses+CzEng with GIZA++, intersection, extract hypothesis-reference alignments from there ("Dan's method")
+czengdiag = same, but with GIZA++ grow-diag
 |           ^     Precision/Recall/F-score:     ^^^^
 |           ^          Lex         ^         Order        ^         Punct        ^         Miss         ^
@@ Line 19: / Line 43: @@
 ^ gizainter |0.170/0.483/**0.252** |0.049/0.137/**0.072** |0.284/0.878/**0.429** |0.029/0.409/**0.054** |
 ^ gizadiag* |0.183/0.512/**0.270** |0.044/0.250/**0.075** |0.285/0.784/**0.417** |0.038/0.224/**0.065** |
+^ czengdiag* |0.187/0.514/**0.275** |0.069/0.455/**0.120** |0.230/0.883/**0.365** |0.035/0.234/**0.061** |
 ^ berkeley* |0.200/0.540/**0.291** |0.050/0.330/**0.087** |0.292/0.844/**0.434** |0.039/0.267/**0.068** |
+^ czenginter |0.197/0.543/**0.290** |0.108/0.475/**0.176** |0.233/0.926/**0.372** |0.032/0.402/**0.060** |
-=== Explicit wrong lex choice detection ===
+* non-1-to-1 alignments, converted to 1-to-1 via "align-hmm.pl -x -a ..."
-  * "Dan's method"
-  * align input+czeng to reference+czeng and input+czeng to hypotheses+czeng
-  * extract hypothesis-to-reference alignments from there
-|            ^      Precision/Recall/F-score:      ^^^^
-|            ^     Lex     ^     Order     ^     Punct     ^     Miss     ^
-^ czengdiag* |0.187/0.514/**0.275** |0.069/0.455/**0.120** |0.230/0.883/**0.365** |0.035/0.234/**0.061** |
-^ czenginter |0.197/0.543/**0.290** |0.108/0.475/**0.176** |0.233/0.926/**0.372** |0.032/0.402/**0.060** |
 === Alignment combinations ===
@@ Line 49: / Line 67: @@
 ^ berk+czengint+meteor+gizaint+hmm |0.221/0.571/**0.318** |0.068/0.424/**0.118** |0.298/0.808/**0.436** |0.049/0.292/**0.084** |
-==== TODOs ====
-  * test alignment with synonym detection (cz_wn required) = separating ''lex::'' and ''disam::''
-  * order evaluation
-    * a lot of background research
-    * currently finds misplaced items, but their shift distances are off
-      * to fix -- for every misplaced token
-        * if it (and only it) were to be moved in the original permutation, what would be the best place?
-        * evaluate with nr. of intersections
-  * try domain adaptation for word alignment, EMNLP 2011 paper
-  * comb and comment the code
-  * add help files
-  * integrate with the rest of Addicter
-  * approach applicable to learner's corpora
-    * see Anne Lüdelig, TLT9
-  * adapt Addicter to Sara's program
-  * alternative to reference-based evaluation: "Inconsistencies in Penn parsing", M. Dickinson

[ Back to the navigation ] [ Back to the content ]

Institute of Formal and Applied Linguistics Wiki

Differences