Institute of Formal and Applied Linguistics Wiki


Differences

This shows you the differences between two versions of the page.

user:zeman:addicter [2012/05/15 14:10]
zeman Static files.
user:zeman:addicter [2012/05/16 13:41]
zeman
Line 108:
 The error classifier currently uses its own monolingual word alignment of the reference translation and the hypothesis. It is invoked as follows:
  
-<code bash>${ADDICTER}/prepare/detecterr.pl -s srcfile -r reffile -h hypfile -w workdir</code>
+<code bash>${ADDICTER}/prepare/detecter.pl -s srcfile -r reffile -h hypfile [-a alignment] -w workdir</code>
  
-and it creates the files ''workdir/tcali.txt'' and ''workdir/tcerr.txt''. The input files (src, ref and hyp) can also be gzipped.
+and it creates the files ''workdir/tcali.txt'' and ''workdir/tcerr.txt''. The input files (src, ref and hyp) can also be gzipped. Custom alignment between hypothesis and reference can be supplied. If it is not supplied, then the default aligner (''${ADDICTER}/testchamber/align-greedy.pl'') is invoked.
  
-Place the files ''tcali.txt'' and ''tcerr.txt'' in the experiment subfolder of ''$CGI'' and the error classes will be displayed during test data browsing in the viewer.
+Place the files ''tcali.txt'' and ''tcerr.txt'' in the experiment subfolder of ''$CGI'' and the error classes will be displayed during test data browsing in the viewer. The viewer can work with several alternative alignments (perhaps produced by different alignment algorithms) of the same data. For each of those alignments, you have to run ''detecter.pl'' separately.
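
The per-alignment runs described above could be scripted roughly as follows. This is only a sketch: the input file names, the alignment file ''giza.ali'', the work directory names and the ''DRY_RUN'' switch are illustrative assumptions, not part of Addicter. Using one work directory per alignment is also an assumption, made so that each run's ''tcali.txt'' and ''tcerr.txt'' do not overwrite the previous ones.

```bash
#!/usr/bin/env bash
# Sketch: call detecter.pl once per alignment, each run in its own work directory.
# All file and directory names below are hypothetical placeholders.

ADDICTER="${ADDICTER:-/path/to/addicter}"   # assumed install location
DRY_RUN="${DRY_RUN:-1}"                     # 1 = only print the command (not an Addicter option)

run_detecter() {
    # $1 = alignment file; an empty string leaves out -a, so the default aligner is used
    # $2 = work directory that will receive tcali.txt and tcerr.txt
    local alignment="$1" workdir="$2"
    local cmd=("${ADDICTER}/prepare/detecter.pl" -s src.txt.gz -r ref.txt.gz -h hyp.txt.gz)
    if [ -n "$alignment" ]; then
        cmd+=(-a "$alignment")
    fi
    cmd+=(-w "$workdir")
    if [ "$DRY_RUN" = "1" ]; then
        echo "${cmd[@]}"
    else
        mkdir -p "$workdir" && "${cmd[@]}"
    fi
}

run_detecter "" work-default        # default aligner
run_detecter giza.ali work-giza     # custom alignment (hypothetical file)
```

With ''DRY_RUN=1'' the script only prints the two command lines, which makes it easy to check the options before running the classifier for real.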
  
 ==== How to use the viewer ====
Line 127:
  
 This research has been supported by the grant of the Czech Ministry of Education no. MSM0021620838 (2010), by the grants of the Czech Science Foundation no. P406/11/1499 and P406/10/P259 and the Estonian Science Foundation target financed theme SF0180078s08 (2011).
 +
 +===== Publications =====
 +
 +  * Mark Fishel, Ondřej Bojar, Daniel Zeman, Jan Berka: //[[http://ufal.mff.cuni.cz:8080/bib/?section=publication&id=-8080356071532247643&mode=view|Automatic Translation Error Analysis]].// In: TSD 2011, Plzeň, Czechia, 2011
 +  * Daniel Zeman, Mark Fishel, Jan Berka, Ondřej Bojar: //[[http://ufal.mff.cuni.cz:8080/bib/?section=publication&id=-4951860774549764116&mode=view|Addicter: What Is Wrong with My Translations?]]// In: The Prague Bulletin of Mathematical Linguistics, vol. 96, pp. 79–88; MT Marathon, Trento, Italy, 2011
 +  * Maja Popović: //[[http://ufal.ms.mff.cuni.cz/pbml/96/art-popovic.pdf|Hjerson: An Open Source Tool for Automatic Error Classification of Machine Translation Output]].// In: The Prague Bulletin of Mathematical Linguistics, vol. 96, pp. 59–68; MT Marathon, Trento, Italy, 2011
 +  * Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović, Daniel Zeman: //[[http://ufal.mff.cuni.cz:8080/bib/?section=publication&id=-6688512557324036032&mode=view|Automatic MT Error Analysis: Hjerson Helping Addicter]].// In: Proceedings of LREC 2012, İstanbul, Turkey, 2012
  
