====== Statistical Post-Editing for a Statistical MT System ======
//Hanna Béchara, Yanjun Ma, Josef van Genabith//

MT Summit 2011
[[http://
Presented by Rudolf Rosa

Report by Jindřich Helcl
===== Introduction =====
The article is about statistical post-editing (SPE) applied to the output of a statistical machine translation system. Its most interesting claim is that the authors achieved an improvement of about 2 BLEU points by pipelining two statistical MT systems, an approach that had until then been considered useless.
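The pipelining idea can be sketched as plain function composition: the SPE stage is itself an SMT system, trained to map first-stage MT output to reference translations. The functions below are illustrative toys standing in for the two real phrase-based systems, not the paper's implementation.

```python
# Toy sketch of pipelining two statistical MT systems
# (illustrative only; in the paper both stages are real SMT systems).

def first_stage_mt(source_sentence: str) -> str:
    # Hypothetical baseline MT system with a systematic error:
    # it always emits American spelling.
    return source_sentence.replace("colour", "color")

def spe_stage(mt_output: str) -> str:
    # Hypothetical statistical post-editing stage, trained on
    # (MT output, reference) pairs; here it has learned to undo
    # the first stage's systematic error.
    return mt_output.replace("color", "colour")

def translate(source_sentence: str) -> str:
    # The pipeline is plain composition: source -> MT -> post-edited MT.
    return spe_stage(first_stage_mt(source_sentence))

print(translate("the colour red"))
```

The point of the sketch is only that the second stage can correct systematic errors of the first, which is where the reported BLEU gain would come from.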
  * **Enhancements:**
    * Contextual SPE: the translated words were created by concatenating the English word and its translation, separated by a hash sign, into one resulting word. This new dataset is called **E#**.
    * Next, they stripped off the #-postfixes of non-translated (OOV) words.
    * Then, they computed an alignment between the source text and the translation and used the contextual enhancement only where the alignment weight was above some threshold.
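The contextual construction above can be sketched as follows; the function name, the data shapes, and the exact token order (English source word before the translation) are assumptions for illustration, not taken from the paper.

```python
# Sketch of the contextual-SPE input construction described above.
# Each word of the first-stage MT output is glued to its aligned
# English source word with '#'; words the first stage left
# untranslated (OOV) keep only the plain word, i.e. their
# #-context is stripped. (All names here are illustrative.)

def contextualize(source_tokens, mt_tokens, alignment, oov):
    """alignment maps an MT-token index to a source-token index;
    oov holds the MT-token indices that were left untranslated."""
    out = []
    for i, mt_word in enumerate(mt_tokens):
        if i in oov or i not in alignment:
            out.append(mt_word)  # no #-postfix for OOV words
        else:
            out.append(source_tokens[alignment[i]] + "#" + mt_word)
    return out

# English source, first-stage MT output into French, word alignment:
print(contextualize(["the", "blue", "house"],
                    ["la", "house", "bleue"],
                    {0: 0, 2: 1},   # la <- the, bleue <- blue
                    oov={1}))       # "house" was not translated
# -> ['the#la', 'house', 'blue#bleue']
```

The second-stage SPE system is then trained on these hash-concatenated tokens, so it can condition its edits on the source word as well as the MT output.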
===== Discussion =====
The following topics were discussed at the RG meeting:
  * The main possible flaw of the experiment was considered to be the small size of the data (only 55k sentences). On the other hand, the data came from a translation memory.
  * In the paper, the authors state that they use 10-fold cross-validation.
  * We found it pointless for the authors to present explicit results of Contextual SPE without removing the #-postfixes, since it was obvious that they should be removed right away. This simple objection led us to the idea of removing the #-postfixes even before the OOV token is put into the language model, which could bring some improvement.
  * When the authors wrote about Contextual SPE with thresholding,
===== Conclusion =====
Although the structure of the paper was often criticized and possible flaws were found, the article was considered well-readable and simple enough to serve as the opening article of this semester's RG.