According to Amir, about half of the paper is just a copy/description of authors' previous work (namely Minkov et al.,2007).
Also Minkov et al.(2007) is reported as if the authors did not apply their work to MT. However, the name of the paper is “Generating complex morphology for machine translation”.
The improvement on the Treelet system (using Method 3) is quite impressive (2.5 BLEU points), but the improvement on phrasal MT (using Method 1) is smaller (0.7 BLEU points). Given the phrasal MT clearly outperforms the treelet MT system (29 vs. 36 BLEU points), the most interesting improvements are those of the phrasal MT, but there is no significance test (nor human evaluation) reported.
Method 3 (base MT trained on stems) and Method 2 (base MT trained on stems, but alignment on forms) could (and should) be applied also to the phrasal MT.
Table 1 caption mentions “(accuracy, %)”. However, the last row of the table uses different units - number of word forms. This may confuse someone.