Differences
This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
|
courses:rg:2012:encouraging-consistent-translation [2012/10/17 13:41] dusek |
courses:rg:2012:encouraging-consistent-translation [2012/10/23 11:04] (current) popel my remarks |
||
|---|---|---|---|
| Line 41: | Line 41: | ||
| **Choice of features** | **Choice of features** | ||
| - | * They define 3 features that are designed to be biased | + | * They define 3 features that are designed to be biased |
| * If e.g. two variants are used 2 times each, they will have roughly the same score | * If e.g. two variants are used 2 times each, they will have roughly the same score | ||
| * The BM25 function is a refined version of the [[http:// | * The BM25 function is a refined version of the [[http:// | ||
| Line 81: | Line 81: | ||
| **Performance** | **Performance** | ||
| * Adding any single feature improves the performance, | * Adding any single feature improves the performance, | ||
| - | * "Most works in MT just acchieve | + | * "Most works in MT just achieve |
| * There are no significance tests !! | * There are no significance tests !! | ||
| * Even 1.0 BLEU doesn' | * Even 1.0 BLEU doesn' | ||
| Line 94: | Line 94: | ||
| * The baseline has the same problem | * The baseline has the same problem | ||
| * The human translators use different expression in the reference for a reason (even if the meaning is roughly the same, there can be style differences etc.) | * The human translators use different expression in the reference for a reason (even if the meaning is roughly the same, there can be style differences etc.) | ||
| - | * We must be careful when we critisize | + | * We must be careful when we criticize |
| * It's always better to back up our argument by human rankings | * It's always better to back up our argument by human rankings | ||
| * Why didn't they run METEOR or other metric and left it for future work? | * Why didn't they run METEOR or other metric and left it for future work? | ||
| Line 105: | Line 105: | ||
| * It's true that choosing just document works well for news articles, but not for most of the content we wish to translate | * It's true that choosing just document works well for news articles, but not for most of the content we wish to translate | ||
| * Domain feature, topic modelling or word classes should be worth trying | * Domain feature, topic modelling or word classes should be worth trying | ||
| + | |||
| + | ===== Our conclusion ===== | ||
| + | |||
| + | Nice paper with a very good idea that probably can improve translations, | ||
| + | |||
| + | ===== Martin' | ||
| + | * The approach (without modifications) does not seem to be suitable for translating to a morphologically rich language. Different forms of the same lemma would be considered different senses (if not grouped together due to 1/2 of character being same), so the system would produce e.g. only nominatives. | ||
| + | * Also, there should be a modification for source-side words with more possible PoS. E.g. " | ||
