Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
courses:rg:2012:meant [2012/11/13 00:07] rosa spellcheck |
courses:rg:2012:meant [2012/11/13 16:25] (current) popel |
||
---|---|---|---|
Line 31: | Line 31: | ||
The group discussed whether HMEANT evaluations are really faster than HTER annotations, | The group discussed whether HMEANT evaluations are really faster than HTER annotations, | ||
- | ==== Question 3: What does the set J contain in the //C_precision// formula? ==== | + | ==== Question 3: What does the set J contain in the C_precision formula? ==== |
The answer is that it contains the arguments of the predicate. It actually contains all // | The answer is that it contains the arguments of the predicate. It actually contains all // | ||
Line 45: | Line 45: | ||
For MT2, // | For MT2, // | ||
- | For MT3, the predicates do not match, and therefore no arguments are taken into account. Martin and Ruda agreed that most probably not even a partial match of predicates can be annotated, as there is no support for such annotation in the formulas, which Martin suggested to be a possible flaw of the method. | + | For MT3, the predicates do not match, and therefore no arguments are taken into account |
Karel Bílek also noted that it is hard to annotate semantics on incorrect sentences, which is not mentioned in the paper. | Karel Bílek also noted that it is hard to annotate semantics on incorrect sentences, which is not mentioned in the paper. | ||
===== 4 Meta-evaluation methodology ===== | ===== 4 Meta-evaluation methodology ===== | ||
- | Here, we reminded the difference between Kendall' | + | Here, we reminded the difference between Kendall' |
- | Martin also remarks | + | Martin also remarked |
===== 6 Experiment: Monolinguals vs. bilinguals ===== | ===== 6 Experiment: Monolinguals vs. bilinguals ===== | ||
Line 62: | Line 62: | ||
For the rest of the session, Martin took the lead to express some more objections to the paper. The group agreed with the objections, and even added some more. | For the rest of the session, Martin took the lead to express some more objections to the paper. The group agreed with the objections, and even added some more. | ||
- | Table 3 seems to represent the main results of the paper. | + | Table 3 seems to represent the main results of the paper.It is shocking that the authors used **only 40 sentences**; |
- | It is shocking that the authors used **only 40 sentences**; | + | The grid search they use to tune the parameters means to "try everything and find the best-correlating parameters" |
- | The grid search they use to tune the parameters means to "try everything and find the best-correlating parameters" | + | |
- | They ran the grid search optimization on the 40 sentences they have, but then they evaluated HMEANT on the same data. | + | |
- | The group agreed that such evaluation is completely flawed and it is not clear why it was performed and included in the paper. | + | |
Karel Bílek also notes that it is quite ridiculous to state the precision to 4 decimal digits when only 40 sentences are used. | Karel Bílek also notes that it is quite ridiculous to state the precision to 4 decimal digits when only 40 sentences are used. | ||