
Institute of Formal and Applied Linguistics Wiki



Differences

This shows you the differences between two versions of the page.

courses:rg:2012:sigtest-mt [2012/11/12 17:36]
tamchyna
courses:rg:2012:sigtest-mt [2012/11/12 17:43] (current)
tamchyna
Line 58:
 Answering Question 4: A is better with 97% confidence (and at least as good with 98% confidence).
  
-We don't think this corresponds to confidence -- it's just the proportion of times that A beat B, a kind of ML estimate.
+We don't think this corresponds to confidence -- it's just the proportion of times that A beat B, a kind of ML estimate. In fact, we could use the differences between systems as input for a paired t-test (and get a true confidence interval).
  
 Notes on p-value: concepts of confidence intervals/significance testing were developed independently by different researchers, then they were somehow merged. P-value is often misunderstood and misused.
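
The paired t-test mentioned in the added sentence could be sketched roughly as follows. This is only an illustration under assumptions, not the method used in the reading group: the function names `paired_t_test` and `t_upper_tail` are hypothetical, the scores are made-up numbers, and the p-value is obtained by numerically integrating the Student's t density with the standard library rather than by calling a statistics package. It also assumes the per-sample differences are independent and roughly normal, which may not hold for overlapping resampled test sets.

```python
import math
from statistics import mean, stdev


def t_upper_tail(t_abs, df, steps=10000):
    """Approximate P(T > t_abs) for Student's t with df degrees of freedom.

    Uses Simpson's rule on the density over [0, t_abs]; steps must be even.
    """
    log_norm = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                - 0.5 * math.log(df * math.pi))

    def pdf(x):
        return math.exp(log_norm - (df + 1) / 2 * math.log1p(x * x / df))

    h = t_abs / steps
    total = pdf(0.0) + pdf(t_abs)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * pdf(i * h)
    area = total * h / 3
    # The density is symmetric, so half of the mass lies above zero.
    return max(0.0, 0.5 - area)


def paired_t_test(scores_a, scores_b):
    """Return (mean difference, t statistic, degrees of freedom, two-sided p-value)."""
    if len(scores_a) != len(scores_b):
        raise ValueError("both systems must be scored on the same samples")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    if n < 2:
        raise ValueError("need at least two paired samples")
    sd = stdev(diffs)
    if sd == 0:
        raise ValueError("all differences are identical; t is undefined")
    mean_diff = mean(diffs)
    t_stat = mean_diff / (sd / math.sqrt(n))
    p_value = 2 * t_upper_tail(abs(t_stat), n - 1)
    return mean_diff, t_stat, n - 1, p_value


if __name__ == "__main__":
    # Made-up scores of systems A and B on the same five samples.
    system_a = [27.1, 26.4, 28.0, 27.5, 26.9]
    system_b = [26.5, 26.0, 27.2, 27.4, 26.1]
    print(paired_t_test(system_a, system_b))
```

A confidence interval for the mean difference would additionally need a quantile of the t distribution, which a statistics library can supply.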
