[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision Both sides next revision
external:spr [2011/10/21 14:32]
smejkalova
external:spr [2011/10/21 15:09]
smejkalova
Line 16: Line 16:
 Malostranské náměstí 25 Malostranské náměstí 25
 CZ-118 00 Praha CZ-118 00 Praha
 +
 +
 +
  
  
Line 22: Line 25:
 "CPA Verb Validation Sample 30 (En)" is a newly developed lexical resource. It contains descriptions of the following 30 English verbs: ​ "CPA Verb Validation Sample 30 (En)" is a newly developed lexical resource. It contains descriptions of the following 30 English verbs: ​
  
 +|//access // |//​ally// ​  ​|//​analyse//​ |//​arrive// ​  ​|//​breathe//​ |
 +|//​claim// ​  ​|//​cool// ​  ​|//​cry// ​    ​|//​crush// ​   |//​deny// ​   |
 +|//​enlarge//​ |//enlist// |//​forge// ​  ​|//​frighten//​ |//​furnish//​ |
 +|//​hail// ​   |//​halt// ​  ​|//​part// ​   |//​plug// ​    ​|//​plough// ​ |
 +|//​pour// ​   |//​smash// ​ |//​smell// ​  ​|//​steer// ​   |//​submit// ​ |
 +|//​swell// ​  ​|//​throw// ​ |//​trouble//​ |//​wake// ​    ​|//​yield// ​  |
 +
 +
 +
 +Here we present a small portion of the data just to illustrate its structure. For each verb we define a set of semantic patterns and provide a  sample of manually annotated corpus concordances. Then we do a detailed interannotator disagreement analysis, error detection, and adjudication.
  
-|//access // |//ally// |//​analyse//​ |//arrive// |//​breathe//​ | 
-|//​claim// ​ |//cool// |//​cry// ​    ​|//​crush// ​ |//​deny// ​   | 
-|//​enlarge//​ |//enlist// |forge |frighten |furnish | 
-|hail |halt |part |plug |plough | 
-|pour |smash |smell |steer |submit | 
-|swell | throw |trouble |wake |yield | 
  
  
-Here we present a small portion of the data just to show its structure. For each verb we define a set of semantic patterns and provide a  sample of manually annotated corpus concordances. Then we do a detailed interannotator disagreement analysis, error detection and adjudication. 
  
  
Line 40: Line 46:
  
  
-==== Pattern Definitions Preview ==== 
  
-Pattern ​definitions - we have revised ​the pattern definitions for 30 verbs. Here is the sample of three of them (after revision):+ 
 + 
 + 
 + 
 + 
 + 
 + 
 +==== Pattern ​Definitions Preview ==== 
 +A few examples of revised ​PDEV entries:
      * [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​cool_patterns.html|cool]]      * [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​cool_patterns.html|cool]]
-     * [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​deny_patterns.html|deny]] ​- detailed [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​deny_7.png|view]] of pattern number 7+     * [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​deny_patterns.html|deny]]
      * [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​yield_patterns.html|yield]]      * [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​yield_patterns.html|yield]]
 +
 +An example of a pattern definition form in detail:
 +     * [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​deny_7.png|deny - pattern number 7]]
 +
 +
 +
 +
 +
  
  
  
 ==== Annotated Data Preview ==== ==== Annotated Data Preview ====
-Annotation ​of 50 concordances ​per each of these verbs by three annotators. Here is a little sample of three verbs. There has been already finished the manual ​disagreements ​analysis and all instances were adjudicated ​and gold sample ​was createdManual disagreements analysis and adjudication:​+Each of the examples below contains a multiple annotated set of 50 corpus ​concordances ​with manual ​disagreement ​analysis and final adjudication. The adjudicated ​data are used as a gold standard ​sample.
     * cool - [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​cool_analysis.pdf|pdf]]     * cool - [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​cool_analysis.pdf|pdf]]
     * deny - [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​deny_analysis.pdf|pdf]]     * deny - [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​deny_analysis.pdf|pdf]]
Line 56: Line 77:
  
  
-==== Disagreement Analysis Preview ==== 
  
-Automatic disagreements analysis - confusion matrix ​for each pair of the annotators ​in one file+ 
 + 
 + 
 +==== Disagreement Analysis Preview ==== 
 +Confusion matrices ​for each pair of annotators ​and automatic disagreements analysis:
     * cool - [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​cool_res.txt|txt]]     * cool - [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​cool_res.txt|txt]]
     * deny - [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​deny_res.txt|txt]]     * deny - [[http://​ufal.mff.cuni.cz/​~smejkalova/​pdev/​deny_res.txt|txt]]
Line 68: Line 92:
  
  
- 
-TODO: ne odlisna barva pozadi a rozsirit chlivky a kurziva (v tabulce sloves) + vycentrovat na strance 
-TODO: zkontrolovat obsah souboru (specifikace formulare) 
  

[ Back to the navigation ] [ Back to the content ]