
Institute of Formal and Applied Linguistics Wiki



Differences

This shows you the differences between two versions of the page.

external:spr [2011/10/21 14:32]
smejkalova
external:spr [2011/10/21 15:15]
smejkalova
Line 5: Line 5:
 ===== People and contacts ===== ===== People and contacts =====
  
-    * Silvie Cinková cinkova@ufal.mff.cuni.cz +    * Silvie Cinková <cinkova (at) ufal.mff.cuni.cz> 
-    * Martin Holub holub@ufal.mff.cuni.cz +    * Martin Holub <holub (at) ufal.mff.cuni.cz> 
-    * Lenka Smejkalová smejkalova@ufal.mff.cuni.cz +    * Lenka Smejkalová <smejkalova (at) ufal.mff.cuni.cz> 
-    * Vincent Kríž vincent.kriz@gmail.com+    * Vincent Kríž <vincent.kriz (at) gmail.com>
  
 Institute of Formal and Applied Linguistics Institute of Formal and Applied Linguistics
Line 16: Line 16:
 Malostranské náměstí 25 Malostranské náměstí 25
 CZ-118 00 Praha CZ-118 00 Praha
 +
 +
 +
  
  
Line 22: Line 25:
 "CPA Verb Validation Sample 30 (En)" is a newly developed lexical resource. It contains descriptions of the following 30 English verbs:  "CPA Verb Validation Sample 30 (En)" is a newly developed lexical resource. It contains descriptions of the following 30 English verbs: 
  
 +|//access // |//ally//   |//analyse// |//arrive//   |//breathe// |
 +|//claim//   |//cool//   |//cry//     |//crush//    |//deny//    |
 +|//enlarge// |//enlist// |//forge//   |//frighten// |//furnish// |
 +|//hail//    |//halt//   |//part//    |//plug//     |//plough//  |
 +|//pour//    |//smash//  |//smell//   |//steer//    |//submit//  |
 +|//swell//   |//throw//  |//trouble// |//wake//     |//yield//   |
 +
 +
 +
 +Here we present a small portion of the data just to illustrate its structure. For each verb we define a set of semantic patterns and provide a  sample of manually annotated corpus concordances. Then we do a detailed interannotator disagreement analysis, error detection, and adjudication.
  
-|//access // |//ally// |//analyse// |//arrive// |//breathe// | 
-|//claim//  |//cool// |//cry//     |//crush//  |//deny//    | 
-|//enlarge// |//enlist// |forge |frighten |furnish | 
-|hail |halt |part |plug |plough | 
-|pour |smash |smell |steer |submit | 
-|swell | throw |trouble |wake |yield | 
  
  
-Here we present a small portion of the data just to show its structure. For each verb we define a set of semantic patterns and provide a  sample of manually annotated corpus concordances. Then we do a detailed interannotator disagreement analysis, error detection and adjudication. 
  
  
Line 40: Line 46:
  
  
-==== Pattern Definitions Preview ==== 
  
-Pattern definitions - we have revised the pattern definitions for 30 verbs. Here is the sample of three of them (after revision):+ 
 + 
 + 
 + 
 + 
 + 
 + 
 +==== Pattern Definitions Preview ==== 
 +A few examples of revised PDEV entries:
      * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_patterns.html|cool]]      * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_patterns.html|cool]]
-     * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_patterns.html|deny]] - detailed [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_7.png|view]] of pattern number 7+     * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_patterns.html|deny]]
      * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_patterns.html|yield]]      * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_patterns.html|yield]]
 +
 +An example of a pattern definition form in detail:
 +     * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_7.png|deny - pattern number 7]]
 +
 +
 +
 +
 +
  
  
  
 ==== Annotated Data Preview ==== ==== Annotated Data Preview ====
-Annotation of 50 concordances per each of these verbs by three annotators. Here is a little sample of three verbs. There has been already finished the manual disagreements analysis and all instances were adjudicated and gold sample was created. Manual disagreements analysis and adjudication: +Each of the examples below contains a multiple annotated set of 50 corpus concordances with manual disagreement analysis and final adjudication. The adjudicated data are used as a gold standard sample.
     * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_analysis.pdf|pdf]]     * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_analysis.pdf|pdf]]
     * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_analysis.pdf|pdf]]     * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_analysis.pdf|pdf]]
Line 56: Line 77:
  
  
-==== Disagreement Analysis Preview ==== 
  
-Automatic disagreements analysis - confusion matrix for each pair of the annotators in one file+ 
 + 
 + 
 +==== Disagreement Analysis Preview ==== 
 +Confusion matrices for each pair of annotators and automatic disagreements analysis:
     * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_res.txt|txt]]     * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_res.txt|txt]]
     * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_res.txt|txt]]     * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_res.txt|txt]]
Line 68: Line 92:
  
  
- 
-TODO: no different background color, widen the cells, and use italics (in the verb table) + center on the page
-TODO: check the contents of the file (form specification)
  
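
The pairwise confusion matrices mentioned under "Disagreement Analysis Preview" can be illustrated with a minimal sketch. It assumes each annotator's decisions are available as a list of pattern labels aligned by concordance. The annotator names, the labels, and the function names below are hypothetical, and the actual layout of the linked ''*_res.txt'' files is not described on this page.

```python
from collections import Counter
from itertools import combinations


def pairwise_confusion(annotations):
    """Build one confusion matrix per annotator pair.

    annotations maps an annotator name to a list of pattern labels,
    one label per concordance, in the same order for every annotator.
    Each matrix is a Counter keyed by (label_of_first, label_of_second).
    """
    matrices = {}
    for first, second in combinations(sorted(annotations), 2):
        labels_a, labels_b = annotations[first], annotations[second]
        if len(labels_a) != len(labels_b):
            raise ValueError("annotators must label the same concordances")
        matrices[(first, second)] = Counter(zip(labels_a, labels_b))
    return matrices


def observed_agreement(matrix):
    """Share of concordances on which the two annotators chose the same label."""
    total = sum(matrix.values())
    same = sum(count for (a, b), count in matrix.items() if a == b)
    return same / total if total else 0.0


# Hypothetical toy data: three annotators, five concordances of one verb.
toy = {
    "A1": ["1", "1", "2", "7", "u"],
    "A2": ["1", "2", "2", "7", "u"],
    "A3": ["1", "1", "2", "3", "u"],
}

matrices = pairwise_confusion(toy)
for pair, matrix in matrices.items():
    print(pair, dict(matrix), observed_agreement(matrix))
```

Off-diagonal cells, such as the pair (''1'', ''2'') for annotators A1 and A2, point at the concordances that would go to manual disagreement analysis and adjudication.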
