Institute of Formal and Applied Linguistics Wiki

Differences

This shows you the differences between two versions of the page.

external:spr [2011/10/20 15:32]
smejkalova
external:spr [2011/10/21 14:32]
smejkalova
Line 1: Line 1:
-====== Semantic Pattern Recognition ====== +====== Semantic Pattern Recognition (SPR)  --  project webpage ======
-Add a brief description.+
  
  
  
 +===== People and contacts =====
  
-====== Data ======+    * Silvie Cinková cinkova@ufal.mff.cuni.cz 
 +    * Martin Holub holub@ufal.mff.cuni.cz 
 +    * Lenka Smejkalová smejkalova@ufal.mff.cuni.cz 
 +    * Vincent Kríž vincent.kriz@gmail.com
  
-     * Manual for Annotators - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/annotation_manual.pdf|pdf]] +Institute of Formal and Applied Linguistics 
-     * Specification of form for defining patterns - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/PDEV2.1-pattern-form.pdf|pdf]] +Charles University in Prague 
-     * Pattern definitions - we have revised the pattern definitions for 30 verbs. Here is the sample of three of them (after revision): +Faculty of Mathematics and Physics
-          * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_patterns.html|cool]] +
-          * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_patterns.html|deny]] - detailed [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_7.png|view]] of pattern number 7 +
-          * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_patterns.html|yield]] +
-     * Annotation of 50 concordances per each of these verbs by three annotators. Here is a little sample of three verbs. There has been already finished the manual disagreements analysis and all instances were adjudicated and gold sample was created. +
-          * Manual disagreements analysis and adjudication: +
-                * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_analysis.pdf|pdf]] +
-                * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_analysis.pdf|pdf]] +
-                * yield - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_analysis.pdf|pdf]] +
-          * Automatic disagreements analysis - confusion matrix for each pair of the annotators in one file +
-                * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_res.txt|txt]] +
-                * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_res.txt|txt]] +
-                * yield - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_res.txt|txt]] +
-          * Inter-annotator agreement (Cohen's kappa for each pair, Fleiss' kappa for all together) +
  
-Results of inter-annotator agreement +Malostranské náměstí 25 
-^ verb ^ size ^ #N ^ Fleiss' kappa ^ Cohen's kappa ^^^ +CZ-118 00 Praha 
-|  |  |  |  | A2 vs. A3 | A2 vs. A1 | A3 vs. A1 | + 
-| cool | 50 | 16 | 0.685 | 0.743 | 0.669 | 0.646 | + 
-| deny | 50 | 10 | 0.524 | 0.434 | 0.571 | 0.582 | +===== CPA Verb Validation Sample 30 (En) ===== 
-| yield | 50 | 10 | 0.500 | 0.489 | 0.588 | 0.429 | + 
 +"CPA Verb Validation Sample 30 (En)" is a newly developed lexical resource. It contains descriptions of the following 30 English verbs:  
 + 
 + 
 +|//access// |//ally// |//analyse// |//arrive// |//breathe// | 
 +|//claim// |//cool// |//cry// |//crush// |//deny// | 
 +|//enlarge// |//enlist// |forge |frighten |furnish | 
 +|hail |halt |part |plug |plough | 
 +|pour |smash |smell |steer |submit | 
 +|swell |throw |trouble |wake |yield | 
 + 
 + 
 +Here we present a small portion of the data to show its structure. For each verb we define a set of semantic patterns and provide a sample of manually annotated corpus concordances. We then carry out a detailed inter-annotator disagreement analysis, error detection, and adjudication. 
 + 
 + 
 + 
 +==== Annotation Scheme Description ==== 
 +     * Pattern Definition Form - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/PDEV2.1-pattern-form.pdf|pdf]] 
 +     * Annotation Manual - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/annotation_manual.pdf|pdf]] 
 + 
 + 
 +==== Pattern Definitions Preview ==== 
 + 
 +Pattern definitions - we have revised the pattern definitions for 30 verbs. Here is a sample of three of them (after revision): 
 +     * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_patterns.html|cool]] 
 +     * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_patterns.html|deny]] - detailed [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_7.png|view]] of pattern number 7 
 +     * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_patterns.html|yield]] 
 + 
 + 
 + 
 +==== Annotated Data Preview ==== 
 +Three annotators each annotated 50 concordances per verb. Here is a small sample covering three verbs. The manual disagreement analysis is finished: all instances were adjudicated and a gold sample was created. Manual disagreement analysis and adjudication: 
 +    * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_analysis.pdf|pdf]] 
 +    * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_analysis.pdf|pdf]] 
 +    * yield - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_analysis.pdf|pdf]] 
 + 
 + 
 +==== Disagreement Analysis Preview ==== 
 + 
 +Automatic disagreement analysis - confusion matrices for each pair of annotators, collected in one file per verb: 
 +    * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_res.txt|txt]] 
 +    * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_res.txt|txt]] 
 +    * yield - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_res.txt|txt]] 
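
The pairwise confusion matrices listed above, and the Cohen's kappa that can be derived from them, might be computed roughly as in the following sketch. It is only an illustration and not the project's actual script: the function names, the annotator IDs used as dictionary keys, and the toy pattern labels are all assumptions made for the example.

```python
from collections import Counter
from itertools import combinations


def confusion_matrix(labels_a, labels_b):
    """Count how often annotator A chose pattern i while annotator B chose pattern j."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both annotators must label the same concordances")
    if not labels_a:
        raise ValueError("at least one annotated concordance is required")
    return Counter(zip(labels_a, labels_b))


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items."""
    matrix = confusion_matrix(labels_a, labels_b)
    n = sum(matrix.values())
    # Observed agreement: share of items on the diagonal of the matrix.
    observed = sum(count for (a, b), count in matrix.items() if a == b) / n
    # Chance agreement: product of the two annotators' marginal label frequencies.
    marginal_a = Counter(labels_a)
    marginal_b = Counter(labels_b)
    expected = sum(marginal_a[k] * marginal_b[k] for k in marginal_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used a single identical label throughout
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy annotations (pattern numbers per concordance), not project data.
annotations = {
    "A1": ["1", "1", "2", "2", "3", "1"],
    "A2": ["1", "2", "2", "2", "3", "1"],
    "A3": ["1", "1", "2", "3", "3", "1"],
}

for first, second in combinations(sorted(annotations), 2):
    kappa = cohen_kappa(annotations[first], annotations[second])
    print(f"{first} vs. {second}: kappa = {kappa:.3f}")
```

Storing the matrix as a `Counter` keyed by label pairs keeps the sketch independent of how many patterns a verb has; unused pattern pairs simply count as zero.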
 + 
 + 
 + 
 + 
 + 
 + 
 + 
 +TODO: remove the distinct background colour, widen the cells and use italics (in the verb table), and centre the table on the page 
 +TODO: check the content of the file (form specification)
  