[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
Next revision Both sides next revision
external:spr [2011/10/20 15:29]
smejkalova vytvořeno
external:spr [2011/10/21 14:32]
smejkalova
Line 1: Line 1:
-====== Semantic Pattern Recognition ====== +====== Semantic Pattern Recognition (SPR)  --  project webpage ======
-Doplnit strucny popis.+
  
  
  
 +===== People and contacts =====
  
-====== What we have already done ======+    * Silvie Cinková cinkova@ufal.mff.cuni.cz 
 +    * Martin Holub holub@ufal.mff.cuni.cz 
 +    * Lenka Smejkalová smejkalova@ufal.mff.cuni.cz 
 +    * Vincent Kríž vincent.kriz@gmail.com
  
-* Manual for Annotators - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/annotation_manual.pdf|pdf]] +Institute of Formal and Applied Linguistics 
-* Specification of form for defining patterns - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/PDEV2.1-pattern-form.pdf|pdf]] +Charles University in Prague 
-* Pattern definitions - we have revised the pattern definitions for 30 verbs. Here is the sample of three of them (after revision): +Faculty of Mathematics and Physics
-    * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_patterns.html|cool]] +
-    * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_patterns.html|deny]] - detailed [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_7.png|view]] of pattern number 7 +
-    * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_patterns.html|yield]] +
-* Annotation of 50 concordances per each of these verbs by three annotators. Here is a little sample of three verbs. There has been already finished the manual disagreements analysis and all instances were adjudicated and gold sample was created. +
-    * Manual disagreements analysis and adjudication: +
-          * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_analysis.pdf|pdf]] +
-          * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_analysis.pdf|pdf]] +
-          * yield - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_analysis.pdf|pdf]] +
-    * Automatic disagreements analysis - confusion matrix for each pair of the annotators in one file +
-          * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_res.txt|txt]] +
-          * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_res.txt|txt]] +
-          * yield - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_res.txt|txt]] +
-    * Inter-annotator agreement (Cohen's kappa for each pair, Fleiss' kappa for all together+
  
-Results of inter-annotator agreement +Malostranské náměstí 25 
-^verb      size  ^ #N ^ Fleiss' kappa ^        Cohen's kappa            ^^^ +CZ-118 00 Praha 
-                                A2 vs. A3 | A2 vs. A1 | A3 vs. A1 + 
-|cool         50|  16|    0.685     0.743     0.669     0.646+ 
-|deny        50|  10         0.524     0.434     0.571     0.582+===== CPA Verb Validation Sample 30 (En) ===== 
-|yield    |      50|  10|    0.500     0.489     0.588     0.429|+ 
 +"CPA Verb Validation Sample 30 (En)" is a newly developed lexical resource. It contains descriptions of the following 30 English verbs:  
 + 
 + 
 +|//access // |//ally// |//analyse// |//arrive// |//breathe// 
 +|//claim//  |//cool// |//cry//     |//crush//  |//deny//    | 
 +|//enlarge// |//enlist// |forge |frighten |furnish 
 +|hail |halt |part |plug |plough | 
 +|pour |smash |smell |steer |submit | 
 +|swell | throw |trouble |wake |yield | 
 + 
 + 
 +Here we present a small portion of the data just to show its structure. For each verb we define a set of semantic patterns and provide a  sample of manually annotated corpus concordances. Then we do a detailed interannotator disagreement analysis, error detection and adjudication. 
 + 
 + 
 + 
 +==== Annotation Scheme Description ==== 
 +     * Pattern Definition Form - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/PDEV2.1-pattern-form.pdf|pdf]] 
 +     * Annotation Manual - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/annotation_manual.pdf|pdf]] 
 + 
 + 
 +==== Pattern Definitions Preview ==== 
 + 
 +Pattern definitions - we have revised the pattern definitions for 30 verbsHere is the sample of three of them (after revision): 
 +     * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_patterns.html|cool]] 
 +     * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_patterns.html|deny]] - detailed [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_7.png|view]] of pattern number 7 
 +     * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_patterns.html|yield]] 
 + 
 + 
 + 
 +==== Annotated Data Preview ==== 
 +Annotation of 50 concordances per each of these verbs by three annotators. Here is a little sample of three verbs. There has been already finished the manual disagreements analysis and all instances were adjudicated and gold sample was created. Manual disagreements analysis and adjudication: 
 +    * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_analysis.pdf|pdf]] 
 +    * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_analysis.pdf|pdf]] 
 +    * yield - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_analysis.pdf|pdf]] 
 + 
 + 
 +==== Disagreement Analysis Preview ==== 
 + 
 +Automatic disagreements analysis - confusion matrix for each pair of the annotators in one file 
 +    * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_res.txt|txt]] 
 +    * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_res.txt|txt]] 
 +    * yield - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_res.txt|txt]] 
 + 
 + 
 + 
 + 
 + 
 + 
 + 
 +TODO: ne odlisna barva pozadi a rozsirit chlivky a kurziva (v tabulce sloves) + vycentrovat na strance 
 +TODO: zkontrolovat obsah souboru (specifikace formulare)
  

[ Back to the navigation ] [ Back to the content ]