====== Semantic Pattern Recognition (SPR) -- project webpage ======

===== People and contacts =====

  * Silvie Cinková
  * Martin Holub
  * Lenka Smejkalová
  * Vincent Kríž

Institute of Formal and Applied Linguistics
Charles University in Prague
Faculty of Mathematics and Physics
Malostranské náměstí 25
CZ-118 00 Praha

===== CPA Verb Validation Sample 30 (En) =====

"CPA Verb Validation Sample 30 (En)" is a newly developed lexical resource. It contains descriptions of the following 30 English verbs:

|//access// |//ally// |//arrive// |//breathe// |//claim// |
|//cool// |//cry// |//crush// |//deny// |//enlarge// |
|//enlist// |//forge// |//furnish// |//hail// |//halt// |
|//part// |//plug// |//plough// |//pour// |//say// |
|//smash// |//smell// |//steer// |//submit// |//swell// |
|//tell// |//throw// |//trouble// |//wake// |//yield// |

Here we present a small portion of the data to illustrate its structure.

For each verb we define a set of semantic patterns and provide a sample of manually annotated corpus concordances. We then perform a detailed inter-annotator disagreement analysis, error detection, and adjudication.

==== Annotation Scheme Description ====

  * Pattern Definition Form - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/PDEV2.1-pattern-form.pdf|pdf]]
  * Annotation Manual - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/annotation_manual.pdf|pdf]]

==== Pattern Definitions Preview ====

A few examples of revised PDEV entries:

  * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_patterns.html|cool]]
  * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_patterns.html|deny]]
  * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_patterns.html|yield]]

An example of a pattern definition form in detail:

  * [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_7.png|deny - pattern number 7]]

==== Annotated Data Preview ====

Each of the examples below contains a multiply annotated set of 50 corpus concordances with manual disagreement analysis and final adjudication.
The adjudicated data are used as a gold standard sample.

  * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_analysis.pdf|pdf]]
  * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_analysis.pdf|pdf]]
  * yield - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_analysis.pdf|pdf]]

==== Disagreement Analysis Preview ====

Confusion matrices for each pair of annotators and automatic disagreement analysis:

  * cool - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/cool_res.txt|txt]]
  * deny - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/deny_res.txt|txt]]
  * yield - [[http://ufal.mff.cuni.cz/~smejkalova/pdev/yield_res.txt|txt]]
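
To illustrate what a confusion matrix for a pair of annotators looks like, here is a minimal Python sketch. It is not the project's actual tooling, and it does not reproduce the format of the result files linked above. The annotator names (''A1'', ''A2'', ''A3''), the pattern numbers, and all function names are hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import combinations


def confusion_matrix(labels_a, labels_b):
    """Count how often each (label from A, label from B) pair occurs."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both annotators must label the same concordances")
    return Counter(zip(labels_a, labels_b))


def observed_agreement(labels_a, labels_b):
    """Share of concordances on which the two annotators chose the same pattern."""
    if not labels_a:
        raise ValueError("no concordances to compare")
    matrix = confusion_matrix(labels_a, labels_b)
    agreed = sum(n for (a, b), n in matrix.items() if a == b)
    return agreed / len(labels_a)


def pairwise_report(annotations):
    """Build a confusion matrix and agreement score for every annotator pair."""
    report = {}
    for name_a, name_b in combinations(sorted(annotations), 2):
        a, b = annotations[name_a], annotations[name_b]
        report[(name_a, name_b)] = (confusion_matrix(a, b), observed_agreement(a, b))
    return report


# Hypothetical toy data: pattern numbers assigned to five concordances.
annotations = {
    "A1": ["1", "2", "2", "7", "1"],
    "A2": ["1", "2", "3", "7", "1"],
    "A3": ["1", "1", "3", "7", "2"],
}

report = pairwise_report(annotations)
for pair, (matrix, agreement) in report.items():
    print(pair, f"agreement={agreement:.2f}")
    for (a, b), n in sorted(matrix.items()):
        print(f"  {a} vs {b}: {n}")
```

Off-diagonal cells (where the two labels differ) point to the concordances that would need manual inspection. The sketch reports raw observed agreement only; a chance-corrected measure would be a separate step.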