[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki

[ Back to the navigation ]

This is an old revision of the document!

Table of Contents

Semantic Pattern Recognition (SPR) -- project webpage

People and contacts

Institute of Formal and Applied Linguistics
Charles University in Prague
Faculty of Mathematics and Physics

Malostranské náměstí 25
CZ-118 00 Praha

CPA Verb Validation Sample 30 (En)

“CPA Verb Validation Sample 30 (En)” is a newly developed lexical resource. It contains descriptions of the following 30 English verbs:

access ally analyse arrive breathe
claim cool cry crush deny
enlarge enlist forge frighten furnish
hail halt part plug plough
pour smash smell steer submit
swell throw trouble wake yield

Here we present a small portion of the data just to show its structure. For each verb we define a set of semantic patterns and provide a sample of manually annotated corpus concordances. Then we do a detailed interannotator disagreement analysis, error detection and adjudication.

Annotation Scheme Description

Pattern Definitions Preview

Pattern definitions - we have revised the pattern definitions for 30 verbs. Here is the sample of three of them (after revision):

Annotated Data Preview

Annotation of 50 concordances per each of these verbs by three annotators. Here is a little sample of three verbs. There has been already finished the manual disagreements analysis and all instances were adjudicated and gold sample was created. Manual disagreements analysis and adjudication:

Disagreement Analysis Preview

Automatic disagreements analysis - confusion matrix for each pair of the annotators in one file

TODO: ne odlisna barva pozadi a rozsirit chlivky a kurziva (v tabulce sloves) + vycentrovat na strance
TODO: zkontrolovat obsah souboru (specifikace formulare)

[ Back to the navigation ] [ Back to the content ]