[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
user:zeman:interset:drivers [2008/04/03 14:27]
zeman Restructuralization.
user:zeman:interset:drivers [2008/04/03 18:04]
zeman Portuguese.
Line 98: Line 98:
 Work finished: 31.3.2008 Work finished: 31.3.2008
 Total work time: 10 min Total work time: 10 min
 +
 +
 +
 +===== Portuguese (pt) =====
 +
 +The Portuguese CoNLL treebank contains tags with 149 different features. Big part of them are noise, probably introduced by the conversion procedure from the original Floresta format to the CoNLL format. The driver is designed so that it accepts all incorrect tags on decoding but encodes only corrected tags. Incorrect tags are not on the list of possible tags so the driver tester will not complain.
 +
 +| **Feature** | **Explanation** | **Examples** |
 +| _ | no features | prepositions, punctuation etc. |
 +| 1 | 1st person | |
 +| 1/3S | 1st person or 3rd person singular | leia, disse, seria, prefira |
 +| 1S | 1st person singular | tenho, tinha, usei, vivo, vou |
 +| 1P | 1st person plural | tomámos, vamos, vemos, víamos |
 +| 2S | 2nd person singular | compreendeste, queres, te, ti, veja, vives |
 +| 2P | 2nd person plural | chamais, vós |
 +| 3S | 3rd person singular | viu, viva |
 +| 3S/P | 3rd person singular or plural | se, si |
 +| 3P | 3rd person plural | vivem |
 +| ACC | pronoun as direct accusative object | se, te, vos |
 +| ACC/DAT | pronouns in accusative or dative | nos, se |
 +| <ALT> | indicates typo in word | |
 +| <artd> | definite article or determiner pronoun | a, as, o, os |
 +| <arti> | indefinite article or determiner pronoun | uma, um |
 +| <card> | cardinal number | um, uma, dois, três, quatro, cinco |
 +| <co-acc> | coordination of direct accusative objects | |
 +| <co-advl> | coordination of adjunct adverbials | |
 +| <co-advo> | coordination of argument adverbials, object related | |
 +| <co-advs> | coordination of argument adverbials, subject related | |
 +| <co-app> | coordination of adnominal appositions | |
 +| <co-fmc> | coordination of main clauses | |
 +| <co-ger> | coordination of gerunds | |
 +| <co-inf> | coordination of infinitives | |
 +| <co-oc> | coordination of object complements | |
 +| <co-pass> | coordination of passive adjuncts | |
 +| <co-pcv> | coordination of predicative participles | |
 +| <co-piv> | coordination of prepositional objects | |
 +| <co-postad> | coordination of postpositioned dependents in ap or advp | |
 +| <co-postnom> | coordination of postpositioned dependents in np | |
 +| <co-pred> | coordination of adjunct predicatives | |
 +| <co-prenom> | coordination of prepositioned dependents in np | |
 +| <co-prparg> | coordination of preposition arguments | |
 +| <co-sc> | coordination of subject complements | |
 +| <co-subj> | coordination of subjects | |
 +| <co-vfin> | coordination of finite verbs | |
 +| <coll> | collective reflexive pronoun | se (reunir-se, associar-se) |
 +| COND | verb in conditional mood | precisariam, seriam, tentaria, venderia, viriam |
 +| DAT | pronoun as dative object | lhe, lhes, me, no, nos, se, vos |
 +| > | noise; should be ignored | |
 +| 0/1/3S | noise; should probably be 1/3S | |
 +| 1S> | noise; should be 1S | meu, meus, minha, minhas |
 +| 1P> | noise; should be 1P | nossa, nossas, nosso, nossos |
 +| 2S> | noise; should be 2S | seu, teu |
 +| 2P> | noise; should be 2P | vossa, vosso |
 +| 3S> | noise; should be 3S | seu, seus, sua, suas |
 +| 3S/P> | noise; should be 3S/P | seu, seus, sua |
 +| 3P> | noise; should be 3P | seu, seus, sua |
 +| <adv> | noise? | fundo |
 +| <advl> | noise; should be <co-advl> | e |
 +| <co-adv> | noise; should be <co-advl> | |
 +| >co-fmc> | noise; should be <co-fmc> | |
 +| <co-fmv> | noise; should be <co-fmc> | |
 +| convidado-> | noise; should be <ALT> | |
 +| <co-postnom | noise; should be <co-postnom> | |
 +| <co-prparg | noise; should be <co-prparg | |
 +| <corr | noise; should be <ALT> | |
 +| <co-vfin><co-fmc> | noise; should be two features | |
  
 ===== Swedish (sv) ===== ===== Swedish (sv) =====

[ Back to the navigation ] [ Back to the content ]