===== Spanish (es) ===== There is [[http://clic.ub.edu/corpus/|one treebank]] versions of which were known in different times under different names: * CESS-Cast * Cast3LB * AnCora-ES ==== Versions ==== * CoNLL 2006 (Cast3LB) * CoNLL 2009 (AnCora-ES) The dependency treebank Cast3LB was extracted automatically from an earlier constituent-based annotation (see Montserrat Civit, Ma. Antònia Martí, Núria Bufí: [[http://www.springerlink.com/content/978-3-540-37334-6/#section=474512&page=8&locus=86|Cat3LB and Cast3LB: From Constituents to Dependencies]]. In: T. Salakoski et al. (eds.): FinTAL 2006, LNAI 4139, pp. 141–152, 2006, Springer, Berlin / Heidelberg) ==== Obtaining and License ==== The AnCora-ES corpus ought to be freely downloadable from [[http://clic.ub.edu/corpus/en/ancora-descarregues|its website]]. The download will not work for unregistered and not signed in users. The website offers creating new account but it is not automatic, one has to wait for approval. Republication of the two CoNLL versions in LDC is planned but it has not happenned yet. The CoNLL 2006 license in short: * research and demonstrative usage * no redistribution * cite in publications * The original CoNLL 2006 license required a reference to the 3LB //project:// (Palomar, M., M. Civit, A. Díaz, L. Moreno, E. Bisbal, M. Aranzabe, A. Ageno, M.A. Martí y B. Navarro (2004) 3LB: Construcción de una base de datos de árboles sintáctico-semánticos para el catalán, euskera y español, XX Congreso de la Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN), pp. 81-88. Barcelona, Julio de 2004 ISSN 1135-5948. * Later there was [[http://www.lrec-conf.org/proceedings/lrec2008/summaries/35.html|the LREC paper]], which is now the required reference for the AnCora corpus. AnCora-ES was created by members of the [[http://clic.ub.edu/|Centre de Llenguatge i Computació (CLiC)]], Universitat de Barcelona, Gran Via de les Corts Catalanes 585, E-08007 Barcelona, Spain, together with Universidad de Alicante (UA), Universitat Politècnica de Catalunya (UPC), Universitat Politècnica de Valencia (UPV) and Euskal Herriko Unibertsitatea (EHU-UPV). ==== References ==== * Website * http://clic.ub.edu/corpus/ * Data * //no separate citation// * Principal publications * Mariona Taulé, M. Antònia Martí, Marta Recasens: [[http://www.lrec-conf.org/proceedings/lrec2008/summaries/35.html|AnCora: Multilevel Annotated Corpora for Catalan and Spanish]]. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco, 2008. ISBN 2-9517408-4-0 * Documentation * Maria Antònia Martí, Mariona Taulé, Manu Bertran, Lluís Màrquez: [[http://clic.ub.edu/corpus/webfm_send/13|AnCora: Multilingual and Multilevel Annotated Corpora]]. Draft Technical report, online. * [[http://clic.ub.edu/corpus/webfm_send/18|Morphology]] * [[http://clic.ub.edu/corpus/webfm_send/17|Syntactic guidelines]] ==== Domain ==== Mostly newswire (EFE news, El Periódico, 2000). ==== Size ==== The CoNLL 2006 version contains 95028 tokens in 3512 sentences, yielding 27.06 tokens per sentence on average (CoNLL 2006 data split: 89334 tokens / 3306 sentences training, 5694 tokens / 206 sentences test). The CoNLL 2009 version contains 528,440 tokens in 17709 sentences, yielding 29.84 tokens per sentence on average (CoNLL 2009 data split: 427,442 tokens / 14329 sentences training, 50368 tokens / 1655 sentences development, 50630 tokens / 1725 sentences test). ==== Inside ==== The original morphosyntactic tags (EAGLES?) have been converted to fit into the three columns (CPOS, POS and FEAT) columns of the CoNLL 2006/7 format, resp. the two columns (POS and FEAT) of the CoNLL 2009 format. Note that the missing CPOS column is not the only difference between the two conversion schemes. [[http://clic.ub.edu/corpus/webfm_send/18|Feature names and values]] in the FEAT column are different, too. The morphosyntactic tags have been disambiguated manually. The CoNLL 2009 version also contains automatically disambiguated tags. Multi-word expressions have been collapsed into one token, using underscore as the joining character. This includes named entities (e.g. Comunidad_de_Madrid, Unión_Europea, Alejandro_Couceiro, 1,8_por_ciento, siglo_XXI), prepositional compounds and others (como_puede_ser, a_fondo, a_partir_de). Empty (underscore) tokens have been inserted to represent missing subjects (Spanish is a pro-drop language). ==== Sample ==== The first sentence of the CoNLL 2006 training data: | 1 | El | el | d | da | gen=m|num=s | 2 | _ | 2 | _ | | 2 | presidente | presidente | n | nc | gen=m|num=s | 10 | SUJ | 10 | SUJ | | 3 | del | del | s | sp | gen=m|num=s|for=c | 2 | _ | 2 | _ | | 4 | órgano | órgano | n | nc | gen=m|num=s | 3 | _ | 3 | _ | | 5 | regulador | regulador | a | aq | gen=m|num=s | 4 | _ | 4 | _ | | 6 | de | de | s | sp | for=s | 4 | _ | 4 | _ | | 7 | las | el | d | da | gen=f|num=p | 8 | _ | 8 | _ | | 8 | Telecomunicaciones | Telecomunicaciones | n | np | _ | 6 | _ | 6 | _ | | 9 | se | él | p | p0 | per=3 | 10 | _ | 10 | _ | | 10 | mostró | mostrar | v | vm | num=s|per=3|mod=i|tmp=s | 0 | ROOT | 0 | ROOT | | 11 | partidario | partidario | a | aq | gen=m|num=s | 10 | CPRED.SUJ | 10 | CPRED.SUJ | | 12 | de | de | s | sp | for=s | 11 | _ | 11 | _ | | 13 | completar | completar | v | vm | mod=n | 19 | _ | 19 | _ | | 14 | esta | este | d | dd | num=s|gen=f | 15 | _ | 15 | _ | | 15 | liberalización | liberalización | n | nc | num=s|gen=f | 19 | CD | 19 | CD | | 16 | de | de | s | sp | for=s | 15 | _ | 15 | _ | | 17 | las | el | d | da | gen=f|num=p | 18 | _ | 18 | _ | | 18 | telecomunicaciones | telecomunicación | n | nc | gen=f|num=p | 16 | _ | 16 | _ | | 19 | con | con | s | sp | for=s | 12 | _ | 12 | _ | | 20 | otras | otro | d | di | gen=f|num=p | 21 | _ | 21 | _ | | 21 | medidas | medida | n | nc | gen=f|num=p | 19 | _ | 19 | _ | | 22 | que | que | p | pr | gen=c|num=n | 23 | SUJ | 23 | SUJ | | 23 | incentiven | incentivar | v | vm | num=p|per=3|mod=s|tmp=p | 21 | _ | 21 | _ | | 24 | la | el | d | da | num=s|gen=f | 25 | _ | 25 | _ | | 25 | competencia | competencia | n | nc | num=s|gen=f | 23 | CD | 23 | CD | | 26 | como_puede_ser | como_puede_ser | s | sp | for=s | 21 | _ | 21 | _ | | 27 | abrir | abrir | v | vm | mod=n | 38 | _ | 38 | _ | | 28 | el | el | d | da | gen=m|num=s | 29 | _ | 29 | _ | | 29 | acceso | acceso | n | nc | gen=m|num=s | 38 | CD | 38 | CD | | 30 | a | a | s | sp | for=s | 29 | _ | 29 | _ | | 31 | la | el | d | da | num=s|gen=f | 32 | _ | 32 | _ | | 32 | información | información | n | nc | num=s|gen=f | 30 | _ | 30 | _ | | 33 | de | de | s | sp | for=s | 32 | _ | 32 | _ | | 34 | los | el | d | da | gen=m|num=p | 35 | _ | 35 | _ | | 35 | clientes | cliente | n | nc | num=p|gen=c | 33 | _ | 33 | _ | | 36 | de | de | s | sp | for=s | 35 | _ | 35 | _ | | 37 | Telefónica | Telefónica | n | np | _ | 36 | _ | 36 | _ | | 38 | a | a | s | sp | for=s | 26 | _ | 26 | _ | | 39 | otros | otro | d | di | gen=m|num=p | 40 | _ | 40 | _ | | 40 | operadores | operador | n | nc | gen=m|num=p | 38 | _ | 38 | _ | | 41 | . | . | F | Fp | _ | 10 | PUNC | 10 | PUNC | The first sentence of the CoNLL 2006 test data: | 1 | Las | el | d | da | gen=f|num=p | 2 | _ | 2 | _ | | 2 | reservas | reserva | n | nc | gen=f|num=p | 9 | SUJ | 9 | SUJ | | 3 | de | de | s | sp | for=s | 2 | _ | 2 | _ | | 4 | oro | oro | n | nc | gen=m|num=s | 3 | _ | 3 | _ | | 5 | y | y | c | cc | _ | 4 | _ | 4 | _ | | 6 | divisas | divisa | n | nc | gen=f|num=p | 4 | _ | 4 | _ | | 7 | de | de | s | sp | for=s | 2 | _ | 2 | _ | | 8 | Rusia | Rusia | n | np | _ | 7 | _ | 7 | _ | | 9 | subieron | subir | v | vm | num=p|mod=i|tmp=s|per=3 | 18 | CD | 18 | CD | | 10 | 800_millones_de_dólares | 800000000_dólar | Z | Zm | _ | 9 | CC | 9 | CC | | 11 | y | y | c | cc | _ | 9 | _ | 9 | _ | | 12 | el | el | d | da | gen=m|num=s | 13 | _ | 13 | _ | | 13 | 26_de_mayo | 26_de_mayo | w | w | _ | 14 | CC | 14 | CC | | 14 | equivalían | equivaler | v | vm | num=p|mod=i|per=3|tmp=i | 9 | _ | 9 | _ | | 15 | a | a | s | sp | for=s | 14 | CREG | 14 | CREG | | 16 | 19.100_millones_de_dólares | 19100000000_dólar | Z | Zm | _ | 15 | _ | 15 | _ | | 17 | , | , | F | Fc | _ | 9 | PUNC | 9 | PUNC | | 18 | informó | informar | v | vm | num=s|mod=i|tmp=s|per=3 | 0 | ROOT | 0 | ROOT | | 19 | hoy | hoy | r | rg | _ | 18 | CC | 18 | CC | | 20 | un | uno | d | di | gen=m|num=s | 21 | _ | 21 | _ | | 21 | comunicado | comunicado | n | nc | gen=m|num=s | 18 | SUJ | 18 | SUJ | | 22 | del | del | s | sp | gen=m|num=s|for=c | 21 | _ | 21 | _ | | 23 | Banco_Central | Banco_Central | n | np | _ | 22 | _ | 22 | _ | | 24 | . | . | F | Fp | _ | 18 | PUNC | 18 | PUNC | The first sentence of the CoNLL 2009 training data: | 1 | El | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 2 | 2 | spec | spec | _ | _ | _ | _ | _ | _ | | 2 | presidente | presidente | presidente | n | n | postype=common|gen=m|num=s | postype=common|gen=m|num=s | 10 | 10 | suj | suj | _ | _ | arg1-tem | _ | _ | _ | | 3 | del | del | del | s | s | postype=preposition|gen=m|num=s|contracted=yes | postype=preposition|gen=m|num=s|contracted=yes | 2 | 2 | sp | sp | _ | _ | _ | _ | _ | _ | | 4 | órgano | órgano | órgano | n | n | postype=common|gen=m|num=s | postype=common|gen=m|num=s | 3 | 3 | sn | sn | _ | _ | _ | _ | _ | _ | | 5 | regulador | regulador | regulador | a | a | postype=qualificative|gen=m|num=s | postype=qualificative|gen=m|num=s | 4 | 4 | s.a | s.a | _ | _ | _ | _ | _ | _ | | 6 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 4 | 4 | sp | sp | _ | _ | _ | _ | _ | _ | | 7 | las | el | el | d | d | postype=article|gen=f|num=p | postype=article|gen=f|num=p | 8 | 8 | spec | spec | _ | _ | _ | _ | _ | _ | | 8 | Telecomunicaciones | Telecomunicaciones | telecomunicación | n | n | postype=proper|gen=c|num=c | postype=common|gen=f|num=p | 6 | 6 | sn | sn | _ | _ | _ | _ | _ | _ | | 9 | se | él | se | p | p | gen=c|num=c|person=3 | gen=c|num=c | 10 | 10 | morfema.pronominal | morfema.pronominal | _ | _ | _ | _ | _ | _ | | 10 | mostró | mostrar | mostrar | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | 0 | 0 | sentence | sentence | Y | mostrar.c2 | _ | _ | _ | _ | | 11 | partidario | partidario | partidario | a | a | postype=qualificative|gen=m|num=s | postype=qualificative|gen=m|num=s | 10 | 10 | cpred | cpred | _ | _ | arg2-atr | _ | _ | _ | | 12 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 11 | 11 | sp | sp | _ | _ | _ | _ | _ | _ | | 13 | completar | completar | completar | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | 12 | 12 | S | S | Y | completar.a2 | _ | _ | _ | _ | | 14 | esta | este | este | d | d | postype=demonstrative|gen=f|num=s | postype=demonstrative|gen=f|num=s | 15 | 15 | spec | spec | _ | _ | _ | _ | _ | _ | | 15 | liberalización | liberalización | liberalización | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 13 | 13 | cd | cd | _ | _ | _ | arg1-pat | _ | _ | | 16 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 15 | 15 | sp | sp | _ | _ | _ | _ | _ | _ | | 17 | las | el | el | d | d | postype=article|gen=f|num=p | postype=article|gen=f|num=p | 18 | 18 | spec | spec | _ | _ | _ | _ | _ | _ | | 18 | telecomunicaciones | telecomunicación | telecomunicación | n | n | postype=common|gen=f|num=p | postype=common|gen=f|num=p | 16 | 16 | sn | sn | _ | _ | _ | _ | _ | _ | | 19 | con | con | con | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 13 | 13 | cc | cc | _ | _ | _ | argM-adv | _ | _ | | 20 | otras | otro | otro | d | d | postype=indefinite|gen=f|num=p | postype=indefinite|gen=f|num=p | 21 | 21 | spec | spec | _ | _ | _ | _ | _ | _ | | 21 | medidas | medida | medida | n | n | postype=common|gen=f|num=p | postype=common|gen=f|num=p | 19 | 19 | sn | sn | _ | _ | _ | _ | _ | _ | | 22 | que | que | que | p | p | postype=relative|gen=c|num=c | postype=relative|gen=c|num=c | 23 | 23 | suj | suj | _ | _ | _ | _ | arg0-agt | _ | | 23 | incentiven | incentivar | incentivar | v | v | postype=main|gen=c|num=p|person=3|mood=subjunctive|tense=present | postype=main|gen=c|num=p|person=3|mood=subjunctive|tense=present | 21 | 21 | S | S | Y | incentivar.a2 | _ | _ | _ | _ | | 24 | la | el | el | d | d | postype=article|gen=f|num=s | postype=article|gen=f|num=s | 25 | 25 | spec | spec | _ | _ | _ | _ | _ | _ | | 25 | competencia | competencia | competencia | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 23 | 23 | cd | cd | _ | _ | _ | _ | arg1-pat | _ | | 26 | como_puede_ser | como_puede_ser | como_puede_ser | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 21 | 23 | sp | cc | _ | _ | _ | _ | _ | _ | | 27 | abrir | abrir | abrir | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | 26 | 26 | S | S | Y | abrir.a2 | _ | _ | _ | _ | | 28 | el | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 29 | 29 | spec | spec | _ | _ | _ | _ | _ | _ | | 29 | acceso | acceso | acceso | n | n | postype=common|gen=m|num=s | postype=common|gen=m|num=s | 27 | 27 | cd | cd | _ | _ | _ | _ | _ | arg1-pat | | 30 | a | a | a | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 29 | 29 | sp | sp | _ | _ | _ | _ | _ | _ | | 31 | la | el | el | d | d | postype=article|gen=f|num=s | postype=article|gen=f|num=s | 32 | 32 | spec | spec | _ | _ | _ | _ | _ | _ | | 32 | información | información | información | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 30 | 30 | sn | sn | _ | _ | _ | _ | _ | _ | | 33 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 32 | 32 | sp | sp | _ | _ | _ | _ | _ | _ | | 34 | los | el | el | d | d | postype=article|gen=m|num=p | postype=article|gen=m|num=p | 35 | 35 | spec | spec | _ | _ | _ | _ | _ | _ | | 35 | clientes | cliente | cliente | n | n | postype=common|gen=c|num=p | postype=common|gen=c|num=p | 33 | 33 | sn | sn | _ | _ | _ | _ | _ | _ | | 36 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 35 | 35 | sp | sp | _ | _ | _ | _ | _ | _ | | 37 | Telefónica | Telefónica | telefónico | n | a | postype=proper|gen=c|num=c | postype=qualificative|gen=f|num=s | 36 | 36 | sn | sn | _ | _ | _ | _ | _ | _ | | 38 | a | a | a | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 27 | 29 | ci | sp | _ | _ | _ | _ | _ | arg2-ben | | 39 | otros | otro | otro | d | d | postype=indefinite|gen=m|num=p | postype=indefinite|gen=m|num=p | 40 | 40 | spec | spec | _ | _ | _ | _ | _ | _ | | 40 | operadores | operador | operador | n | a | postype=common|gen=m|num=p | postype=qualificative|gen=m|num=p | 38 | 38 | sn | sn | _ | _ | _ | _ | _ | _ | | 41 | . | . | . | f | f | punct=period | punct=period | 10 | 10 | f | f | _ | _ | _ | _ | _ | _ | The first sentence of the CoNLL 2009 development data: | 1 | El | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 2 | 2 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 2 | gobernante | gobernante | gobernante | n | n | postype=common|gen=m|num=s | postype=common|gen=c|num=s | 30 | 30 | suj | suj | _ | _ | _ | _ | _ | arg1-tem | _ | _ | _ | | 3 | , | , | , | f | f | punct=comma | punct=comma | 22 | 4 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 4 | con | con | con | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 22 | 2 | sp | sp | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 5 | ganada | ganado | ganar | a | v | postype=qualificative|gen=f|num=s|posfunction=participle | postype=main|gen=f|num=s|mood=pastparticiple | 6 | 6 | S | S | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 6 | fama | fama | fama | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 4 | 4 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 7 | desde | desde | desde | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 6 | 2 | sp | sp | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 8 | que | que | que | c | c | postype=subordinating | postype=subordinating | 10 | 10 | conj | conj | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 9 | _ | _ | _ | p | p | _ | _ | 10 | 10 | suj | suj | _ | _ | arg1-tem | _ | _ | _ | _ | _ | _ | | 10 | llegó | llegar | llegar | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | 7 | 7 | S | S | Y | llegar.b1 | _ | _ | _ | _ | _ | _ | _ | | 11 | hace | hacer | hacer | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=present | postype=main|gen=c|num=s|person=3|mood=indicative|tense=present | 13 | 13 | v | v | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 12 | 16 | 16 | 16 | z | n | _ | postype=proper|gen=c|num=c | 13 | 13 | z | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 13 | meses | mes | mesar | n | v | postype=common|gen=m|num=p | postype=main|gen=c|num=s|person=2|mood=subjunctive|tense=present | 10 | 10 | cc | cd | _ | _ | argM-tmp | _ | _ | _ | _ | _ | _ | | 14 | al | al | al | s | s | postype=preposition|gen=m|num=s|contracted=yes | postype=preposition|gen=m|num=s|contracted=yes | 10 | 10 | cc | cc | _ | _ | argM-loc | _ | _ | _ | _ | _ | _ | | 15 | poder | poder | poder | n | v | postype=common|gen=m|num=s | postype=main|gen=c|num=c|mood=infinitive | 14 | 14 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 16 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 6 | 10 | sp | cc | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 17 | explotar | explotar | explotar | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | 16 | 16 | S | S | Y | explotar.a2 | _ | _ | _ | _ | _ | _ | _ | | 18 | al_máximo | al_máximo | al_máximo | r | r | _ | _ | 17 | 17 | cc | cc | _ | _ | _ | argM-mnr | _ | _ | _ | _ | _ | | 19 | su | su | su | d | d | postype=possessive|gen=c|num=s|person=3 | postype=possessive|gen=c|num=s|person=3 | 20 | 20 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 20 | oratoria | oratoria | oratoria | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 17 | 17 | cd | cd | _ | _ | _ | arg1-pat | _ | _ | _ | _ | _ | | 21 | y | y | y | c | c | postype=coordinating | postype=coordinating | 22 | 10 | coord | coord | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 22 | acusado | acusado | acusar | a | v | postype=qualificative|gen=m|num=s|posfunction=participle | postype=main|gen=m|num=s|mood=pastparticiple | 2 | 10 | S | S | Y | acusado.b2 | _ | _ | _ | _ | _ | _ | _ | | 23 | por | por | por | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 22 | 22 | cag | cag | _ | _ | _ | _ | arg0-agt | _ | _ | _ | _ | | 24 | sus | su | su | d | d | postype=possessive|gen=c|num=p|person=3 | postype=possessive|gen=c|num=p|person=3 | 25 | 25 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 25 | detractores | detractor | detractor | n | n | postype=common|gen=m|num=p | postype=common|gen=m|num=p | 23 | 23 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 26 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 22 | 25 | creg | sp | _ | _ | _ | _ | arg2-null | _ | _ | _ | _ | | 27 | incontinencia | incontinencia | incontinencia | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 26 | 26 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 28 | verbal | verbal | verbal | a | a | postype=qualificative|gen=c|num=s | postype=qualificative|gen=c|num=s | 27 | 27 | s.a | s.a | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 29 | , | , | , | f | f | punct=comma | punct=comma | 22 | 2 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 30 | enmudeció | enmudecer | enmudecer | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | 0 | 0 | sentence | sentence | Y | enmudecer.b2 | _ | _ | _ | _ | _ | _ | _ | | 31 | desde | desde | desde | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 30 | 30 | cc | cc | _ | _ | _ | _ | _ | argM-tmp | _ | _ | _ | | 32 | el | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 33 | 33 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 33 | momento | momento | momento | n | n | postype=common|gen=m|num=s | postype=common|gen=m|num=s | 31 | 31 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 34 | en | en | en | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 42 | 42 | cc | cc | _ | _ | _ | _ | _ | _ | argM-tmp | _ | _ | | 35 | el | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 36 | 36 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 36 | que | que | que | p | p | postype=relative|gen=c|num=c | postype=relative|gen=c|num=c | 34 | 34 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 37 | el | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 38 | 38 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 38 | Tribunal_Supremo_de_Justicia | Tribunal_Supremo_de_Justicia | Tribunal_Supremo_de_Justicia | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | 42 | 42 | suj | suj | _ | _ | _ | _ | _ | _ | arg0-agt | _ | _ | | 39 | ( | ( | ( | f | f | punct=bracket|punctenclose=open | punct=bracket|punctenclose=open | 40 | 40 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 40 | TSJ | TSJ | TSJ | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | 38 | 38 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 41 | ) | ) | ) | f | f | punct=bracket|punctenclose=close | punct=bracket|punctenclose=close | 40 | 40 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 42 | decidió | decidir | decidir | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | 33 | 33 | S | S | Y | decidir.a2 | _ | _ | _ | _ | _ | _ | _ | | 43 | suspender | suspender | suspender | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | 42 | 42 | cd | cd | Y | suspender.a2 | _ | _ | _ | _ | arg1-pat | _ | _ | | 44 | temporalmente | temporalmente | temporalmente | r | r | _ | _ | 43 | 43 | cc | cc | _ | _ | _ | _ | _ | _ | _ | argM-tmp | _ | | 45 | los | el | el | d | d | postype=article|gen=m|num=p | postype=article|gen=m|num=p | 46 | 46 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 46 | comicios | comicio | comicio | n | n | postype=common|gen=m|num=p | postype=common|gen=m|num=p | 43 | 43 | cd | cd | _ | _ | _ | _ | _ | _ | _ | arg1-pat | _ | | 47 | múltiples | múltiple | múltiple | a | a | postype=qualificative|gen=c|num=p | postype=qualificative|gen=c|num=p | 46 | 46 | s.a | s.a | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 48 | ante | ante | ante | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 43 | 43 | cc | cc | _ | _ | _ | _ | _ | _ | _ | argM-adv | _ | | 49 | la | el | el | d | d | postype=article|gen=f|num=s | postype=article|gen=f|num=s | 50 | 50 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 50 | imposibilidad | imposibilidad | imposibilidad | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 48 | 48 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 51 | " | " | " | f | f | punct=quotation | punct=quotation | 52 | 52 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 52 | técnica | técnico | técnica | a | n | postype=qualificative|gen=f|num=s | postype=common|gen=f|num=s | 50 | 50 | s.a | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 53 | " | " | " | f | f | punct=quotation | punct=quotation | 52 | 52 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 54 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 50 | 50 | sp | sp | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 55 | celebrarlos | celebrar | celebrar | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | 54 | 54 | S | S | Y | celebrar.a2 | _ | _ | _ | _ | _ | _ | _ | | 56 | el | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 57 | 57 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ | | 57 | 28_de_mayo | 28_de_mayo | 28_de_mayo | w | w | _ | _ | 55 | 55 | cc | cc | _ | _ | _ | _ | _ | _ | _ | _ | argM-tmp | | 58 | . | . | . | f | f | punct=period | punct=period | 30 | 30 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ | The first sentence of the CoNLL 2009 test data: | 1 | Partidario | partidario | partidario | a | a | postype=qualificative|gen=m|num=s | postype=qualificative|gen=m|num=s | _ | _ | _ | _ | _ | | 2 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ | | 3 | la | el | el | d | d | postype=article|gen=f|num=s | postype=article|gen=f|num=s | _ | _ | _ | _ | _ | | 4 | " | " | " | f | f | punct=quotation | punct=quotation | _ | _ | _ | _ | _ | | 5 | perestroika | perestroika | perestroika | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | _ | _ | _ | _ | _ | | 6 | " | " | " | f | f | punct=quotation | punct=quotation | _ | _ | _ | _ | _ | | 7 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ | | 8 | Mijail_Gorbachov | Mijail_Gorbachov | Mijail_Gorbachov | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ | | 9 | en | en | en | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ | | 10 | la | el | el | d | d | postype=article|gen=f|num=s | postype=article|gen=f|num=s | _ | _ | _ | _ | _ | | 11 | Unión_Soviética | Unión_Soviética | Unión_Soviética | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ | | 12 | , | , | , | f | f | punct=comma | punct=comma | _ | _ | _ | _ | _ | | 13 | en | en | en | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ | | 14 | 1989 | 1989 | 1989 | w | n | _ | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ | | 15 | _ | _ | _ | p | p | _ | _ | _ | _ | _ | _ | _ | | 16 | entró | entrar | entrar | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | _ | _ | _ | _ | Y | | 17 | en | en | en | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ | | 18 | conflicto | conflicto | conflicto | n | n | postype=common|gen=m|num=s | postype=common|gen=m|num=s | _ | _ | _ | _ | _ | | 19 | con | con | con | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ | | 20 | Yívkov | Yívkov | Yívkov | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ | | 21 | , | , | , | f | f | punct=comma | punct=comma | _ | _ | _ | _ | _ | | 22 | líder | líder | líder | n | n | postype=common|gen=c|num=s | postype=common|gen=c|num=s | _ | _ | _ | _ | _ | | 23 | durante | durante | durante | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ | | 24 | 35 | 35 | 35 | z | n | _ | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ | | 25 | años | año | año | n | n | postype=common|gen=m|num=p | postype=common|gen=m|num=p | _ | _ | _ | _ | _ | | 26 | del | del | del | s | s | postype=preposition|gen=m|num=s|contracted=yes | postype=preposition|gen=m|num=s|contracted=yes | _ | _ | _ | _ | _ | | 27 | Partido_Comunista | Partido_Comunista | Partido_Comunista | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ | | 28 | y | y | y | c | c | postype=coordinating | postype=coordinating | _ | _ | _ | _ | _ | | 29 | del | del | del | s | s | postype=preposition|gen=m|num=s|contracted=yes | postype=preposition|gen=m|num=s|contracted=yes | _ | _ | _ | _ | _ | | 30 | Estado | Estado | estado | n | n | postype=proper|gen=c|num=c | postype=common|gen=m|num=s | _ | _ | _ | _ | _ | | 31 | búlgaro | búlgaro | búlgaro | a | a | postype=qualificative|gen=m|num=s | postype=qualificative|gen=m|num=s | _ | _ | _ | _ | _ | | 32 | , | , | , | f | f | punct=comma | punct=comma | _ | _ | _ | _ | _ | | 33 | y | y | y | c | c | postype=coordinating | postype=coordinating | _ | _ | _ | _ | _ | | 34 | _ | _ | _ | p | p | _ | _ | _ | _ | _ | _ | _ | | 35 | le | él | LE | p | n | postype=personal|gen=c|num=s|person=3|case=accusative | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ | | 36 | acusó | acusar | acusar | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | _ | _ | _ | _ | Y | | 37 | en | en | en | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ | | 38 | una | uno | uno | d | d | postype=indefinite|gen=f|num=s | postype=numeral|gen=f|num=s | _ | _ | _ | _ | _ | | 39 | carta | carta | carta | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | _ | _ | _ | _ | _ | | 40 | abierta | abierto | abrir | a | v | postype=qualificative|gen=f|num=s|posfunction=participle | postype=main|gen=f|num=s|mood=pastparticiple | _ | _ | _ | _ | _ | | 41 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ | | 42 | utilizar | utilizar | utilizar | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | _ | _ | _ | _ | Y | | 43 | métodos | método | método | n | n | postype=common|gen=m|num=p | postype=common|gen=m|num=p | _ | _ | _ | _ | _ | | 44 | poco | poco | poco | r | r | _ | _ | _ | _ | _ | _ | _ | | 45 | democráticos | democrático | democrático | a | a | postype=qualificative|gen=m|num=p | postype=qualificative|gen=m|num=p | _ | _ | _ | _ | _ | | 46 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ | | 47 | gobierno | gobierno | Gobierno | n | n | postype=common|gen=m|num=s | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ | | 48 | . | . | . | f | f | punct=period | punct=period | _ | _ | _ | _ | _ | ==== Parsing ==== In the CoNLL 2009 version, there are no nonprojectivities at all. The results of the CoNLL 2006 shared task are [[http://ilk.uvt.nl/conll/results.html|available online]]. They have been published in [[http://aclweb.org/anthology-new/W/W06/W06-2920.pdf|(Buchholz and Marsi, 2006)]]. The evaluation procedure was non-standard because it excluded punctuation tokens. These are the best results for Spanish: ^ Parser (Authors) ^ LAS ^ UAS ^ | MST (McDonald et al.) | 82.25 | 86.05 | | Nara (Cheng) | 80.46 | 85.15 | | Microsoft (Corston-Oliver and Aue) | 80.36 | 84.87 | | Malt (Nivre et al.) | 81.29 | 84.67 | | Basis (O'Neil) | 79.82 | 84.11 | The results of the CoNLL 2009 shared task are [[http://ufal.mff.cuni.cz/conll2009-st/results/results.php|available online]]. They have been published in [[http://aclweb.org/anthology/W/W09/W09-1201.pdf|(Hajič et al., 2009)]]. Unlabeled attachment score was not published. These are the best results for Spanish: ^ Parser (Authors) ^ LAS ^ | Merlo | 87.64 | | Che | 87.33 | | Bohnet | 87.19 | | Chen | 86.29 |