There is one treebank versions of which were known in different times under different names:
The dependency treebank Cast3LB was extracted automatically from an earlier constituent-based annotation (see Montserrat Civit, Ma. Antònia Martí, Núria Bufí: Cat3LB and Cast3LB: From Constituents to Dependencies. In: T. Salakoski et al. (eds.): FinTAL 2006, LNAI 4139, pp. 141–152, 2006, Springer, Berlin / Heidelberg)
The AnCora-ES corpus ought to be freely downloadable from its website. The download will not work for unregistered and not signed in users. The website offers creating new account but it is not automatic, one has to wait for approval.
Republication of the two CoNLL versions in LDC is planned but it has not happenned yet.
The CoNLL 2006 license in short:
AnCora-ES was created by members of the Centre de Llenguatge i Computació (CLiC), Universitat de Barcelona, Gran Via de les Corts Catalanes 585, E-08007 Barcelona, Spain, together with Universidad de Alicante (UA), Universitat Politècnica de Catalunya (UPC), Universitat Politècnica de Valencia (UPV) and Euskal Herriko Unibertsitatea (EHU-UPV).
Mostly newswire (EFE news, El Periódico, 2000).
The CoNLL 2006 version contains 95028 tokens in 3512 sentences, yielding 27.06 tokens per sentence on average (CoNLL 2006 data split: 89334 tokens / 3306 sentences training, 5694 tokens / 206 sentences test).
The CoNLL 2009 version contains 528,440 tokens in 17709 sentences, yielding 29.84 tokens per sentence on average (CoNLL 2009 data split: 427,442 tokens / 14329 sentences training, 50368 tokens / 1655 sentences development, 50630 tokens / 1725 sentences test).
The original morphosyntactic tags (EAGLES?) have been converted to fit into the three columns (CPOS, POS and FEAT) columns of the CoNLL 2006/7 format, resp. the two columns (POS and FEAT) of the CoNLL 2009 format. Note that the missing CPOS column is not the only difference between the two conversion schemes. Feature names and values in the FEAT column are different, too.
The morphosyntactic tags have been disambiguated manually. The CoNLL 2009 version also contains automatically disambiguated tags.
Multi-word expressions have been collapsed into one token, using underscore as the joining character. This includes named entities (e.g. Comunidad_de_Madrid, Unión_Europea, Alejandro_Couceiro, 1,8_por_ciento, siglo_XXI), prepositional compounds and others (como_puede_ser, a_fondo, a_partir_de). Empty (underscore) tokens have been inserted to represent missing subjects (Spanish is a pro-drop language).
The first sentence of the CoNLL 2006 training data:
1 | El | el | d | da | gen=m|num=s | 2 | _ | 2 | _ |
2 | presidente | presidente | n | nc | gen=m|num=s | 10 | SUJ | 10 | SUJ |
3 | del | del | s | sp | gen=m|num=s|for=c | 2 | _ | 2 | _ |
4 | órgano | órgano | n | nc | gen=m|num=s | 3 | _ | 3 | _ |
5 | regulador | regulador | a | aq | gen=m|num=s | 4 | _ | 4 | _ |
6 | de | de | s | sp | for=s | 4 | _ | 4 | _ |
7 | las | el | d | da | gen=f|num=p | 8 | _ | 8 | _ |
8 | Telecomunicaciones | Telecomunicaciones | n | np | _ | 6 | _ | 6 | _ |
9 | se | él | p | p0 | per=3 | 10 | _ | 10 | _ |
10 | mostró | mostrar | v | vm | num=s|per=3|mod=i|tmp=s | 0 | ROOT | 0 | ROOT |
11 | partidario | partidario | a | aq | gen=m|num=s | 10 | CPRED.SUJ | 10 | CPRED.SUJ |
12 | de | de | s | sp | for=s | 11 | _ | 11 | _ |
13 | completar | completar | v | vm | mod=n | 19 | _ | 19 | _ |
14 | esta | este | d | dd | num=s|gen=f | 15 | _ | 15 | _ |
15 | liberalización | liberalización | n | nc | num=s|gen=f | 19 | CD | 19 | CD |
16 | de | de | s | sp | for=s | 15 | _ | 15 | _ |
17 | las | el | d | da | gen=f|num=p | 18 | _ | 18 | _ |
18 | telecomunicaciones | telecomunicación | n | nc | gen=f|num=p | 16 | _ | 16 | _ |
19 | con | con | s | sp | for=s | 12 | _ | 12 | _ |
20 | otras | otro | d | di | gen=f|num=p | 21 | _ | 21 | _ |
21 | medidas | medida | n | nc | gen=f|num=p | 19 | _ | 19 | _ |
22 | que | que | p | pr | gen=c|num=n | 23 | SUJ | 23 | SUJ |
23 | incentiven | incentivar | v | vm | num=p|per=3|mod=s|tmp=p | 21 | _ | 21 | _ |
24 | la | el | d | da | num=s|gen=f | 25 | _ | 25 | _ |
25 | competencia | competencia | n | nc | num=s|gen=f | 23 | CD | 23 | CD |
26 | como_puede_ser | como_puede_ser | s | sp | for=s | 21 | _ | 21 | _ |
27 | abrir | abrir | v | vm | mod=n | 38 | _ | 38 | _ |
28 | el | el | d | da | gen=m|num=s | 29 | _ | 29 | _ |
29 | acceso | acceso | n | nc | gen=m|num=s | 38 | CD | 38 | CD |
30 | a | a | s | sp | for=s | 29 | _ | 29 | _ |
31 | la | el | d | da | num=s|gen=f | 32 | _ | 32 | _ |
32 | información | información | n | nc | num=s|gen=f | 30 | _ | 30 | _ |
33 | de | de | s | sp | for=s | 32 | _ | 32 | _ |
34 | los | el | d | da | gen=m|num=p | 35 | _ | 35 | _ |
35 | clientes | cliente | n | nc | num=p|gen=c | 33 | _ | 33 | _ |
36 | de | de | s | sp | for=s | 35 | _ | 35 | _ |
37 | Telefónica | Telefónica | n | np | _ | 36 | _ | 36 | _ |
38 | a | a | s | sp | for=s | 26 | _ | 26 | _ |
39 | otros | otro | d | di | gen=m|num=p | 40 | _ | 40 | _ |
40 | operadores | operador | n | nc | gen=m|num=p | 38 | _ | 38 | _ |
41 | . | . | F | Fp | _ | 10 | PUNC | 10 | PUNC |
The first sentence of the CoNLL 2006 test data:
1 | Las | el | d | da | gen=f|num=p | 2 | _ | 2 | _ |
2 | reservas | reserva | n | nc | gen=f|num=p | 9 | SUJ | 9 | SUJ |
3 | de | de | s | sp | for=s | 2 | _ | 2 | _ |
4 | oro | oro | n | nc | gen=m|num=s | 3 | _ | 3 | _ |
5 | y | y | c | cc | _ | 4 | _ | 4 | _ |
6 | divisas | divisa | n | nc | gen=f|num=p | 4 | _ | 4 | _ |
7 | de | de | s | sp | for=s | 2 | _ | 2 | _ |
8 | Rusia | Rusia | n | np | _ | 7 | _ | 7 | _ |
9 | subieron | subir | v | vm | num=p|mod=i|tmp=s|per=3 | 18 | CD | 18 | CD |
10 | 800_millones_de_dólares | 800000000_dólar | Z | Zm | _ | 9 | CC | 9 | CC |
11 | y | y | c | cc | _ | 9 | _ | 9 | _ |
12 | el | el | d | da | gen=m|num=s | 13 | _ | 13 | _ |
13 | 26_de_mayo | 26_de_mayo | w | w | _ | 14 | CC | 14 | CC |
14 | equivalían | equivaler | v | vm | num=p|mod=i|per=3|tmp=i | 9 | _ | 9 | _ |
15 | a | a | s | sp | for=s | 14 | CREG | 14 | CREG |
16 | 19.100_millones_de_dólares | 19100000000_dólar | Z | Zm | _ | 15 | _ | 15 | _ |
17 | , | , | F | Fc | _ | 9 | PUNC | 9 | PUNC |
18 | informó | informar | v | vm | num=s|mod=i|tmp=s|per=3 | 0 | ROOT | 0 | ROOT |
19 | hoy | hoy | r | rg | _ | 18 | CC | 18 | CC |
20 | un | uno | d | di | gen=m|num=s | 21 | _ | 21 | _ |
21 | comunicado | comunicado | n | nc | gen=m|num=s | 18 | SUJ | 18 | SUJ |
22 | del | del | s | sp | gen=m|num=s|for=c | 21 | _ | 21 | _ |
23 | Banco_Central | Banco_Central | n | np | _ | 22 | _ | 22 | _ |
24 | . | . | F | Fp | _ | 18 | PUNC | 18 | PUNC |
The first sentence of the CoNLL 2009 training data:
1 | El | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 2 | 2 | spec | spec | _ | _ | _ | _ | _ | _ |
2 | presidente | presidente | presidente | n | n | postype=common|gen=m|num=s | postype=common|gen=m|num=s | 10 | 10 | suj | suj | _ | _ | arg1-tem | _ | _ | _ |
3 | del | del | del | s | s | postype=preposition|gen=m|num=s|contracted=yes | postype=preposition|gen=m|num=s|contracted=yes | 2 | 2 | sp | sp | _ | _ | _ | _ | _ | _ |
4 | órgano | órgano | órgano | n | n | postype=common|gen=m|num=s | postype=common|gen=m|num=s | 3 | 3 | sn | sn | _ | _ | _ | _ | _ | _ |
5 | regulador | regulador | regulador | a | a | postype=qualificative|gen=m|num=s | postype=qualificative|gen=m|num=s | 4 | 4 | s.a | s.a | _ | _ | _ | _ | _ | _ |
6 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 4 | 4 | sp | sp | _ | _ | _ | _ | _ | _ |
7 | las | el | el | d | d | postype=article|gen=f|num=p | postype=article|gen=f|num=p | 8 | 8 | spec | spec | _ | _ | _ | _ | _ | _ |
8 | Telecomunicaciones | Telecomunicaciones | telecomunicación | n | n | postype=proper|gen=c|num=c | postype=common|gen=f|num=p | 6 | 6 | sn | sn | _ | _ | _ | _ | _ | _ |
9 | se | él | se | p | p | gen=c|num=c|person=3 | gen=c|num=c | 10 | 10 | morfema.pronominal | morfema.pronominal | _ | _ | _ | _ | _ | _ |
10 | mostró | mostrar | mostrar | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | 0 | 0 | sentence | sentence | Y | mostrar.c2 | _ | _ | _ | _ |
11 | partidario | partidario | partidario | a | a | postype=qualificative|gen=m|num=s | postype=qualificative|gen=m|num=s | 10 | 10 | cpred | cpred | _ | _ | arg2-atr | _ | _ | _ |
12 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 11 | 11 | sp | sp | _ | _ | _ | _ | _ | _ |
13 | completar | completar | completar | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | 12 | 12 | S | S | Y | completar.a2 | _ | _ | _ | _ |
14 | esta | este | este | d | d | postype=demonstrative|gen=f|num=s | postype=demonstrative|gen=f|num=s | 15 | 15 | spec | spec | _ | _ | _ | _ | _ | _ |
15 | liberalización | liberalización | liberalización | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 13 | 13 | cd | cd | _ | _ | _ | arg1-pat | _ | _ |
16 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 15 | 15 | sp | sp | _ | _ | _ | _ | _ | _ |
17 | las | el | el | d | d | postype=article|gen=f|num=p | postype=article|gen=f|num=p | 18 | 18 | spec | spec | _ | _ | _ | _ | _ | _ |
18 | telecomunicaciones | telecomunicación | telecomunicación | n | n | postype=common|gen=f|num=p | postype=common|gen=f|num=p | 16 | 16 | sn | sn | _ | _ | _ | _ | _ | _ |
19 | con | con | con | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 13 | 13 | cc | cc | _ | _ | _ | argM-adv | _ | _ |
20 | otras | otro | otro | d | d | postype=indefinite|gen=f|num=p | postype=indefinite|gen=f|num=p | 21 | 21 | spec | spec | _ | _ | _ | _ | _ | _ |
21 | medidas | medida | medida | n | n | postype=common|gen=f|num=p | postype=common|gen=f|num=p | 19 | 19 | sn | sn | _ | _ | _ | _ | _ | _ |
22 | que | que | que | p | p | postype=relative|gen=c|num=c | postype=relative|gen=c|num=c | 23 | 23 | suj | suj | _ | _ | _ | _ | arg0-agt | _ |
23 | incentiven | incentivar | incentivar | v | v | postype=main|gen=c|num=p|person=3|mood=subjunctive|tense=present | postype=main|gen=c|num=p|person=3|mood=subjunctive|tense=present | 21 | 21 | S | S | Y | incentivar.a2 | _ | _ | _ | _ |
24 | la | el | el | d | d | postype=article|gen=f|num=s | postype=article|gen=f|num=s | 25 | 25 | spec | spec | _ | _ | _ | _ | _ | _ |
25 | competencia | competencia | competencia | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 23 | 23 | cd | cd | _ | _ | _ | _ | arg1-pat | _ |
26 | como_puede_ser | como_puede_ser | como_puede_ser | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 21 | 23 | sp | cc | _ | _ | _ | _ | _ | _ |
27 | abrir | abrir | abrir | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | 26 | 26 | S | S | Y | abrir.a2 | _ | _ | _ | _ |
28 | el | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 29 | 29 | spec | spec | _ | _ | _ | _ | _ | _ |
29 | acceso | acceso | acceso | n | n | postype=common|gen=m|num=s | postype=common|gen=m|num=s | 27 | 27 | cd | cd | _ | _ | _ | _ | _ | arg1-pat |
30 | a | a | a | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 29 | 29 | sp | sp | _ | _ | _ | _ | _ | _ |
31 | la | el | el | d | d | postype=article|gen=f|num=s | postype=article|gen=f|num=s | 32 | 32 | spec | spec | _ | _ | _ | _ | _ | _ |
32 | información | información | información | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 30 | 30 | sn | sn | _ | _ | _ | _ | _ | _ |
33 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 32 | 32 | sp | sp | _ | _ | _ | _ | _ | _ |
34 | los | el | el | d | d | postype=article|gen=m|num=p | postype=article|gen=m|num=p | 35 | 35 | spec | spec | _ | _ | _ | _ | _ | _ |
35 | clientes | cliente | cliente | n | n | postype=common|gen=c|num=p | postype=common|gen=c|num=p | 33 | 33 | sn | sn | _ | _ | _ | _ | _ | _ |
36 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 35 | 35 | sp | sp | _ | _ | _ | _ | _ | _ |
37 | Telefónica | Telefónica | telefónico | n | a | postype=proper|gen=c|num=c | postype=qualificative|gen=f|num=s | 36 | 36 | sn | sn | _ | _ | _ | _ | _ | _ |
38 | a | a | a | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 27 | 29 | ci | sp | _ | _ | _ | _ | _ | arg2-ben |
39 | otros | otro | otro | d | d | postype=indefinite|gen=m|num=p | postype=indefinite|gen=m|num=p | 40 | 40 | spec | spec | _ | _ | _ | _ | _ | _ |
40 | operadores | operador | operador | n | a | postype=common|gen=m|num=p | postype=qualificative|gen=m|num=p | 38 | 38 | sn | sn | _ | _ | _ | _ | _ | _ |
41 | . | . | . | f | f | punct=period | punct=period | 10 | 10 | f | f | _ | _ | _ | _ | _ | _ |
The first sentence of the CoNLL 2009 development data:
1 | El | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 2 | 2 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ |
2 | gobernante | gobernante | gobernante | n | n | postype=common|gen=m|num=s | postype=common|gen=c|num=s | 30 | 30 | suj | suj | _ | _ | _ | _ | _ | arg1-tem | _ | _ | _ |
3 | , | , | , | f | f | punct=comma | punct=comma | 22 | 4 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ |
4 | con | con | con | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 22 | 2 | sp | sp | _ | _ | _ | _ | _ | _ | _ | _ | _ |
5 | ganada | ganado | ganar | a | v | postype=qualificative|gen=f|num=s|posfunction=participle | postype=main|gen=f|num=s|mood=pastparticiple | 6 | 6 | S | S | _ | _ | _ | _ | _ | _ | _ | _ | _ |
6 | fama | fama | fama | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 4 | 4 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ |
7 | desde | desde | desde | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 6 | 2 | sp | sp | _ | _ | _ | _ | _ | _ | _ | _ | _ |
8 | que | que | que | c | c | postype=subordinating | postype=subordinating | 10 | 10 | conj | conj | _ | _ | _ | _ | _ | _ | _ | _ | _ |
9 | _ | _ | _ | p | p | _ | _ | 10 | 10 | suj | suj | _ | _ | arg1-tem | _ | _ | _ | _ | _ | _ |
10 | llegó | llegar | llegar | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | 7 | 7 | S | S | Y | llegar.b1 | _ | _ | _ | _ | _ | _ | _ |
11 | hace | hacer | hacer | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=present | postype=main|gen=c|num=s|person=3|mood=indicative|tense=present | 13 | 13 | v | v | _ | _ | _ | _ | _ | _ | _ | _ | _ |
12 | 16 | 16 | 16 | z | n | _ | postype=proper|gen=c|num=c | 13 | 13 | z | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ |
13 | meses | mes | mesar | n | v | postype=common|gen=m|num=p | postype=main|gen=c|num=s|person=2|mood=subjunctive|tense=present | 10 | 10 | cc | cd | _ | _ | argM-tmp | _ | _ | _ | _ | _ | _ |
14 | al | al | al | s | s | postype=preposition|gen=m|num=s|contracted=yes | postype=preposition|gen=m|num=s|contracted=yes | 10 | 10 | cc | cc | _ | _ | argM-loc | _ | _ | _ | _ | _ | _ |
15 | poder | poder | poder | n | v | postype=common|gen=m|num=s | postype=main|gen=c|num=c|mood=infinitive | 14 | 14 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ |
16 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 6 | 10 | sp | cc | _ | _ | _ | _ | _ | _ | _ | _ | _ |
17 | explotar | explotar | explotar | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | 16 | 16 | S | S | Y | explotar.a2 | _ | _ | _ | _ | _ | _ | _ |
18 | al_máximo | al_máximo | al_máximo | r | r | _ | _ | 17 | 17 | cc | cc | _ | _ | _ | argM-mnr | _ | _ | _ | _ | _ |
19 | su | su | su | d | d | postype=possessive|gen=c|num=s|person=3 | postype=possessive|gen=c|num=s|person=3 | 20 | 20 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ |
20 | oratoria | oratoria | oratoria | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 17 | 17 | cd | cd | _ | _ | _ | arg1-pat | _ | _ | _ | _ | _ |
21 | y | y | y | c | c | postype=coordinating | postype=coordinating | 22 | 10 | coord | coord | _ | _ | _ | _ | _ | _ | _ | _ | _ |
22 | acusado | acusado | acusar | a | v | postype=qualificative|gen=m|num=s|posfunction=participle | postype=main|gen=m|num=s|mood=pastparticiple | 2 | 10 | S | S | Y | acusado.b2 | _ | _ | _ | _ | _ | _ | _ |
23 | por | por | por | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 22 | 22 | cag | cag | _ | _ | _ | _ | arg0-agt | _ | _ | _ | _ |
24 | sus | su | su | d | d | postype=possessive|gen=c|num=p|person=3 | postype=possessive|gen=c|num=p|person=3 | 25 | 25 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ |
25 | detractores | detractor | detractor | n | n | postype=common|gen=m|num=p | postype=common|gen=m|num=p | 23 | 23 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ |
26 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 22 | 25 | creg | sp | _ | _ | _ | _ | arg2-null | _ | _ | _ | _ |
27 | incontinencia | incontinencia | incontinencia | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 26 | 26 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ |
28 | verbal | verbal | verbal | a | a | postype=qualificative|gen=c|num=s | postype=qualificative|gen=c|num=s | 27 | 27 | s.a | s.a | _ | _ | _ | _ | _ | _ | _ | _ | _ |
29 | , | , | , | f | f | punct=comma | punct=comma | 22 | 2 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ |
30 | enmudeció | enmudecer | enmudecer | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | 0 | 0 | sentence | sentence | Y | enmudecer.b2 | _ | _ | _ | _ | _ | _ | _ |
31 | desde | desde | desde | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 30 | 30 | cc | cc | _ | _ | _ | _ | _ | argM-tmp | _ | _ | _ |
32 | el | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 33 | 33 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ |
33 | momento | momento | momento | n | n | postype=common|gen=m|num=s | postype=common|gen=m|num=s | 31 | 31 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ |
34 | en | en | en | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 42 | 42 | cc | cc | _ | _ | _ | _ | _ | _ | argM-tmp | _ | _ |
35 | el | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 36 | 36 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ |
36 | que | que | que | p | p | postype=relative|gen=c|num=c | postype=relative|gen=c|num=c | 34 | 34 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ |
37 | el | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 38 | 38 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ |
38 | Tribunal_Supremo_de_Justicia | Tribunal_Supremo_de_Justicia | Tribunal_Supremo_de_Justicia | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | 42 | 42 | suj | suj | _ | _ | _ | _ | _ | _ | arg0-agt | _ | _ |
39 | ( | ( | ( | f | f | punct=bracket|punctenclose=open | punct=bracket|punctenclose=open | 40 | 40 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ |
40 | TSJ | TSJ | TSJ | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | 38 | 38 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ |
41 | ) | ) | ) | f | f | punct=bracket|punctenclose=close | punct=bracket|punctenclose=close | 40 | 40 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ |
42 | decidió | decidir | decidir | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | 33 | 33 | S | S | Y | decidir.a2 | _ | _ | _ | _ | _ | _ | _ |
43 | suspender | suspender | suspender | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | 42 | 42 | cd | cd | Y | suspender.a2 | _ | _ | _ | _ | arg1-pat | _ | _ |
44 | temporalmente | temporalmente | temporalmente | r | r | _ | _ | 43 | 43 | cc | cc | _ | _ | _ | _ | _ | _ | _ | argM-tmp | _ |
45 | los | el | el | d | d | postype=article|gen=m|num=p | postype=article|gen=m|num=p | 46 | 46 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ |
46 | comicios | comicio | comicio | n | n | postype=common|gen=m|num=p | postype=common|gen=m|num=p | 43 | 43 | cd | cd | _ | _ | _ | _ | _ | _ | _ | arg1-pat | _ |
47 | múltiples | múltiple | múltiple | a | a | postype=qualificative|gen=c|num=p | postype=qualificative|gen=c|num=p | 46 | 46 | s.a | s.a | _ | _ | _ | _ | _ | _ | _ | _ | _ |
48 | ante | ante | ante | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 43 | 43 | cc | cc | _ | _ | _ | _ | _ | _ | _ | argM-adv | _ |
49 | la | el | el | d | d | postype=article|gen=f|num=s | postype=article|gen=f|num=s | 50 | 50 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ |
50 | imposibilidad | imposibilidad | imposibilidad | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | 48 | 48 | sn | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ |
51 | “ | ” | “ | f | f | punct=quotation | punct=quotation | 52 | 52 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ |
52 | técnica | técnico | técnica | a | n | postype=qualificative|gen=f|num=s | postype=common|gen=f|num=s | 50 | 50 | s.a | sn | _ | _ | _ | _ | _ | _ | _ | _ | _ |
53 | ” | “ | ” | f | f | punct=quotation | punct=quotation | 52 | 52 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ |
54 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | 50 | 50 | sp | sp | _ | _ | _ | _ | _ | _ | _ | _ | _ |
55 | celebrarlos | celebrar | celebrar | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | 54 | 54 | S | S | Y | celebrar.a2 | _ | _ | _ | _ | _ | _ | _ |
56 | el | el | el | d | d | postype=article|gen=m|num=s | postype=article|gen=m|num=s | 57 | 57 | spec | spec | _ | _ | _ | _ | _ | _ | _ | _ | _ |
57 | 28_de_mayo | 28_de_mayo | 28_de_mayo | w | w | _ | _ | 55 | 55 | cc | cc | _ | _ | _ | _ | _ | _ | _ | _ | argM-tmp |
58 | . | . | . | f | f | punct=period | punct=period | 30 | 30 | f | f | _ | _ | _ | _ | _ | _ | _ | _ | _ |
The first sentence of the CoNLL 2009 test data:
1 | Partidario | partidario | partidario | a | a | postype=qualificative|gen=m|num=s | postype=qualificative|gen=m|num=s | _ | _ | _ | _ | _ |
2 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ |
3 | la | el | el | d | d | postype=article|gen=f|num=s | postype=article|gen=f|num=s | _ | _ | _ | _ | _ |
4 | “ | ” | “ | f | f | punct=quotation | punct=quotation | _ | _ | _ | _ | _ |
5 | perestroika | perestroika | perestroika | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | _ | _ | _ | _ | _ |
6 | ” | “ | ” | f | f | punct=quotation | punct=quotation | _ | _ | _ | _ | _ |
7 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ |
8 | Mijail_Gorbachov | Mijail_Gorbachov | Mijail_Gorbachov | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ |
9 | en | en | en | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ |
10 | la | el | el | d | d | postype=article|gen=f|num=s | postype=article|gen=f|num=s | _ | _ | _ | _ | _ |
11 | Unión_Soviética | Unión_Soviética | Unión_Soviética | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ |
12 | , | , | , | f | f | punct=comma | punct=comma | _ | _ | _ | _ | _ |
13 | en | en | en | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ |
14 | 1989 | 1989 | 1989 | w | n | _ | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ |
15 | _ | _ | _ | p | p | _ | _ | _ | _ | _ | _ | _ |
16 | entró | entrar | entrar | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | _ | _ | _ | _ | Y |
17 | en | en | en | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ |
18 | conflicto | conflicto | conflicto | n | n | postype=common|gen=m|num=s | postype=common|gen=m|num=s | _ | _ | _ | _ | _ |
19 | con | con | con | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ |
20 | Yívkov | Yívkov | Yívkov | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ |
21 | , | , | , | f | f | punct=comma | punct=comma | _ | _ | _ | _ | _ |
22 | líder | líder | líder | n | n | postype=common|gen=c|num=s | postype=common|gen=c|num=s | _ | _ | _ | _ | _ |
23 | durante | durante | durante | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ |
24 | 35 | 35 | 35 | z | n | _ | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ |
25 | años | año | año | n | n | postype=common|gen=m|num=p | postype=common|gen=m|num=p | _ | _ | _ | _ | _ |
26 | del | del | del | s | s | postype=preposition|gen=m|num=s|contracted=yes | postype=preposition|gen=m|num=s|contracted=yes | _ | _ | _ | _ | _ |
27 | Partido_Comunista | Partido_Comunista | Partido_Comunista | n | n | postype=proper|gen=c|num=c | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ |
28 | y | y | y | c | c | postype=coordinating | postype=coordinating | _ | _ | _ | _ | _ |
29 | del | del | del | s | s | postype=preposition|gen=m|num=s|contracted=yes | postype=preposition|gen=m|num=s|contracted=yes | _ | _ | _ | _ | _ |
30 | Estado | Estado | estado | n | n | postype=proper|gen=c|num=c | postype=common|gen=m|num=s | _ | _ | _ | _ | _ |
31 | búlgaro | búlgaro | búlgaro | a | a | postype=qualificative|gen=m|num=s | postype=qualificative|gen=m|num=s | _ | _ | _ | _ | _ |
32 | , | , | , | f | f | punct=comma | punct=comma | _ | _ | _ | _ | _ |
33 | y | y | y | c | c | postype=coordinating | postype=coordinating | _ | _ | _ | _ | _ |
34 | _ | _ | _ | p | p | _ | _ | _ | _ | _ | _ | _ |
35 | le | él | LE | p | n | postype=personal|gen=c|num=s|person=3|case=accusative | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ |
36 | acusó | acusar | acusar | v | v | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | postype=main|gen=c|num=s|person=3|mood=indicative|tense=past | _ | _ | _ | _ | Y |
37 | en | en | en | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ |
38 | una | uno | uno | d | d | postype=indefinite|gen=f|num=s | postype=numeral|gen=f|num=s | _ | _ | _ | _ | _ |
39 | carta | carta | carta | n | n | postype=common|gen=f|num=s | postype=common|gen=f|num=s | _ | _ | _ | _ | _ |
40 | abierta | abierto | abrir | a | v | postype=qualificative|gen=f|num=s|posfunction=participle | postype=main|gen=f|num=s|mood=pastparticiple | _ | _ | _ | _ | _ |
41 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ |
42 | utilizar | utilizar | utilizar | v | v | postype=main|gen=c|num=c|mood=infinitive | postype=main|gen=c|num=c|mood=infinitive | _ | _ | _ | _ | Y |
43 | métodos | método | método | n | n | postype=common|gen=m|num=p | postype=common|gen=m|num=p | _ | _ | _ | _ | _ |
44 | poco | poco | poco | r | r | _ | _ | _ | _ | _ | _ | _ |
45 | democráticos | democrático | democrático | a | a | postype=qualificative|gen=m|num=p | postype=qualificative|gen=m|num=p | _ | _ | _ | _ | _ |
46 | de | de | de | s | s | postype=preposition|gen=c|num=c | postype=preposition|gen=c|num=c | _ | _ | _ | _ | _ |
47 | gobierno | gobierno | Gobierno | n | n | postype=common|gen=m|num=s | postype=proper|gen=c|num=c | _ | _ | _ | _ | _ |
48 | . | . | . | f | f | punct=period | punct=period | _ | _ | _ | _ | _ |
In the CoNLL 2009 version, there are no nonprojectivities at all.
The results of the CoNLL 2006 shared task are available online. They have been published in (Buchholz and Marsi, 2006). The evaluation procedure was non-standard because it excluded punctuation tokens. These are the best results for Spanish:
Parser (Authors) | LAS | UAS |
---|---|---|
MST (McDonald et al.) | 82.25 | 86.05 |
Nara (Cheng) | 80.46 | 85.15 |
Microsoft (Corston-Oliver and Aue) | 80.36 | 84.87 |
Malt (Nivre et al.) | 81.29 | 84.67 |
Basis (O'Neil) | 79.82 | 84.11 |
The results of the CoNLL 2009 shared task are available online. They have been published in (Hajič et al., 2009). Unlabeled attachment score was not published. These are the best results for Spanish:
Parser (Authors) | LAS |
---|---|
Merlo | 87.64 |
Che | 87.33 |
Bohnet | 87.19 |
Chen | 86.29 |