Differences
This shows you the differences between two versions of the page.
user:zeman:treebanks:fa [2012/01/28 18:45] zeman created |
user:zeman:treebanks:fa [2012/01/28 23:04] zeman Some more changes. |
||
---|---|---|---|
Line 10: | Line 10: | ||
==== Obtaining and License ==== | ==== Obtaining and License ==== | ||
- | The treebank is available for free after completing the [[http:// | + | The treebank is available for free after completing the [[http:// |
* non-commercial research usage | * non-commercial research usage | ||
Line 16: | Line 16: | ||
* citation of publications not specified | * citation of publications not specified | ||
- | PDT was created by members of the [[http:// | + | PDT was created by members of the [[http:// |
==== References ==== | ==== References ==== | ||
* Website | * Website | ||
- | * http://www.bultreebank.org/indexBTB.html | + | * http://dadegan.ir/en/ |
* Data | * Data | ||
* //no separate citation// | * //no separate citation// | ||
* Principal publications | * Principal publications | ||
- | * Kiril Simov, Petya Osenova, Alexander Simov, Milen Kouylekov: //Design and Implementation of the Bulgarian HPSG-based Treebank.// In: Erhard Hinrichs, Kiril Simov (eds.): Journal of Research on Language and Computation, Special Issue, vol. 2, no. 4, pp. 495–522, Kluwer Academic Publishers, ISSN 1570-7075. 2004. | + | * Mohammad Sadegh Rasooli, Amirsaeid Moloodi, Manouchehr Kouhestani, Behrouz Minaei-Bidgoli: |
* Documentation | * Documentation | ||
- | * Kiril Simov, Petya Osenova, Milena Slavcheva: [[http://www.bultreebank.org/TechRep/BTB-TR03.pdf|BTB-TR03: | + | * //none so far// |
- | * Petya Osenova, Kiril Simov: [[http:// | + | |
- | * http:// | + | |
==== Domain ==== | ==== Domain ==== | ||
- | Unknown | + | Unknown. |
==== Size ==== | ==== Size ==== | ||
- | The CoNLL 2006 version contains 196,151 tokens in 13221 sentences, yielding 14.84 tokens per sentence on average (CoNLL 2006 data split: 190,217 tokens / 12823 sentences training, 5934 tokens / 398 sentences test). | + | Unknown. |
==== Inside ==== | ==== Inside ==== | ||
Line 48: | Line 46: | ||
==== Sample ==== | ==== Sample ==== | ||
- | |||
- | The first three sentences of the CoNLL 2006 training data: | ||
- | |||
- | | 1 | Глава | _ | N | Nc | _ | 0 | ROOT | 0 | ROOT | | ||
- | | 2 | трета | _ | M | Mo | gen=f< | ||
- | | |||||||||| | ||
- | | 1 | НАРОДНО | _ | A | An | gen=n< | ||
- | | 2 | СЪБРАНИЕ | _ | N | Nc | gen=n< | ||
- | | |||||||||| | ||
- | | 1 | Народното | _ | A | An | gen=n< | ||
- | | 2 | събрание | _ | N | Nc | gen=n< | ||
- | | 3 | осъществява | _ | V | Vpi | trans=t< | ||
- | | 4 | законодателната | _ | A | Af | gen=f< | ||
- | | 5 | власт | _ | N | Nc | _ | 3 | obj | 3 | obj | | ||
- | | 6 | и | _ | C | Cp | _ | 3 | conj | 3 | conj | | ||
- | | 7 | упражнява | _ | V | Vpi | trans=t< | ||
- | | 8 | парламентарен | _ | A | Am | gen=m< | ||
- | | 9 | контрол | _ | N | Nc | gen=m< | ||
- | | 10 | . | _ | Punct | Punct | _ | 3 | punct | 3 | punct | | ||
- | |||
- | The first three sentences of the CoNLL 2006 test data: | ||
- | |||
- | | 1 | Единственото | _ | A | An | gen=n< | ||
- | | 2 | решение | _ | N | Nc | gen=n< | ||
- | | |||||||||| | ||
- | | 1 | Ерик | _ | N | Np | gen=m< | ||
- | | 2 | Франк | _ | N | Np | gen=m< | ||
- | | 3 | Ръсел | _ | H | Hm | gen=m< | ||
- | | |||||||||| | ||
- | | 1 | Пълен | _ | A | Am | gen=m< | ||
- | | 2 | мрак | _ | N | Nc | gen=m< | ||
- | | 3 | и | _ | C | Cp | _ | 2 | conj | 2 | conj | | ||
- | | 4 | пълна | _ | A | Af | gen=f< | ||
- | | 5 | самота | _ | N | Nc | _ | 2 | conjarg | 2 | conjarg | | ||
- | | 6 | . | _ | Punct | Punct | _ | 2 | punct | 2 | punct | | ||
==== Parsing ==== | ==== Parsing ==== |