Next revision
|
Previous revision
Next revision
Both sides next revision
|
user:zeman:treebanks:ta [2012/03/22 10:35] zeman vytvořeno |
user:zeman:treebanks:ta [2012/03/22 10:43] zeman Links to publications. |
* //no separate citation// | * //no separate citation// |
* Principal publications | * Principal publications |
* Loganathan Ramasamy, Zdeněk Žabokrtský: Tamil Dependency Parsing: Results using Rule Based and Corpus Based Approaches. In: //Proceedings of the 12th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2011) – Volume Part I//, pages 82-95, Tokyo, Japan, 2011, published by Springer Berlin / Heidelberg, ISBN 978-3-642-19399-6. | * Loganathan Ramasamy, Zdeněk Žabokrtský: [[http://www.springerlink.com/content/w18v7621070h51g1/|Tamil Dependency Parsing: Results using Rule Based and Corpus Based Approaches]]. In: //Proceedings of the 12th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2011) – Volume Part I//, pages 82-95, Tokyo, Japan, 2011, published by Springer Berlin / Heidelberg, ISBN 978-3-642-19399-6. |
* Loganathan Ramasamy, Zdeněk Žabokrtský: Prague Dependency Style Treebank for Tamil. In: //Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012)//, İstanbul, Turkey, 2012 | * Loganathan Ramasamy, Zdeněk Žabokrtský: Prague Dependency Style Treebank for Tamil. In: //Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012)//, İstanbul, Turkey, 2012 |
* Documentation | * Documentation |
* [[http://ufal.mff.cuni.cz/~ramasamy/tamiltb/0.1/morph_annotation.html|Morphological annotation]] | * [[http://ufal.mff.cuni.cz/~ramasamy/tamiltb/0.1/morph_annotation.html|Morphological annotation]] |
* [[http://ufal.mff.cuni.cz/~ramasamy/tamiltb/0.1/dependency_annotation.html|Syntactic annotation]] | * [[http://ufal.mff.cuni.cz/~ramasamy/tamiltb/0.1/dependency_annotation.html|Syntactic annotation]] |
| * Loganathan Ramasamy, Zdeněk Žabokrtský: [[http://ufal.mff.cuni.cz/~ramasamy/papers/2011-TamilTB-TR.pdf|Tamil Dependency Treebank (TamilTB) – 0.1 Annotation Manual]]. Technical Report TR-2011-42, ÚFAL MFF UK, Praha, Czechia, 2011 |
| |
==== Domain ==== | ==== Domain ==== |
Tamil script has been [[http://ufal.mff.cuni.cz/~ramasamy/tamiltb/0.1/introduction.html#Text_preprocessing|romanized]] (the romanization is case-sensitive). | Tamil script has been [[http://ufal.mff.cuni.cz/~ramasamy/tamiltb/0.1/introduction.html#Text_preprocessing|romanized]] (the romanization is case-sensitive). |
| |
The treebank is distributed in three formats: TMT ([[http://ufal.mff.cuni.cz/tectomt/|TectoMT]] XML), [[formát CoNLL|CoNLL]] and TnT-tagger style (only POS-tagged layer). | The treebank is distributed in three formats: TMT ([[http://ufal.mff.cuni.cz/tectomt/|TectoMT]] XML), [[:formát CoNLL|CoNLL]] and TnT-tagger style (only POS-tagged layer). |
| |
Morphological annotation is manual and it includes lemmas, parts of speech and morphosyntactic features. Syntactic annotation follows the style of the [[cs|Prague Dependency Treebank]]. | Morphological annotation is manual and it includes lemmas, parts of speech and morphosyntactic features. Syntactic annotation follows the style of the [[cs|Prague Dependency Treebank]]. |