Table of Contents
References for DZ Interset
If you report on something interesting you did with DZ Interset, please cite the following paper:
- Daniel Zeman: Reusable Tagset Conversion Using Tagset Drivers. In: Proceedings of the Language Resources and Evaluation Conference, LREC 2008. CD full edition + printed Conference Abstracts (ISBN 2-9517408-4-0). Marrakech, Morocco, 2008. (Available here.)
Follow-up papers:
- Daniel Zeman: Hard Problems of Tagset Conversion. In: Proceedings of the Second International Conference on Global Interoperability for Language Resources (ICGL-2010), City University of Hong Kong, Hong Kong, China, 2010. (Available here.)
Used in / cited by
Also, if you do something relevant to DZ Interset, I can include a pointer to your work here:
- First used to convert Danish, Swedish and English tags in experiments with cross-language parser adaptation.
- Daniel Zeman, Philip Resnik: Cross-Language Parser Adaptation between Related Languages. In: Proceedings of IJCNLP 2008 Workshop on NLP for Less Privileged Languages, pp. 35-42. Hyderabad, India, 2008. (Available here.)
- MorphCon (cs) is a tool for conversion of various tagsets for Czech. It is based on Interset. It has been presented here:
- Petr Pořízka (UP Olomouc) & Markus Schäfer (Universität Bonn): MorphCon – program pro konverzi českých morfologických tagsetů. In: Čeština ve formální gramatice 2009, Brno, Filozofická fakulta Masarykovy univerzity, February 2009
- Petr Pořízka, Markus Schäfer: MorphCon - A Software for Conversion of Czech Morphological Tagsets. In: Jana Levická, Radovan Garabík (eds.): NLP, Corpus Linguistics, Corpus Based Grammar Research, Fifth International Conference, Smolenice, Slovakia, 25-27 November 2009, Proceedings, pp. 292-301. Slovenská akadémia vied, Bratislava, Slovakia, 2009
- http://www.korpling.webnode.cz/softwarove-nastroje/ (Olomouc, starší stránka)
- HamleDT (Harmonized Multi-LanguagE Dependency Treebank)
- LREC 2012 paper (Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič)
Related work
- The AMALGAM project provides lists of tags with descriptions and examples for 8 English tagsets used in different corpora
-
- Christian Chiarcos, Tomaž Erjavec: OWL/DL formalization of the MULTEXT-East morphosyntactic specifications. In: Proceedings of the Fifth Law Workshop (LAW V), pages 11–20, Portland, Oregon, 23-24 June 2011. © 2011 Association for Computational Linguistics.
Other links
- Ajka is a Czech morphological analyzer from Brno, now deprecated. It can be downloaded but the archive is password-protected. The website does not say anything about license terms. However, there is a working web interface where Ajka can be tested. Ajka gave name to the Czech tagset it uses.