References for DZ Interset
If you report on something interesting you did with DZ Interset, please cite the following paper:
- Daniel Zeman: Reusable Tagset Conversion Using Tagset Drivers. In: Proceedings of the Language Resources and Evaluation Conference, LREC 2008. CD full edition + printed Conference Abstracts (ISBN 2-9517408-4-0). Marrakech, Morocco, 2008. (Available here.)
Follow-up papers:
- Daniel Zeman: Hard Problems of Tagset Conversion. In: Proceedings of the Second International Conference on Global Interoperability for Language Resources (ICGL-2010), City University of Hong Kong, Hong Kong, China, 2010. (Available here.)
Used in / cited by
Also, if you do something relevant to DZ Interset, I can include a pointer to your work here:
- First used to convert Danish, Swedish and English tags in experiments with cross-language parser adaptation.
- Daniel Zeman, Philip Resnik: Cross-Language Parser Adaptation between Related Languages. In: Proceedings of IJCNLP 2008 Workshop on NLP for Less Privileged Languages, pp. 35-42. Hyderabad, India, 2008. (Available here.)
- MorphCon (cs) is a tool for conversion of various tagsets for Czech. It is based on Interset. It has been presented here:
- Petr Pořízka (UP Olomouc) & Markus Schäfer (Universität Bonn): MorphCon – program pro konverzi českých morfologických tagsetů. In: Čeština ve formální gramatice 2009, Brno, Filozofická fakulta Masarykovy univerzity, February 2009
- Petr Pořízka, Markus Schäfer: MorphCon - A Software for Conversion of Czech Morphological Tagsets. In: Jana Levická, Radovan Garabík (eds.): NLP, Corpus Linguistics, Corpus Based Grammar Research, Fifth International Conference, Smolenice, Slovakia, 25-27 November 2009, Proceedings, pp. 292-301. Slovenská akadémia vied, Bratislava, Slovakia, 2009
- http://www.korpling.webnode.cz/softwarove-nastroje/ (Olomouc, older page)
- HamleDT (Harmonized Multi-LanguagE Dependency Treebank)
- LREC 2012 paper (Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, Jan Hajič)
Related work
- The AMALGAM project provides lists of tags with descriptions and examples for 8 English tagsets used in different corpora
- Christian Chiarcos, Tomaž Erjavec: OWL/DL formalization of the MULTEXT-East morphosyntactic specifications. In: Proceedings of the Fifth Linguistic Annotation Workshop (LAW V), pp. 11–20. Portland, Oregon, 23-24 June 2011.
Other links
- Ajka is a Czech morphological analyzer from Brno, now deprecated. It can be downloaded, but the archive is password-protected, and the website says nothing about license terms. However, there is a working web interface where Ajka can be tested. Ajka gave its name to the Czech tagset it uses.