[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
user:zeman:interset [2009/02/19 15:35]
zeman
user:zeman:interset [2017/01/16 12:57] (current)
zeman Interset and UD.
Line 1: Line 1:
-====== DZ Interset ======+{{:user:zeman:interset-logo.png|}}
  
-DZ Interset is a means of converting among various tag sets in natural language processing. The core idea is similar to interlingua-based machine translation. DZ Interset defines a set of features that are encoded by the various tag sets. The set of features should be as universal as possible. It does not need to encode everything that is encoded by any tag set but it should encode all information that people may want to access and/or port from one tag set to another.+Interset is a means of converting among various tag sets in natural language processing. The core idea is similar to interlingua-based machine translation. Interset defines a set of features that are encoded by the various tag sets. The set of features should be as universal as possible. It does not need to encode everything that is encoded by any tag set but it should encode all information that people may want to access and/or port from one tag set to another. (The features and their values are very similar to those defined in [[http://universaldependencies.org/|Universal Dependencies]]. UD features are based on Interset. However, Interset still uses a slightly different notation, e.g. all features and values are lowercased.)
  
 New tag sets are attached by writing a //driver// for them. Once the driver is ready, you can easily convert tags between the new set and any other set for which you also have a driver. This reusability is an obvious advantage over writing a targeted conversion procedure each time you need to convert between a particular pair of tag sets. New tag sets are attached by writing a //driver// for them. Once the driver is ready, you can easily convert tags between the new set and any other set for which you also have a driver. This reusability is an obvious advantage over writing a targeted conversion procedure each time you need to convert between a particular pair of tag sets.
  
   * [[user:zeman:interset:License]]   * [[user:zeman:interset:License]]
-  * [[http://ufal.mff.cuni.cz/~zeman/soubory/interset.zip|Download]] or [[http://svn.ms.mff.cuni.cz/projects/interset|browse source]] using Trac (skip authentication to gain default read-only access)+  * [[user:zeman:interset:Download]] 
 +    * [[user:zeman:interset:Versions]] 
 +    * [[http://quest.ms.mff.cuni.cz/cgi-bin/interset/index.pl|Web interface]]
   * [[user:zeman:interset:How to use]]   * [[user:zeman:interset:How to use]]
   * [[user:zeman:interset:How to write a driver]]   * [[user:zeman:interset:How to write a driver]]
-  * [[user:zeman:interset:References]] 
   * [[user:zeman:interset:Features]]   * [[user:zeman:interset:Features]]
   * [[user:zeman:interset:Drivers]]   * [[user:zeman:interset:Drivers]]
 +    * [[user:zeman:interset:Tagsets|Tagsets and corpora]]
   * [[user:zeman:interset:Common problems]]   * [[user:zeman:interset:Common problems]]
     * [[user:zeman:interset:Pronouns|Pronouns and determiners]]     * [[user:zeman:interset:Pronouns|Pronouns and determiners]]
Line 17: Line 19:
   * [[user:zeman:interset:Brainstorming]]   * [[user:zeman:interset:Brainstorming]]
   * [[user:zeman:interset:To do]]   * [[user:zeman:interset:To do]]
 +  * [[user:zeman:interset:References]]
 +  * [[internal:zeman:interset:Poznámky o správě projektu]] (not publicly accessible)
 +
 +===== Acknowledgements =====
  
 +This research has been supported by the grant MSM 0021620838 of the Ministry of Education of the Czech Republic.

[ Back to the navigation ] [ Back to the content ]