[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision Both sides next revision
user:zeman:interset:to-do [2008/03/31 16:17]
zeman
user:zeman:interset:to-do [2008/03/31 18:10]
zeman Removed values of definiteness that are now in prontype.
Line 12: Line 12:
 ===== Features and values ===== ===== Features and values =====
  
-  * Udělat pořádek v zájmenechdeterminátorechtázacích příslovcích apodU starších ovladačů jsem používal jiný přístup než u novějších (počínaje bulharštinou), mělo by se to sjednotit. Samostatný slovní druh zájmeno přestane existovat. +  * Normalize processing of pronounsdeterminersinterrogative adverbs etcOld drivers use a different approach from the new ones (beginning with Bulgarian). Pronoun as an independent part of speech will cease to exist.
-    * Reduce value range of ''definiteness'' to ''def'' and ''ind''. Map the other values to the values of ''prontype''. Note that now ''definiteness="ind"'' does not necessarily mean ''prontype="ind"'', ''definiteness="def"'' does not imply ''prontype="dem"'', and ''negativeness="neg"'' need not correspond to ''prontype="neg"''. However, since for most drivers there is no difference, the driver tester might issue a warning if a decoder does not set these features in parallel. Test the affected drivers thoroughly.+
     * Remove ''pos="det"''. Instead, ''det'' will be a ''subpos'' of adjectives, similarly to ''pdt''. Setting ''prontype'' or leaving it empty determines how determiners will be treated in tagsets where there is no such category. With empty ''prontype'', they will become adjectives. If ''prontype'' is set, they will become pronouns.     * Remove ''pos="det"''. Instead, ''det'' will be a ''subpos'' of adjectives, similarly to ''pdt''. Setting ''prontype'' or leaving it empty determines how determiners will be treated in tagsets where there is no such category. With empty ''prontype'', they will become adjectives. If ''prontype'' is set, they will become pronouns.
     * Remove ''pos="pron"''. Distribute pronouns to nouns, adjectives and adverbs. When encoding into a tagset that distinguishes pronouns, detect pronouns by non-empty ''prontype''.     * Remove ''pos="pron"''. Distribute pronouns to nouns, adjectives and adverbs. When encoding into a tagset that distinguishes pronouns, detect pronouns by non-empty ''prontype''.
-    * Ze subpos=clit udělat samostatnou vlastnost, aby se usnadnil dotaz, zda je zájmeno osobníNebo tuto vlastnost spíš zrušitTohle je jednak problém změny práce se zájmenyjednak připravované koncepce práce se staženými tvary (viz níže).+    * Move ''subpos=clit'' to an independent feature so that it is easier to ask whether a pronoun is personalOr remove the featureThis is connected to the problem of changed processing of pronounsand of the processing of contracted word forms (see below).
   * Find more fine-grained classification of punctuation and symbols. Danish has punctuation proper, symbols (+, $), and strange strings like "U-21".   * Find more fine-grained classification of punctuation and symbols. Danish has punctuation proper, symbols (+, $), and strange strings like "U-21".
   * Classification of coordinative conjunctions: copulative, adversative etc. Example: sv::mamba.   * Classification of coordinative conjunctions: copulative, adversative etc. Example: sv::mamba.

[ Back to the navigation ] [ Back to the content ]