[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
user:zeman:interset:features [2014/06/16 16:41]
zeman Další hodnoty byly ze subpos přesunuty jinam.
user:zeman:interset:features [2014/06/26 16:04]
zeman New feature: nametype.
Line 3: Line 3:
 ===== pos ===== ===== pos =====
  
-Part of speech. Pronouns, determiners, predeterminers and articles are roofed by nouns and adjectives, and distinguished by values of other features.+Part of speech. Pronouns, determiners, predeterminers and articles are roofed by nouns and adjectives, and distinguished by values of other features. The ''num'' value is intended for cardinal numbers (''numtype = card''). Other types of numerals are roofed by syntactically defined parts of speech (adjectives or adverbs) and distinguished by values of ''numtype''.
  
 | **Value** | **Description** |  | **Value** | **Description** |
 | noun | noun | | noun | noun |
 | adj | adjective | | adj | adjective |
-| num | numeralnumber |+| num | numeral (cardinal number|
 | verb | verb | | verb | verb |
 | adv | adverb | | adv | adverb |
Line 16: Line 16:
 | int | interjection | | int | interjection |
 | punc | punctuation or symbol | | punc | punctuation or symbol |
- 
-===== subpos ===== 
- 
-Detailed part of speech. Subject to change - more value groups will probably be made separate features, e.g. ''verbtype''. 
- 
-| **Value** | **Main pos** | **Description** | 
-| mod | adv, part | modal particle (Bulgarian "май" = "possibly", "нека" = "let"; Czech "ať", "kéž", "nechť") or adverb of modal nature (Bulgarian "апропо"); note that verbs have their own ''mod'' subpos | 
-| adadj | adv | ad-adjective: special form in Finnish, derived from adjectives, used only to modify other adjectives (http://archives.conlang.info/pei/juenchen/phaelbhaduen.html) | 
-| ex | adv | existential "there" in English | 
-| emp | part | particle of emphasis (Bulgarian "даже" = "even") | 
-| res | part | particle of response ("yes", "no") | 
-| inf | part | infinitive marker (English "to", German "zu", Danish "at", Swedish "att"). Sometimes tagged as particle, sometimes as conjunction, sometimes has its own part of speech. | 
-| vbp | part | separated verb prefix (German "vor" in "stellen Sie sich vor"); analogical verbal particles in English? | 
  
 ===== nountype ===== ===== nountype =====
Line 36: Line 23:
 | prop | proper noun ("George", "Bush", "Paris") | | prop | proper noun ("George", "Bush", "Paris") |
 | class | classifier (measure word) between number and counted noun, e.g. Chinese "個 gè" | | class | classifier (measure word) between number and counted noun, e.g. Chinese "個 gè" |
 +
 +===== nametype =====
 +
 +Semantic classification of named entities.
 +
 +| **Value** | **Description** |
 +| geo | geographical name ("Praha", "Ústí nad Labem") |
 +| prs | personal name (no first/last distinction available) |
 +| giv | given (first) name ("Petr", "John") |
 +| sur | surname (last name) ("Dvořák", "Zelený", "Agassi", "Bush") |
 +| nat | nationality ("Čech", "Kolumbijec") or a name of an inhabitant of certain location ("Pražan") |
 +| com | company ("Tatra" (the company)) |
 +| pro | product ("Tatra" (the car)) |
 +
 +If the value is empty but ''nountype'' is ''prop'' then it is a named entity of unknown type, e.g. mines, stadiums, guerilla bases etc. The same approach may be used for functional words in names.
  
 ===== adjtype ===== ===== adjtype =====
Line 71: Line 73:
 | mult | multiplier number ("five times") | | mult | multiplier number ("five times") |
 | frac | fraction ("one fifth") | | frac | fraction ("one fifth") |
-| gen | generic numeral ("twofold", Czech "jedny",tvery", "čtvero", "čtverý") |+| gen | generic numeral ("twofold", Czech "jedny",tvero", "čtverý") | 
 +| sets | number of sets of thingsor of pluralia tantum (Czech tvery") |
 | dist | distributive numeral (Hungarian "három-három" in "gyermekenként három-három ezer forinttal" = "three thousand forint per child") | | dist | distributive numeral (Hungarian "három-három" in "gyermekenként három-három ezer forinttal" = "three thousand forint per child") |
 | range | range of values, subtype of card ("two-five" = "two to five") | | range | range of values, subtype of card ("two-five" = "two to five") |
Line 127: Line 130:
 | deg | adverb of quantity or degree ("how much") | | deg | adverb of quantity or degree ("how much") |
 | cau | adverb of cause ("why") | | cau | adverb of cause ("why") |
 +| mod | adverb of modal nature (Bulgarian "апропо") |
 +| adadj | ad-adjective: special form in Finnish, derived from adjectives, used only to modify other adjectives (http://archives.conlang.info/pei/juenchen/phaelbhaduen.html) |
 +| ex | existential "there" in English |
  
 ===== adpostype ===== ===== adpostype =====
Line 143: Line 149:
 | sub | subordinating conjunction | | sub | subordinating conjunction |
 | comp | comparing conjunction (German "wie", "als") | | comp | comparing conjunction (German "wie", "als") |
 +| oper | mathematical operator (Czech "krát") |
 +
 +===== parttype =====
 +
 +| **Value** | **Description** |
 +| mod | modal particle (Bulgarian "май" = "possibly", "нека" = "let"; Czech "ať", "kéž", "nechť") |
 +| emp | particle of emphasis (Bulgarian "даже" = "even") |
 +| res | particle of response ("yes", "no") |
 +| inf | infinitive marker (English "to", German "zu", Danish "at", Swedish "att"). Sometimes tagged as particle, sometimes as conjunction, sometimes has its own part of speech. |
 +| vbp | separated verb prefix (German "vor" in "stellen Sie sich vor"); analogical verbal particles in English? |
  
 ===== punctype ===== ===== punctype =====

[ Back to the navigation ] [ Back to the content ]