Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
user:zeman:interset:features [2009/10/29 23:00] zeman Finno-Ugric cases. |
user:zeman:interset:features [2013/11/18 14:56] zeman Causative voice. |
||
---|---|---|---|
Line 19: | Line 19: | ||
===== subpos ===== | ===== subpos ===== | ||
- | Detailed part of speech. | + | Detailed part of speech. Subject to change - more value groups will probably be made separate features, e.g. '' |
| **Value** | **Main pos** | **Description** | | | **Value** | **Main pos** | **Description** | | ||
Line 32: | Line 32: | ||
| verbconj | verb | finite verb with the enclitic " | | verbconj | verb | finite verb with the enclitic " | ||
| mod | adv, part | modal particle (Bulgarian " | | mod | adv, part | modal particle (Bulgarian " | ||
+ | | adadj | adv | ad-adjective: | ||
| ex | adv | existential " | | ex | adv | existential " | ||
| voc | prep | vocalized preposition (Czech " | | voc | prep | vocalized preposition (Czech " | ||
Line 61: | Line 62: | ||
| neg | Negative pronoun (" | | neg | Negative pronoun (" | ||
| ind | Indefinite pronoun (" | | ind | Indefinite pronoun (" | ||
- | | tot | Totality pronoun (" | + | | tot | Totality |
===== numtype ===== | ===== numtype ===== | ||
Line 73: | Line 74: | ||
| frac | fraction ("one fifth" | | frac | fraction ("one fifth" | ||
| gen | generic numeral (" | | gen | generic numeral (" | ||
+ | | dist | distributive numeral (Hungarian " | ||
===== numform ===== | ===== numform ===== | ||
Line 91: | Line 93: | ||
| 2 | numeric value 2 (Czech " | | 2 | numeric value 2 (Czech " | ||
| 3 | numeric value 3 or 4 (Czech " | | 3 | numeric value 3 or 4 (Czech " | ||
- | |||
===== accommodability ===== | ===== accommodability ===== | ||
Line 170: | Line 171: | ||
===== negativeness ===== | ===== negativeness ===== | ||
- | |||
- | Distinguishes also negative pronouns like " | ||
| **Value** | **Description** | | | **Value** | **Description** | | ||
Line 180: | Line 179: | ||
See also the '' | See also the '' | ||
+ | In Arabic, definiteness is also called the //state.// | ||
+ | http:// | ||
+ | http:// | ||
+ | http:// | ||
| **Value** | **Description** | | | **Value** | **Description** | | ||
| ind | indefinite | | | ind | indefinite | | ||
| def | definite | | | def | definite | | ||
- | | red | reduced: used in [[http:// | + | | red | reduced: used in [[http:// |
+ | | com | complex: used in [[http:// | ||
===== foreign ===== | ===== foreign ===== | ||
Line 229: | Line 233: | ||
| dual | dual | | | dual | dual | | ||
| plu | plural | | | plu | plural | | ||
+ | | ptan | plurale tantum | | ||
+ | | coll | collective / mass / singulare tantum | | ||
+ | |||
+ | //Pluralia tantum// is a special case of plural, occurring e.g. in Czech. It applies to words that do not have singular forms. They use grammatical plural regardless of semantic number. Czech example: //nůžky// " | ||
+ | |||
+ | // | ||
===== possnumber ===== | ===== possnumber ===== | ||
Line 239: | Line 249: | ||
| plu | plural | | | plu | plural | | ||
+ | It applies e.g. to possessive pronouns and it can be different from their grammatical number, which is governed by agreement with the modified (possessed) noun phrase. Czech example: //můj pes// "my dog" (grammatical singular, possessor singular), //mí psi// "my dogs" (grammatical plural, possessor singular), //náš pes// "our dog" (grammatical singular, possessor plural), //naši psi// "our dogs" (grammatical plural, possessor plural). | ||
- | ===== case ===== | + | ===== possednumber |
+ | |||
+ | Possednumber is the possessee' | ||
| **Value** | **Description** | | | **Value** | **Description** | | ||
- | | nom | nominative | + | | sing | singular |
- | | gen | genitive | + | | dual | dual | |
- | | dat | dative | | + | | plu | plural |
- | | acc | accusative or oblique | | + | |
- | | voc | vocative | | + | |
- | | loc | locative | | + | |
- | | ins | instrumental | + | |
- | ==== Other cases ==== | + | In Hungarian, possession can be marked on the possessor or on the possessed. It is possible, though rare, that a noun has three distinct number features: its own grammatical number, number of its possessor and number of its possession. Examples from the Multext-East Hungarian lexicon: |
- | (Not yet used in any tag set driver.) | + | * könnyedén |
+ | * könny = a tear (singular) | ||
+ | * könnyed = your tear (singular owner) | ||
+ | * könnyedé = (possession) of your tear (singular possession) | ||
+ | * könnyedén = (on the possession) of your tear (superessive case) | ||
+ | * ellenfeleié (PSS) | ||
+ | * ellenfél = an opponent (singular) | ||
+ | * ellenfele = his/her/its opponent (singular owner) | ||
+ | * ellenfelei = his/her/its opponents (core plural, singular owner) | ||
+ | * ellenfeleié = (possession) of his/her/its opponents (singular possession) | ||
+ | * életeké (SPS) | ||
+ | * él = point (singular) | ||
+ | * élek = points (plural) | ||
+ | * élén = his/her/its point (singular owner) | ||
+ | * élünk = our point (plural owner) | ||
+ | * életeké = (possession) of our point (singular possession) | ||
+ | * tárgyalópartnereinkét (PPS) | ||
+ | * tárgyalópartner = negotiator (singular) | ||
+ | * tárgyalópartnerei = his/her/its negotiators (plural, singular owner) | ||
+ | * tárgyalópartnereinkét = (possession) of our negotiators (plural, plural owner, singular possession, accusative case) | ||
- | | + | Words marked for plural possessions are very rare, though. Note that in the following example from Multext-East, |
+ | | ||
+ | * Kolumbusz = Columbus (singular) | ||
+ | * Kolumbuszéi = (possessions) of Columbus (plural possession) | ||
+ | * Kolumbuszéinál = (at the possessions) of Columbus (adessive case) | ||
+ | |||
+ | ===== case ===== | ||
+ | |||
+ | | **Value** | **Description** | **Examples** | | ||
+ | | nom | nominative | cs: dům, budova = a house, building | | ||
+ | | gen | genitive | cs: domu, budovy = of a house; in Basque, this is possessive genitive (as opposed to locative | ||
+ | | dat | dative | cs: domu, budově = to a house | | ||
+ | | acc | accusative or oblique | cs: dům, budovu = a house | | ||
+ | | voc | vocative | cs: dome, budovo = hey, you house! | | ||
+ | | loc | locative | cs: v& | ||
+ | | ins | instrumental | cs: domem, budovou = with/ | ||
+ | | ist | instructive | Used rarely in Finnish to express "with (the aid of)", i.e. semantically similar to instrumental (which is not defined in Finnish grammar). Shall we merge instructive with instrumental? | ||
+ | | abl | ablative | hu: a barátomtól jövök = jdu od kamaráda fi: pöydältä = se stolu; katolta = se střechy; rannalta = z pláže | | ||
+ | | par | partitive | Ve finštině vyjadřuje neznámou identitu a neukončené akce bez výsledku. Příklady užití: kolme taloa = tři domy (koncovka -a u talo); rakastan tätä taloa = mám rád tento dům; saanko lainata kirjaa? = můžu si půjčit tu knihu? (koncovka -a u kirja); lasissa on maitoa = ve sklenici je (nějaké) mléko; akuzativ: ammuin karhun = zastřelil jsem medvěda (a vím, že je mrtvý); partitiv: ammuin karhua = střelil jsem po medvědovi (a nevím ani, jestli jsem ho trefil); Akuzativ (v opozici k partitivu) taky může suplovat chybějící budoucí čas: akuzativ: luen kirjan = přečtu si tu knihu; partitiv: luen kirjaa = čtu knihu | | ||
+ | | dis | distributive | Vyjadřuje, že se něco stalo každému prvku množiny, jednomu po druhém. Nebo vyjadřuje frekvenci v čase. hu: fejenként = na hlavu, esetenként = v některém případě, hetenként = jednou týdně, tízpercenként = každých deset minut | | ||
+ | | ess | essive / prolative | Určuje dočasný stav, často odpovídá anglickému "as a ...". fi: lapsi = dítě, lapsena = jako dítě, když byl dítě ee: laps = dítě, lapse = dítěte (genitiv), lapsena = jako dítě. Similar case called prolative in Basque grammar: eu: erreformistatzat = as a reformer (erreformista = reformer) | | ||
+ | | tra | translative / factive | Určuje změnu stavu (" | ||
+ | | com | comitative / associative | Vyjadřuje " | ||
+ | | abe | abessive | Vyjadřuje " | ||
+ | | ine | inessive | hu: házban = v domě (ház = dům) fi: talossa = v domě (talo = dům) ee: majas = v domě (maja = dům) | | ||
+ | | ela | elative | hu: házból = z domu fi: talosta = z domu ee: majast = z domu | | ||
+ | | ill | illative | hu: házba = do domu fi: taloon = do domu ee: majasse / majja = do domu | | ||
+ | | add | additive | Distinguished by some scholars in Estonian, not recognized by traditional grammar, exists in the Multext-East Estonian tagset and in the Eesti keele puudepank. Reportedly same or similar meaning as illative. Forms of this case exist only in singular and not for all nouns. | | ||
+ | | ade | adessive | hu: asztalon = na stole (asztal = stůl) fi: pöydällä = na stole ee: laual = na stole (laud = stůl) | | ||
+ | | all | allative | fi: pöydälle = na stůl (pöytä = stůl) | | ||
+ | | sub | sublative | Used in Finno-Ugric | ||
+ | | sup | superessive | Used, chiefly [[http:// | ||
+ | | del | delative | Used, chiefly [[http:// | ||
+ | | lat | lative | Denotes movement towards/ | ||
+ | | tem | temporal | Určuje čas. hu: hétkor = v sedm, éjfélkor = o půlnoci, karácsonykor = o Vánocích | | ||
+ | | ter | terminative | Specifies where something ends in space or time. Similar case in Basque is called terminal allative (Spanish //adlativo terminal// | ||
+ | | abs | absolutive | Subject of intransitive verb, direct object of transitive verb. | | ||
+ | | erg | ergative | Subject of transitive verb. | | ||
+ | | cau | causative / motivative | Noun in this case is the cause of something. hu: Hálás leszekérte. eu: jokaeragatik = because of behavior (jokaera = behavior) | | ||
+ | | ben | benefactive / destinative | Corresponds to the preposition " | ||
+ | |||
+ | * Fine grained locative cases (Uralic | ||
| | **Where** | **Where to** | **Where from** | | | | **Where** | **Where to** | **Where from** | | ||
| **Inside of** | inessive | illative | elative | | | **Inside of** | inessive | illative | elative | | ||
| **Surface of** | adessive | allative | ablative | | | **Surface of** | adessive | allative | ablative | | ||
+ | | **Surface of** | superessive | sublative | delative | | ||
+ | |||
+ | http:// | ||
+ | http:// | ||
- | * Alternative locative cases | + | Ten příklad s knihou u partitivu máš blbě, ne? " |
- | * sublative (similar to allative?) | + | |
- | * delative (similar to ablative?) | + | |
- | * superessive (similar to adessive?) | + | |
- | | illative | hu: házba = do domu (ház = dům) ee: majasse / majja = do domu (maja = dům) fi: taloon = do domu (talo = dům) | | + | Copak já vím? Finsky neumím. Je to převzaté |
- | | inessive | hu: házban = v domě fi: talossa = v domě ee: majas = v domě | | + | |
- | | elative | hu: házból = z domu fi: talosta = z domu ee: majast = z domu | | + | |
- | | allative | fi: pöydälle = na stůl (pöytä = stůl) | | + | |
- | | adessive | hu: asztalon = na stole (asztal = stůl) fi: pöydällä = na stole ee: laual = na stole (laud = stůl) | | + | |
- | | ablative | hu: a barátomtól jövök = jdu od kamaráda fi: pöydältä = se stolu; katolta = se střechy; rannalta = z pláže | | + | |
- | | sublative | Směr "na něco", | + | |
- | | delative | Směr "z, od", ale používá se i v jiných významech (např. "o něčem" | + | |
- | | superessive | Určuje místo " | + | |
- | | partitive | Ve finštině vyjadřuje neznámou identitu a neukončené akce bez výsledku. Příklady užití: kolme taloa = tři domy (koncovka | + | |
- | | translative | Určuje změnu stavu (" | + | |
- | | terminative | Určuje, kde něco končí v prostoru nebo čase. hu: a házig = až k tomu domu hu: hat óráig = do šesti hodin ee: jõeni = až k řece ee: kella kuueni = až do šesti hodin | | + | |
- | | essive | Určuje dočasný stav, často odpovídá anglickému "as a ...". fi: lapsi = dítě, lapsena = jako dítě, když byl dítě ee: laps = dítě, lapse = dítěte (genitiv), lapsena = jako dítě | | + | |
- | | abessive | Vyjadřuje, že objekt pojmenovaný podstatným jménem chybí ("bez ..."). fi: rahatta = bez peněz (raha = peníze) | | + | |
- | | comitative | Vyjadřuje " | + | |
- | | temporal | Určuje čas. hu: hétkor = v sedm, éjfélkor = o půlnoci, karácsonykor = o Vánocích | | + | |
- | | causative | Vyjadřuje, že podstatné jméno je něčeho příčinou (" | + | |
- | | sociative | Podobný význam jako kauzativ. V& | + | |
- | | distributive | Vyjadřuje, že se něco stalo každému prvku množiny, jednomu po druhém. Nebo vyjadřuje frekvenci v čase. hu: fejenként = na hlavu, esetenként = v některém případě, hetenként = jednou týdně, tízpercenként = každých deset minut | | + | |
===== prepcase ===== | ===== prepcase ===== | ||
Line 312: | Line 364: | ||
| 2 | second (you) | | | 2 | second (you) | | ||
| 3 | third (he, she, it, they) | | | 3 | third (he, she, it, they) | | ||
+ | |||
+ | ===== possperson ===== | ||
+ | |||
+ | Possgender is possessor' | ||
+ | |||
+ | | **Value** | **Description** | | ||
+ | | 1 | first (my, our) | | ||
+ | | 2 | second (your) | | ||
+ | | 3 | third (his, her, its, their) | | ||
===== politeness ===== | ===== politeness ===== | ||
Line 336: | Line 397: | ||
| trans | transgressive, | | trans | transgressive, | ||
| ger | [[http:// | | ger | [[http:// | ||
- | |||
===== mood ===== | ===== mood ===== | ||
Line 344: | Line 404: | ||
| imp | imperative | | | imp | imperative | | ||
| cnd | conditional | | | cnd | conditional | | ||
+ | | pot | potential (Finnish: the action of the verb is likely but not certain) | | ||
| sub | subjunctive (conjunctive) (spojovací) | | | sub | subjunctive (conjunctive) (spojovací) | | ||
| jus | jussive (přací) | | | jus | jussive (přací) | | ||
+ | | qot | quotative (Estonian: denotes direct speech) | | ||
+ | | opt | optative (Turkish; "May you have a long life! If only I were rich!" | ||
+ | | des | desiderative (Turkish; "He wants to come." | ||
+ | | nec | necessitative (Turkish; "He must come. He should come." | ||
===== tense ===== | ===== tense ===== | ||
Line 353: | Line 418: | ||
| pres | present | | | pres | present | | ||
| fut | future | | | fut | future | | ||
- | |||
===== subtense ===== | ===== subtense ===== | ||
Line 364: | Line 428: | ||
| aor | aorist | | | aor | aorist | | ||
| imp | imperfect | | | imp | imperfect | | ||
+ | | nar | narrative (Turkish // | ||
| pqp | pluperfect | | | pqp | pluperfect | | ||
===== aspect ===== | ===== aspect ===== | ||
+ | |||
+ | See [[http:// | ||
| **Value** | **Description** | | | **Value** | **Description** | | ||
| imp | imperfect | | | imp | imperfect | | ||
| perf | perfect | | | perf | perfect | | ||
+ | | pro | prospective | | ||
+ | | prog | progressive | | ||
===== voice ===== | ===== voice ===== | ||
Line 377: | Line 446: | ||
| act | active | | | act | active | | ||
| pass | passive | | | pass | passive | | ||
+ | | rcp | reciprocal (Turkish " | ||
+ | | caus | causative (Turkish " | ||
+ | |||
+ | {{: | ||
===== abbr ===== | ===== abbr ===== | ||
Line 392: | Line 465: | ||
| hyph | hyphenated prefix (" | | hyph | hyphenated prefix (" | ||
+ | ===== echo ===== | ||
+ | |||
+ | Is this a reduplicative or echo word? Such words occur in Hindi. In Hyderabad Dependency Treebank they get their own part-of-speech tags RDP and ECH, respectively. We do not want to treat them as separate parts of speech because they could be assigned a POS independent of their RDP or ECH status (same as the word that they echo). We may want to merge this feature in future with '' | ||
+ | |||
+ | | **Value** | **Meaning** | **Explanation** | **Examples** | | ||
+ | | rdp | reduplicative | The word is a copy of a previous word. In Hindi, this would add the meaning of distribution ("one rupee each" | ||
+ | | ech | echo | The word rhymes with a previous word but it is not identical to it and typically id does not have any meaning of its own. In Hindi it generalizes the meaning of the previous word and eventually translates as "or something", | ||
+ | For more details see Rupert Snell and Simon Weightman: //Teach Yourself Hindi,// Section 16.4 and 16.5, pages 210 – 211. | ||
===== style ===== | ===== style ===== | ||
Line 401: | Line 482: | ||
| norm | normal, neutral | | | norm | normal, neutral | | ||
| coll | colloquial | | | coll | colloquial | | ||
+ | | vrnc | vernacular | | ||
+ | | slng | slang | | ||
+ | | derg | derogative | | ||
+ | | vulg | vulgar | | ||
===== typo ===== | ===== typo ===== |