[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Last revision Both sides next revision
courses:rg:multilingual-noise-robust-supervised-morphological-analysis-using-the-wordframe-model [2011/01/07 18:51]
kirschner
courses:rg:multilingual-noise-robust-supervised-morphological-analysis-using-the-wordframe-model [2011/01/09 17:45]
kirschner
Line 5: Line 5:
 ===== Comments ===== ===== Comments =====
  
 +=== Summary ===
   * In this paper the author presents a new supervized method for lemmatization, called WordFrame model.   * In this paper the author presents a new supervized method for lemmatization, called WordFrame model.
   * This new method is compared to existing End-Of-String method and is proven better in most of the cases.   * This new method is compared to existing End-Of-String method and is proven better in most of the cases.
Line 11: Line 12:
   * The WordFrame model algorithm trains well on noisy data, therefore it can be used in co-training with unsupervised methods.   * The WordFrame model algorithm trains well on noisy data, therefore it can be used in co-training with unsupervised methods.
      
 +=== Described models ===
 +Both models described in this paper were ment to decompose the word to some basic parts (not morphemes, but similar).
 +
 +==Extended End-of-String model==
 +Decomposition of inflection into 
 +  * prefix - //concatenation of all prefixes//
 +  * primary common substring - //the stem//
 +  * point of suffixation change - //phonologicaly induced letter change on the boundary of stem and suffix//
 +  * suffix/ending - //concatenation of all suffixes of the word//
 +
 +==WordFrame model==
 +Decomposition of inflection into 
 +  * prefix - //concatenation of all prefixes//
 +  * point of prefixation change - //phonologicaly induced letter change on the boundary of first part of stem and prefix//
 +  * secondary common substring - //the part of stem before stem vowel change//
 +  * vowel change - //the vowel change inside the stem//
 +  * primary common substring - //the part of stem after the vowel change//
 +  * point of suffixation change - //phonologicaly induced letter change on the boundary of stem and suffix//
 +  * suffix/ending - //concatenation of all suffixes of the word//
  
 ===== Suggested Additional Reading ===== ===== Suggested Additional Reading =====

[ Back to the navigation ] [ Back to the content ]