Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Last revision Both sides next revision | ||
courses:rg:multilingual-noise-robust-supervised-morphological-analysis-using-the-wordframe-model [2011/01/07 18:51] kirschner |
courses:rg:multilingual-noise-robust-supervised-morphological-analysis-using-the-wordframe-model [2011/01/09 17:45] kirschner |
||
---|---|---|---|
Line 5: | Line 5: | ||
===== Comments ===== | ===== Comments ===== | ||
+ | === Summary === | ||
* In this paper the author presents a new supervized method for lemmatization, | * In this paper the author presents a new supervized method for lemmatization, | ||
* This new method is compared to existing End-Of-String method and is proven better in most of the cases. | * This new method is compared to existing End-Of-String method and is proven better in most of the cases. | ||
Line 11: | Line 12: | ||
* The WordFrame model algorithm trains well on noisy data, therefore it can be used in co-training with unsupervised methods. | * The WordFrame model algorithm trains well on noisy data, therefore it can be used in co-training with unsupervised methods. | ||
| | ||
+ | === Described models === | ||
+ | Both models described in this paper were ment to decompose the word to some basic parts (not morphemes, but similar). | ||
+ | |||
+ | ==Extended End-of-String model== | ||
+ | Decomposition of inflection into | ||
+ | * prefix - // | ||
+ | * primary common substring - //the stem// | ||
+ | * point of suffixation change - // | ||
+ | * suffix/ | ||
+ | |||
+ | ==WordFrame model== | ||
+ | Decomposition of inflection into | ||
+ | * prefix - // | ||
+ | * point of prefixation change - // | ||
+ | * secondary common substring - //the part of stem before stem vowel change// | ||
+ | * vowel change - //the vowel change inside the stem// | ||
+ | * primary common substring - //the part of stem after the vowel change// | ||
+ | * point of suffixation change - // | ||
+ | * suffix/ | ||
===== Suggested Additional Reading ===== | ===== Suggested Additional Reading ===== |