Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revision Both sides next revision | ||
courses:rg:overcoming_vocabulary_sparsity_in_mt_using_lattices [2010/11/29 23:46] ivanova |
courses:rg:overcoming_vocabulary_sparsity_in_mt_using_lattices [2010/11/29 23:46] ivanova |
||
---|---|---|---|
Line 16: | Line 16: | ||
The article is about overcoming the problem of vocabulary sparsity in SMT. The sparsity occurs because many words can have inflection or can take different affixes while in the vocabulary we might not find all those forms. | The article is about overcoming the problem of vocabulary sparsity in SMT. The sparsity occurs because many words can have inflection or can take different affixes while in the vocabulary we might not find all those forms. | ||
The authors of the article introduce three problems and their methods to overcome this challenges: | The authors of the article introduce three problems and their methods to overcome this challenges: | ||
+ | |||
1. common stems are fragmented into many different forms in training data; | 1. common stems are fragmented into many different forms in training data; | ||
2. rare and unknown words are frequent in test data; | 2. rare and unknown words are frequent in test data; |