[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision Both sides next revision
courses:rg:overcoming_vocabulary_sparsity_in_mt_using_lattices [2010/11/29 23:46]
ivanova
courses:rg:overcoming_vocabulary_sparsity_in_mt_using_lattices [2010/11/29 23:46]
ivanova
Line 16: Line 16:
 The article is about overcoming the problem of vocabulary sparsity in SMT. The sparsity occurs because many words can have inflection or can take different affixes while in the vocabulary we might not find all those forms. The article is about overcoming the problem of vocabulary sparsity in SMT. The sparsity occurs because many words can have inflection or can take different affixes while in the vocabulary we might not find all those forms.
 The authors of the article introduce three problems and their methods to overcome this challenges: The authors of the article introduce three problems and their methods to overcome this challenges:
 +
 1. common stems are fragmented into many different forms in training data; 1. common stems are fragmented into many different forms in training data;
 2. rare and unknown words are frequent in test data; 2. rare and unknown words are frequent in test data;

[ Back to the navigation ] [ Back to the content ]