Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
courses:rg:extracting-parallel-sentences-from-comparable-corpora [2011/05/22 19:16] ivanova |
courses:rg:extracting-parallel-sentences-from-comparable-corpora [2011/05/22 19:23] (current) ivanova |
||
---|---|---|---|
Line 1: | Line 1: | ||
+ | **Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment** | ||
+ | //Jason R. Smith Chris Quirk and Kristina Toutanova// | ||
+ | |||
====== Introduction ====== | ====== Introduction ====== | ||
+ | |||
Article is about parallel sentence extraction from Wikipedia. This resource can be viewed as comparable corpus in which the document alignment is already provided by the interwiki links. | Article is about parallel sentence extraction from Wikipedia. This resource can be viewed as comparable corpus in which the document alignment is already provided by the interwiki links. | ||
Line 76: | Line 80: | ||
* Induced word-level lexicon in combination with sentence extraction helps to achieve substantial gains. | * Induced word-level lexicon in combination with sentence extraction helps to achieve substantial gains. | ||
- | ===== Strong sides of the article ===== | + | ====== Strong sides of the article ====== |
* Novel approaches to extracting parallel sentences. | * Novel approaches to extracting parallel sentences. | ||
Line 88: | Line 93: | ||
Our understanding of this feature is: | Our understanding of this feature is: | ||
TOPIC A: EN <-> ES | TOPIC A: EN <-> ES | ||
- | | + | ↓ ↓ |
TOPIC B: EN <-> ES | TOPIC B: EN <-> ES | ||
where | where | ||
↓ is a link | ↓ is a link | ||
+ | |||
<-> is an interwiki link | <-> is an interwiki link | ||
- | --- //Angelina Ivanova // | + | --- //Comments by Angelina Ivanova // |