Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revision Both sides next revision | ||
user:zeman:transliteration-of-urdu-to-latin-script [2010/11/10 13:59] zeman Attached the software. |
user:zeman:transliteration-of-urdu-to-latin-script [2010/11/10 14:03] zeman Translations of frequent words. |
||
---|---|---|---|
Line 133: | Line 133: | ||
Some frequent words cannot be disambiguated by character-based rules alone but a vocabulary could identify them as existing unambiguous Urdu words and save much manual work by disambiguating them. Here are some examples: | Some frequent words cannot be disambiguated by character-based rules alone but a vocabulary could identify them as existing unambiguous Urdu words and save much manual work by disambiguating them. Here are some examples: | ||
- | * ہے => he | + | * ہے => he (“is”) |
- | * میں => meñ | + | * میں => meñ (“in”) |
- | * ایک => ek | + | * ایک => ek (“one”) |
- | * اور => or | + | * اور => or (“and”) |
Note however that there are inherently ambiguous words that cannot be disambiguated without human intervention (or at least without looking at the neighboring words). Examples: | Note however that there are inherently ambiguous words that cannot be disambiguated without human intervention (or at least without looking at the neighboring words). Examples: | ||
- | * تو => to | tū | + | * تو => to (“so”) |
- | * اس => is | us | + | * اس => is (“of this”) |
- | * ان => in | un | + | * ان => in (“of these”) |
===== The Transliteration Script ===== | ===== The Transliteration Script ===== |