
Institute of Formal and Applied Linguistics Wiki



Differences

This shows you the differences between two versions of the page.


user:zeman:transliteration-of-urdu-to-latin-script [2010/11/10 13:59]
zeman Attached the software.
user:zeman:transliteration-of-urdu-to-latin-script [2010/11/10 14:03]
zeman Translations of frequent words.
Line 133:
 Some frequent words cannot be disambiguated by character-based rules alone but a vocabulary could identify them as existing unambiguous Urdu words and save much manual work by disambiguating them. Here are some examples:
  
-  * ہے => he
-  * میں => meñ
-  * ایک => ek
-  * اور => or
+  * ہے => he (“is”)
+  * میں => meñ (“in”)
+  * ایک => ek (“one”)
+  * اور => or (“and”)
  
 Note however that there are inherently ambiguous words that cannot be disambiguated without human intervention (or at least without looking at the neighboring words). Examples:
  
-  * تو => to | tū
-  * اس => is | us
-  * ان => in | un
+  * تو => to (“so”) | tū (“thou”)
+  * اس => is (“of this”) | us (“of that”)
+  * ان => in (“of these”) | un (“of those”)
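
The vocabulary idea described above can be sketched as a plain dictionary lookup. This is only an illustration, not the attached transliteration script: the names `UNAMBIGUOUS`, `AMBIGUOUS`, `transliterate_word` and `transliterate` are hypothetical, the lexicon contains only the example words listed here, and the optional `fallback` argument stands in for whatever character-based rules are applied to words outside the vocabulary.

```python
# Hypothetical mini-lexicon built only from the examples in the text.
UNAMBIGUOUS = {
    "ہے": "he",     # "is"
    "میں": "meñ",   # "in"
    "ایک": "ek",    # "one"
    "اور": "or",    # "and"
}

# Words with more than one valid reading; context or a human must choose.
AMBIGUOUS = {
    "تو": ("to", "tū"),   # "so" | "thou"
    "اس": ("is", "us"),   # "of this" | "of that"
    "ان": ("in", "un"),   # "of these" | "of those"
}


def transliterate_word(word, fallback=None):
    """Return the Latin form of one Urdu word.

    Known unambiguous words are resolved directly. Ambiguous words keep
    all candidate readings, joined with " | ", for later manual review.
    Unknown words go to `fallback` (assumed to hold the character-based
    rules) or are returned unchanged when no fallback is given.
    """
    if word in UNAMBIGUOUS:
        return UNAMBIGUOUS[word]
    if word in AMBIGUOUS:
        return " | ".join(AMBIGUOUS[word])
    return fallback(word) if fallback is not None else word


def transliterate(text, fallback=None):
    """Transliterate whitespace-separated words one by one."""
    return " ".join(transliterate_word(w, fallback) for w in text.split())
```

Keeping the ambiguous words in a separate table makes it easy to flag them for manual review instead of silently picking one reading.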
  
 ===== The Transliteration Script =====
