[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
user:zeman:transliteration-of-urdu-to-latin-script [2010/11/09 14:29]
zeman Noon ghunna a heh goal/doachashmee.
user:zeman:transliteration-of-urdu-to-latin-script [2010/11/09 16:14]
zeman Hamza.
Line 65: Line 65:
 | 06BE | ھ | h | h | | 06BE | ھ | h | h |
 | 06CC | ی | j | y | | 06CC | ی | j | y |
 +
 +===== Vowels =====
 +
 +The consonant (or semi-vowel) و //(w)// is also ambiguously used to represent the long vowels //ū// (pronounced as //oo// in English //fool//) and //o// (pronounced as //oo// in English //door//). I want to distinguish these three pronunciations. In most cases however, the script can only output //[wūo]// and leave the disambiguation to a human judgment:
 +
 +  * In word-initial position, I assume that only consonantal pronunciation is possible and always output //w//.
 +  * Anywhere immediately before ا (ALEF), I assume that only consonantal pronunciation is possible and always output //w//.
 +  * In word-final position, I believe that vowel is more likely although I am not sure that the consonant can be completely excluded. Nevertheless, I currently output //[ūo]//.
 +  * If it appears immediately before word-final ں (NOON GHUNNA), I consider it part of plural oblique case suffix and invariably output //o//.
 +  * In all other cases I output //[wūo]//.
 +
 +The consonant (or semi-vowel) ی //(y)// is also ambiguously used to represent the long vowels //ī// (pronounced as //ee// in English //feet//) and //e// (pronounced roughly as //ai// in English //fair//). I want to distinguish these three pronunciations. In most cases however, the script can only output //[yīe]// and leave the disambiguation to a human judgment:
 +
 +  * In word-initial position, I assume that only consonantal pronunciation is possible and always output //y//.
 +  * Anywhere immediately before ا (ALEF), I assume that only consonantal pronunciation is possible and always output //y//.
 +  * In word-final position, I assume that the only possible reading is //ī//.
 +  * In all other cases I output //[yīe]//.
 +
 +The letter ے (YEH BARREE) only appears in word-final position and is transliterated as //e// (which is written in other positions using the ambiguous ی).
 +
 +The letter ا (ALEF) is ambiguous and can lead to many different readings:
 +
 +  * In word-initial position, it merely says that the word begins with a vowel. It could be any of the three short vowels //[aiu]//: افریقہ //afrīqah// “Africa”, اسلام //islām// “Islam”, اردو //urdū// “Urdu”.
 +    * If word-initial ا is followed by و or ی, they together could represent a word-initial long vowel //[ūoīe]//, such as in ایک //ek// “one”. In this case, ا should map to an empty string (because the next character itself will allow for transliteration by the long vowel).
 +  * In word-internal and word-final positions, ا is transliterated to the long vowel //ā// (pronounced as //a// in English //father//).
 +
 +The letter آ (ALEF MADDA) only appears in word-initial position and is transliterated as //ā// (which is written in other positions using normal ا).
 +
 +The YEH with the diacritic HAMZA above separates two consecutive vowels, e.g. جائے گا //jāe gā// “will go” or کوئی //koī// “some”.
 +
 +Similarly, the diacritic HAMZA above a و separates it from the preceding vowel as in ہاؤسنگ //hāūsing// “housing”. (In this case, the hamza is a separate character that is placed in the logical sequence after the و.)
 +
 +^ Unicode ^ Character ^ Pronunciation ^ Transliteration ^
 +| 0627 | ا | -, a: | a, i, u, 0, ā |
 +| 0622 | آ | a: | ā |
 +| 0648 | و | v, u:, o: | w, ū, o |
 +| 06CC | ی | j, i:, e: | y, ī, e |
 +| 06D2 | ے | e: | e |
 +| 0626 | ئ | - | 0 |
 +| 0674 | ٔ (high hamza) | - | 0 |
  

[ Back to the navigation ] [ Back to the content ]