Institute of Formal and Applied Linguistics Wiki


Differences

This shows you the differences between two versions of the page.

user:zeman:transliteration-of-urdu-to-latin-script [2010/11/09 15:02]
zeman wy
user:zeman:transliteration-of-urdu-to-latin-script [2010/11/09 16:14]
zeman Hamza.
Line 82: Line 82:
   * In word-final position, I assume that the only possible reading is //ī//.
   * In all other cases I output //[yīe]//.
 +
 +The letter ے (YEH BARREE) only appears in word-final position and is transliterated as //e// (which is written in other positions using the ambiguous ی).
 +
 +The letter ا (ALEF) is ambiguous and can lead to many different readings:
 +
 +  * In word-initial position, it merely signals that the word begins with a vowel. It could be any of the three short vowels //[aiu]//: افریقہ //afrīqah// “Africa”, اسلام //islām// “Islam”, اردو //urdū// “Urdu”.
 +    * If word-initial ا is followed by و or ی, the two letters together may represent a word-initial long vowel //[ūoīe]//, as in ایک //ek// “one”. In this case, ا should map to an empty string, because the following character already yields the long vowel in transliteration.
 +  * In word-internal and word-final positions, ا is transliterated to the long vowel //ā// (pronounced as //a// in English //father//).
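
The positional rules for ا listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the converter actually used: the function name ''alef_readings'' is hypothetical, the word is assumed to be a single token without surrounding punctuation, and the sub-rule for a following و or ی is applied literally (the empty string is the only candidate returned).

```python
ALEF = "\u0627"  # ARABIC LETTER ALEF
WAW = "\u0648"   # ARABIC LETTER WAW
YEH = "\u06cc"   # ARABIC LETTER FARSI YEH (code point 06CC)


def alef_readings(word, i):
    """Return the candidate transliterations of the ALEF at word[i].

    Hypothetical helper: it returns a list because the reading
    stays ambiguous in some positions.
    """
    if word[i] != ALEF:
        raise ValueError("word[i] is not ALEF")
    if i > 0:
        # Word-internal or word-final ALEF: long vowel.
        return ["ā"]
    if len(word) > 1 and word[1] in (WAW, YEH):
        # Word-initial ALEF before WAW or YEH: the next letter
        # supplies the long vowel, so ALEF itself maps to nothing.
        return [""]
    # Word-initial ALEF otherwise: any of the three short vowels.
    return ["a", "i", "u"]


islam = "\u0627\u0633\u0644\u0627\u0645"  # the word transliterated islām
print(alef_readings(islam, 0))
print(alef_readings(islam, 3))
```

Returning a list instead of a single string keeps the ambiguity visible to whatever later step picks one reading.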
 +
 +The letter آ (ALEF MADDA) only appears in word-initial position and is transliterated as //ā// (which is written in other positions using normal ا).
 +
 +The letter ئ (YEH WITH HAMZA ABOVE) separates two consecutive vowels, e.g. جائے گا //jāe gā// “will go” or کوئی //koī// “some”.
 +
 +Similarly, the diacritic HAMZA above a و separates it from the preceding vowel as in ہاؤسنگ //hāūsing// “housing”. (In this case, the hamza is a separate character that is placed in the logical sequence after the و.)
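
One possible way to handle both hamza cases is to drop the hamza and record where the vowel boundary falls. The sketch below is an assumption-laden illustration: the function name ''strip_hamza'' is hypothetical, and because it is not certain which code point the separate hamza has in the data, both U+0654 (combining hamza above) and U+0674 (high hamza) are accepted.

```python
YEH_HAMZA = "\u0626"  # ARABIC LETTER YEH WITH HAMZA ABOVE
WAW = "\u0648"        # ARABIC LETTER WAW
# Assumption: accept both code points that may encode the separate hamza.
HAMZA_MARKS = {"\u0654", "\u0674"}


def strip_hamza(word):
    """Remove hamza characters and report vowel boundaries.

    Returns (stripped_word, boundaries). Each boundary is an index
    into stripped_word at which a new vowel begins.
    """
    out = []
    boundaries = []
    for ch in word:
        if ch == YEH_HAMZA:
            # The letter itself maps to nothing; the next vowel
            # starts at the current end of the output.
            boundaries.append(len(out))
        elif ch in HAMZA_MARKS:
            # The hamza follows its WAW in logical order, so the
            # boundary lies just before that WAW.
            if out and out[-1] == WAW:
                boundaries.append(len(out) - 1)
            # A hamza mark after any other letter is simply dropped.
        else:
            out.append(ch)
    return "".join(out), boundaries


koi = "\u06a9\u0648\u0626\u06cc"  # the word transliterated koī
stripped, marks = strip_hamza(koi)
print(len(stripped), marks)
```

The boundary indices are only bookkeeping; in the transliteration itself both hamza forms contribute nothing.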
 +
 +^ Unicode ^ Character ^ Pronunciation ^ Transliteration ^
 +| 0627 | ا | -, a: | a, i, u, 0, ā |
 +| 0622 | آ | a: | ā |
 +| 0648 | و | v, u:, o: | w, ū, o |
 +| 06CC | ی | j, i:, e: | y, ī, e |
 +| 06D2 | ے | e: | e |
 +| 0626 | ئ | - | 0 |
 +| 0674 | ٔ (high hamza) | - | 0 |
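
The Transliteration column of the table can be held as a plain lookup from character to candidate readings. The following is a minimal sketch, not the actual implementation: the names ''READINGS'' and ''render'' are hypothetical, the table's 0 is taken to mean the empty string, and U+0654 is added next to the listed 0674 as an assumption. Ambiguous letters are rendered in the bracket notation used on this page (e.g. //[yīe]//), with 0 standing for the empty reading as in the table.

```python
# Candidate readings per character, copied from the Transliteration column.
READINGS = {
    "\u0627": ["a", "i", "u", "", "ā"],  # ALEF
    "\u0622": ["ā"],                     # ALEF MADDA
    "\u0648": ["w", "ū", "o"],           # WAW
    "\u06cc": ["y", "ī", "e"],           # YEH (06CC)
    "\u06d2": ["e"],                     # YEH BARREE
    "\u0626": [""],                      # YEH WITH HAMZA ABOVE
    "\u0674": [""],                      # high hamza, as listed in the table
    "\u0654": [""],                      # combining hamza above (assumption)
}


def render(ch):
    """Render one character without looking at its position.

    Characters missing from READINGS are passed through unchanged.
    """
    candidates = READINGS.get(ch, [ch])
    if len(candidates) == 1:
        return candidates[0]
    # Several readings: bracket notation, 0 marks the empty reading.
    return "[" + "".join(c or "0" for c in candidates) + "]"


print(render("\u0648"))
print(render("\u06d2"))
```

A position-aware converter would narrow these candidate lists using the rules described in the text; the lookup only records what is possible for each letter.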
  
