Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
user:zeman:transliteration-of-urdu-to-latin-script [2010/11/10 14:25] zeman On postediting. |
user:zeman:transliteration-of-urdu-to-latin-script [2010/11/16 08:25] (current) zeman ~ |
||
---|---|---|---|
Line 1: | Line 1: | ||
====== Transliteration of Urdu to Latin Script ====== | ====== Transliteration of Urdu to Latin Script ====== | ||
+ | |||
+ | Copyright © 2010 by Dan Zeman < | ||
+ | License: GNU GPL | ||
+ | This research has been supported by the grant of the Czech Ministry of Education no. MSM0021620838. | ||
===== Transliteration versus Transcription ===== | ===== Transliteration versus Transcription ===== | ||
Line 18: | Line 22: | ||
===== Consonants ===== | ===== Consonants ===== | ||
- | Most of the consonants do not pose any serious problem. I decided to represent the retroflex consonants (ٹ, ڈ, ڑ) by a dot below their dental or other counterparts, | + | Most of the consonants do not pose any serious problem. I decided to represent the retroflex consonants (ٹ, ڈ, ڑ) by a dot below their dental or other counterparts, |
Some other notes: //j// is pronounced as in English, not as in Czech or German. //č// and //š// are used in Baltic and Slavic languages (among others) to represent the sounds that are usually written “ch” or “sh”, respectively, | Some other notes: //j// is pronounced as in English, not as in Czech or German. //č// and //š// are used in Baltic and Slavic languages (among others) to represent the sounds that are usually written “ch” or “sh”, respectively, | ||
Line 60: | Line 64: | ||
| 0645 | م | m | m | | | 0645 | م | m | m | | ||
| 0646 | ن | n | n | | | 0646 | ن | n | n | | ||
- | | 06BA | ں | n | ñ | | + | | 06BA | ں | ~ | ñ | |
| 0648 | و | v | w | | | 0648 | و | v | w | | ||
| 06C1 | ہ | h | h | | | 06C1 | ہ | h | h | | ||
Line 113: | Line 117: | ||
===== Short Vowels and Diacritics ===== | ===== Short Vowels and Diacritics ===== | ||
- | Without diacritics (which is more common), every consonant that is not followed by a long vowel may or may not be followed by a short vowel. I denote this possibility by the character for the neutral | + | Without diacritics (which is more common), every consonant that is not followed by a long vowel may or may not be followed by a short vowel. I denote this possibility by the character for the neutral |
//Warning! This section is under construction. I am still confused about the exact rules for Urdu vowel representation, | //Warning! This section is under construction. I am still confused about the exact rules for Urdu vowel representation, | ||
Line 153: | Line 157: | ||
<code bash> | <code bash> | ||
- | If you happen to sit on the ÚFAL network, you will find the script in '' | + | If you happen to sit on the ÚFAL network, you will find the script in '' |
I am also attaching the current snapshot of the two folders to this wiki {{: | I am also attaching the current snapshot of the two folders to this wiki {{: |