Both sides previous revision
Previous revision
Next revision
|
Previous revision
|
user:zeman:interset:tagsets:urdu [2010/05/05 12:56] zeman Stemmer funguje, ale divně. |
user:zeman:interset:tagsets:urdu [2010/05/05 13:32] (current) zeman Tagger taky nefunguje. |
* [[http://www.crulp.org/software/langproc/UrduStemmer.htm|Urdu Stemmer.]] This is a Windows GUI program. It requires that some files be in a fixed path but it works. However, its precision is questionable. For example, it segments "ناموں" as "نا|موں" (prefix|stem). | * [[http://www.crulp.org/software/langproc/UrduStemmer.htm|Urdu Stemmer.]] This is a Windows GUI program. It requires that some files be in a fixed path but it works. However, its precision is questionable. For example, it segments "ناموں" as "نا|موں" (prefix|stem). |
* [[http://www.crulp.org/software/langproc/MorphologicalAnalyzer.htm|Urdu Finite State Morphological Analyzer.]] This is a Windows program. I have not been able to run it because it requires Microsoft Visual C++, particularly the ''mfc42ud.dll'' library (Unicode debug version). However, there is a text file with the lexicon that could be potentially converted for PC Kimmo. | * [[http://www.crulp.org/software/langproc/MorphologicalAnalyzer.htm|Urdu Finite State Morphological Analyzer.]] This is a Windows program. I have not been able to run it because it requires Microsoft Visual C++, particularly the ''mfc42ud.dll'' library (Unicode debug version). However, there is a text file with the lexicon that could be potentially converted for PC Kimmo. |
* Urdu Statistical POS Tagger: http://www.crulp.org/software/langproc/POS_tagger.htm | * [[http://www.crulp.org/software/langproc/POS_tagger.htm|Urdu Statistical POS Tagger.]] This is a Windows program. I have not been able to run it on Emille data. There was an exception. However, there are text files with lexical data that could be potentially used to implement another tagger. |
* English-to-Urdu MT (based on LFG): http://www.crulp.org/software/langproc/E2UMachineTranslationSystem.htm | * English-to-Urdu MT (based on LFG): http://www.crulp.org/software/langproc/E2UMachineTranslationSystem.htm |
* Hassan Sajjad, Helmut Schmid: Tagging Urdu Text with Parts of Speech: A Tagger Comparison (EACL 2009 Athens): http://portal.acm.org/citation.cfm?id=1609067.1609144, http://www.aclweb.org/anthology/E/E09/E09-1079.pdf | * Hassan Sajjad, Helmut Schmid: Tagging Urdu Text with Parts of Speech: A Tagger Comparison (EACL 2009 Athens): http://portal.acm.org/citation.cfm?id=1609067.1609144, http://www.aclweb.org/anthology/E/E09/E09-1079.pdf |