Both sides previous revision
Previous revision
Next revision
|
Previous revision
Next revision
Both sides next revision
|
external:tectomt:tutorial [2009/01/20 14:11] kravalova |
external:tectomt:tutorial [2009/01/20 15:30] kravalova |
| |
TectoMT is a highly modular NLP (Natural Language Processing) software system implemented in Perl programming language under Linux. It is primarily aimed at Machine Translation, making use of the ideas and technology created during the Prague Dependency Treebank project. At the same time, it is also hoped to significantly facilitate and accelerate development of software solutions of many other NLP tasks, especially due to re-usability of the numerous integrated processing modules (called blocks), which are equipped with uniform object-oriented interfaces. | TectoMT is a highly modular NLP (Natural Language Processing) software system implemented in Perl programming language under Linux. It is primarily aimed at Machine Translation, making use of the ideas and technology created during the Prague Dependency Treebank project. At the same time, it is also hoped to significantly facilitate and accelerate development of software solutions of many other NLP tasks, especially due to re-usability of the numerous integrated processing modules (called blocks), which are equipped with uniform object-oriented interfaces. |
| |
| |
===== Prerequisities ===== | ===== Prerequisities ===== |
| |
| In this tutorial, we assume |
| |
| * Your system is Linux |
| * Your shell is bash |
| * You have basic experience bash and you can read Perl |
| |
| |
| |
| |
==== Installation and setup ==== | |
| |
* Install TrED FIXME | |
| |
* Checkout SVN repository. If you are running this installation in computer lab in Prague, you have to create the repository in directory /home/BIG (because data quota don't apply here): | |
| |
| ==== Installation and setup ==== |
| |
| * Checkout SVN repository. If you are running this installation in computer lab in Prague, you have checkout the repository into directory /home/BIG (because data quotas don't apply here): |
| |
<code bash> | <code bash> |
| |
| |
==== SVO typology ==== | |
| |
TODO | |
| |
| ==== SVO to SOV ==== |
| |
| **Motivation**: During translation from an SVO based language (English) to an SOV based language (Korean) we might need to change the word order from SVO to SOV. |
| |
| **Task**: On analytical layer, change the word order from SVO to SOV. |
| |
| |
| |
==== Prepositions ==== | ==== Prepositions ==== |
| |
In dependency approach a question "where to hang prepositions" arises. In praguian style (PDT), prepositions are heads of the subtree and the noun/pronoun is dependent on the preposition. However, another ordering might be preferable: The noun/pronoun might be the head of subtree, while the preposition would take the role of a modifier. | **Motivation**: In dependency approach a question "where to hang prepositions" arises. In praguian style (PDT), prepositions are heads of the subtree and the noun/pronoun is dependent on the preposition. However, another ordering might be preferable: The noun/pronoun might be the head of subtree, while the preposition would take the role of a modifier. |
| |
TODO obrazek | TODO obrazek |
| |
The task is to rehang all prepositions as indicated at the picture. You may assume that prepositions have at most 1 child. | **Task**: The task is to rehang all prepositions as indicated at the picture. You may assume that prepositions have at most 1 child. |
| |
| ** Instructions**: |
| |
You are going to need these new methods: | You are going to need these new methods: |