Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
external:tectomt:tutorial [2009/01/20 12:08] kravalova |
external:tectomt:tutorial [2009/01/20 15:31] kravalova |
||
---|---|---|---|
Line 8: | Line 8: | ||
TectoMT is a highly modular NLP (Natural Language Processing) software system implemented in Perl programming language under Linux. It is primarily aimed at Machine Translation, | TectoMT is a highly modular NLP (Natural Language Processing) software system implemented in Perl programming language under Linux. It is primarily aimed at Machine Translation, | ||
+ | |||
===== Prerequisities ===== | ===== Prerequisities ===== | ||
+ | |||
+ | In this tutorial, we assume | ||
+ | |||
+ | * Your system is Linux | ||
+ | * Your shell is bash | ||
+ | * You have basic experience bash and you can read Perl | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | |||
Line 15: | Line 27: | ||
==== Installation and setup ==== | ==== Installation and setup ==== | ||
- | TODO popsat instalaci | + | * Checkout SVN repository. If you are running this installation in computer lab in Prague, you have checkout the repository into directory /home/BIG (because data quotas don't apply here): |
- | Before running any experiments with TectoMT, you must set up your environment by running | + | <code bash> |
+ | cd ~/BIG | ||
+ | svn --username < | ||
+ | </ | ||
+ | |||
+ | * In '' | ||
<code bash> | <code bash> | ||
- | source config/init_devel_environ.sh | + | cd tectomt/install |
+ | ./install.sh | ||
</ | </ | ||
+ | |||
+ | * In your '' | ||
+ | |||
+ | <code bash> | ||
+ | source ~/ | ||
+ | </ | ||
+ | |||
+ | |||
+ | |||
+ | |||
- | ==== Theoretical background ==== | ||
- | TODO obrazek | ||
Line 35: | Line 61: | ||
- | ==== TrEd ==== | ||
- | TODO malicko o TrEdu a obrazek | ||
Line 55: | Line 79: | ||
This tutorial itself has its blocks in '' | This tutorial itself has its blocks in '' | ||
+ | |||
Line 60: | Line 85: | ||
==== Layers of Linguistic Structures ==== | ==== Layers of Linguistic Structures ==== | ||
+ | |||
+ | {{ external: | ||
TectoMT blocks repository is saved in '' | TectoMT blocks repository is saved in '' | ||
Line 251: | Line 278: | ||
==== Task ==== | ==== Task ==== | ||
A block which, given an analytical tree ('' | A block which, given an analytical tree ('' | ||
+ | |||
Line 296: | Line 324: | ||
- | //Advanced version//: The output of our block is not ideal - we don't solve coordination and subordinate conjunctions. | + | //Advanced version//: The output of our block might still be incorrect in special cases - we don't solve coordination and subordinate conjunctions. |
Line 303: | Line 331: | ||
- | ==== SVO typology ==== | ||
- | TODO | + | |
+ | |||
+ | |||
+ | |||
+ | ==== SVO to SOV ==== | ||
+ | |||
+ | **Motivation**: | ||
+ | |||
+ | **Task**: On analytical layer, change the word order from SVO to SOV. | ||
+ | |||
+ | **Instructions**: | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | |||
Line 319: | Line 362: | ||
==== Prepositions ==== | ==== Prepositions ==== | ||
- | In dependency approach a question "where to hang prepositions" | + | **Motivation**: |
TODO obrazek | TODO obrazek | ||
- | The task is to rehang all prepositions as indicated at the picture. You may assume that prepositions have at most 1 child. | + | **Task**: |
+ | |||
+ | ** Instructions**: | ||
You are going to need these new methods: | You are going to need these new methods: | ||
Line 330: | Line 375: | ||
* '' | * '' | ||
- | You can use block template in '' | + | //Hint//: |
+ | * On analytical layer, you can use this test to recognize prepositions: | ||
+ | * You can use block template in '' | ||
- | //Advanced version//: What happens in case of multiword prepositions? | + | //Advanced version//: What happens in case of multiword prepositions? |