Differences
This shows you the differences between two versions of the page.
padt:start [2011/07/01 00:02] smrz |
padt:start [2013/05/27 17:35] zeman TODO: Treex reader. |
---|---|---|---|
Line 2: | Line 2: | ||
http:// | http:// | ||
- | |||
- | ===== Overview ===== | ||
===== Setup ===== | ===== Setup ===== | ||
Line 9: | Line 7: | ||
Install [[http:// | Install [[http:// | ||
- | The SVN repository of the PADT project is https:// | + | The SVN repository of the PADT project is https:// |
The project' | The project' | ||
Line 52: | Line 50: | ||
The code base for the PADT project, i.e. for annotation, display, and processing of the data, is the TrEd's '' | The code base for the PADT project, i.e. for annotation, display, and processing of the data, is the TrEd's '' | ||
===== Agenda ===== | ===== Agenda ===== | ||
+ | |||
+ | * Write a block to read the PADT 2.0 data in Treex. An XML schema is needed. | ||
Focus on paragraphs/ | Focus on paragraphs/ | ||
Line 67: | Line 67: | ||
- | There are some other task that have been partially solved, but need to be refreshed and completed: | + | There are some other tasks that have been partially solved |
* Retrain the CRF++ model for tagging selected morphological categories and apply it to prune remaining morphological ambiguities. | * Retrain the CRF++ model for tagging selected morphological categories and apply it to prune remaining morphological ambiguities. |