Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision | Next revision Both sides next revision | ||
padt:start [2013/05/27 23:19] zeman Odkaz na Trac. |
padt:start [2013/05/30 12:34] zeman Zalámání vět. |
||
---|---|---|---|
Line 44: | Line 44: | ||
data/ | data/ | ||
- | The project' | + | The project' |
There is also the ' | There is also the ' | ||
The code base for the PADT project, i.e. for annotation, display, and processing of the data, is the TrEd's '' | The code base for the PADT project, i.e. for annotation, display, and processing of the data, is the TrEd's '' | ||
+ | |||
===== Agenda ===== | ===== Agenda ===== | ||
* Write a block to read the PADT 2.0 data in Treex. An XML schema is needed. | * Write a block to read the PADT 2.0 data in Treex. An XML schema is needed. | ||
+ | * Jak je to teď se zalámáním vět? Bude se nějak využívat prvek Unit? Současné stromy zatím pořád odpovídají odstavcům, s průměrným počtem 38 tokenů na strom. Treebank obsahuje 874 souborů (dokumentů), | ||
Focus on paragraphs/ | Focus on paragraphs/ |