padt:start [2013/05/27 13:15] zeman Updated the path to the working copy checked out from SVN. |
padt:start [2013/05/27 17:35] zeman TODO: Treex reader. |
Install [[http://ufal.mff.cuni.cz/~pajas/tred/|TrEd]] including the [[http://ufal.mff.cuni.cz/~pajas/tred/extensions/padt/documentation/|padt]] and [[http://ufal.mff.cuni.cz/~pajas/tred/extensions/elixir/documentation/|elixir]] extensions from the default TrEd repository http://ufal.mff.cuni.cz/~pajas/tred/extensions/. | Install [[http://ufal.mff.cuni.cz/~pajas/tred/|TrEd]] including the [[http://ufal.mff.cuni.cz/~pajas/tred/extensions/padt/documentation/|padt]] and [[http://ufal.mff.cuni.cz/~pajas/tred/extensions/elixir/documentation/|elixir]] extensions from the default TrEd repository http://ufal.mff.cuni.cz/~pajas/tred/extensions/. |
| |
The SVN repository of the PADT project is https://svn.ms.mff.cuni.cz/svn/padt/. A working copy is accessible at /net/projects/padt on the ÚFAL network. | The SVN repository of the PADT project is https://svn.ms.mff.cuni.cz/svn/padt/. A working copy is accessible at ''/net/projects/padt'' on the ÚFAL network. |
| |
The project's data are stored in the main subdirectory ''data'', which is split further into ''Prague'', ''Penn'', and ''ElixirFM'', explained below. | The project's data are stored in the main subdirectory ''data'', which is split further into ''Prague'', ''Penn'', and ''ElixirFM'', explained below. |
The code base for the PADT project, i.e. for annotation, display, and processing of the data, is TrEd's ''padt'' extension together with the ''elixir'' extension, which ''padt'' depends on. | The code base for the PADT project, i.e. for annotation, display, and processing of the data, is TrEd's ''padt'' extension together with the ''elixir'' extension, which ''padt'' depends on. |
===== Agenda ===== | ===== Agenda ===== |
| |
| * Write a block to read the PADT 2.0 data in Treex. An XML schema is needed. |
| |
Focus on paragraphs/sentences that lack PADT-Morpho annotation, esp. non-annotated headlines: | Focus on paragraphs/sentences that lack PADT-Morpho annotation, esp. non-annotated headlines: |