
Institute of Formal and Applied Linguistics Wiki



Differences

This shows you the differences between two versions of the page.


padt:start [2011/05/27 20:42]
smrz
padt:start [2011/06/30 23:59]
smrz
Line 18: Line 18:
 tred /net/projects/ace/data/arabic/PADT/data/Prague/AEP/UMH_ARB_20040407.0001.{morpho,syntax}.pml
 </code>
 +
 +To improve how the various scripts and tree types are displayed, you can use the following setup, or something similar, in TrEd's config file:
 +
 +<file>
 +Font = "family:DejaVu Sans Condensed, size:14, weight:normal"
 +
 +NodeXSkip = 30;
 +NodeYSkip = 10;
 +</file>
  
 ===== Locations =====
Line 42: Line 51:
  
 The code base for the PADT project, i.e. for annotation, display, and processing of the data, is TrEd's ''padt'' extension together with the ''elixir'' extension, which ''padt'' depends on.
- 
 ===== Agenda =====
  
-===== References =====
 +Focus on paragraphs or sentences that lack PADT-Morpho annotation, especially non-annotated headlines:
  
 +<code bash>
 +btred -QTe '@w = $this->children(); @n = grep { $_->children() } @w; print ThisAddress() . "\n" if @n < 0.9 * @w' Penn/???/*.morpho*.pml 
 +</code>
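
The selection rule in the one-liner above can be read as: take the children of the current tree root, count how many of them have children of their own, and report the tree when that count is below 90% of the total. The following is a minimal sketch of that rule in Python on toy data, not the btred code itself; the dict-based tree layout and the names ''node'' and ''is_under_annotated'' are assumptions made only for illustration.

```python
# Toy stand-in for a tree node: a dict holding a list of child nodes.
# This layout is hypothetical and is not the PML data model.
def node(*children):
    return {"children": list(children)}


def is_under_annotated(root, threshold=0.9):
    """Return True if fewer than `threshold` of the root's children have children."""
    words = root["children"]
    annotated = [w for w in words if w["children"]]
    # Same comparison as `@n < 0.9 * @w` in the one-liner.
    return len(annotated) < threshold * len(words)


# Ten children under the root, eight of which carry something below them.
paragraph = node(*[node(node()) for _ in range(8)], node(), node())
print(is_under_annotated(paragraph))
```

Note that a root with no children at all is never reported, because the comparison is strict: zero is not less than zero.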
 +
 +
 +Focus on nodes in PADT-Syntax that do not have a valid ''afun'' annotation:
 +
 +<code bash>
 +btred -QTNe 'print ThisAddress() . "\n" if exists $this->{"afun"} and $this->{"afun"} eq "???"' Prague/???/*.syntax*.pml
 +</code>
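
The node-level check above reports a node only when it has an ''afun'' attribute and that attribute equals the placeholder ''???''; nodes without any ''afun'' attribute are not reported. A minimal Python sketch of the same test on a toy tree follows. The dict layout, the ''id'' keys, and the function names are assumptions for illustration and are not part of TrEd or PADT.

```python
def walk(node):
    """Yield the node itself and then all of its descendants, depth first."""
    yield node
    for child in node.get("children", []):
        yield from walk(child)


def nodes_with_placeholder_afun(root):
    """Return the ids of nodes whose `afun` is present and equal to '???'."""
    return [n["id"] for n in walk(root) if n.get("afun") == "???"]


# Hypothetical toy tree: n2 carries the placeholder, n3 has no afun at all.
tree = {
    "id": "n0",
    "children": [
        {
            "id": "n1",
            "afun": "Pred",
            "children": [
                {"id": "n2", "afun": "???"},
                {"id": "n3"},
            ],
        }
    ],
}
print(nodes_with_placeholder_afun(tree))
```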
 +
 +
 +There are some other tasks that have been partially solved but need to be refreshed and completed:
 +
 +* Retrain the CRF++ model for tagging selected morphological categories and apply it to prune remaining morphological ambiguities.
 +* Refresh and improve the code and rules for converting PATB phrase syntax trees into PADT-style dependency trees.
 +* Update PADT::Syntax annotation context (level synchronization, non-conflicting bindings). 
 +* Update PADT::Deeper annotation context (level synchronization, working schemas, modern stylesheets, non-conflicting bindings).
 +* Improve documentation.
 +
 +===== References =====
