Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
pml-haters [2007/05/28 00:25] pajas |
pml-haters [2007/06/05 06:35] (current) bojar jen formatovani |
||
---|---|---|---|
Line 14: | Line 14: | ||
Please strongly prefer SAX-based tools to DOM-based tools. | Please strongly prefer SAX-based tools to DOM-based tools. | ||
+ | |||
===== Validation ===== | ===== Validation ===== | ||
- | Given a PML file, how do I validate it? I always forget... Please provide me with the one-liner to do the validation. | + | Given a PML file, how do I validate it? |
- | See [[user: | + | For most purposes, a libxml2 (DOM) based validator |
+ | < | ||
- | PP: there is a validation script at the [[http://ufal.mff.cuni.cz/jazz/pml/index_en.html|PML homepage]], but it uses DOM. A streaming variant that uses trang can be found in '' | + | For huge files, use< |
+ | Both scripts have decent user documentation. See inside the scripts if interested in the implementation details. | ||
===== XSH Won't Work: Blame XML Namespaces ===== | ===== XSH Won't Work: Blame XML Namespaces ===== | ||
Line 102: | Line 105: | ||
How do I create a suite of files with just the problematic sentence 345, i.e. files test-w.xml, test-m.xml, test-a.xml and test-t.xml, all properly referenced? A XML-Reader based script by Petr Pajas demonstrates that: | How do I create a suite of files with just the problematic sentence 345, i.e. files test-w.xml, test-m.xml, test-a.xml and test-t.xml, all properly referenced? A XML-Reader based script by Petr Pajas demonstrates that: | ||
- | < | + | < |
</ | </ | ||