[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
courses:rg:2012:segments [2012/12/29 15:25]
bilek
courses:rg:2012:segments [2013/01/03 22:38] (current)
popel
Line 21: Line 21:
 The most important question, though, is why do we do all this, because the data from the PDT tree are more thorough than the segments that we want to create! So, what is the exact reason? The most important question, though, is why do we do all this, because the data from the PDT tree are more thorough than the segments that we want to create! So, what is the exact reason?
  
-1) it is not because there are the training data +1) To prepare training data? 
-2) it is not because we use it as a testing data, because it has only 70% accuracy +> Probably no, because they don'use any machine learning approach. 
-3) It can be to show that it is difficult to create from the analytical tree, too +2) To prepare testing data
-4) We can use it as a "oracle experiment"how far can we go with plaintext?+> No. Because they already have some manually annotated sentences. Moreoverthe described approach (using PDT gold a-trees on input) has only 70% accuracy. 
 +3) As an "oracle experiment"using gold a-trees is an upper bound for using plaintext only. 
 +> Probably no. There are better algorithms (with higher precision than 70%) exploiting gold a-trees. 
 +4) To show some difficult cases with creating segmentation charts (even when gold a-trees are available). 
 +> Maybe.
 5) It can be just to fill up the space :) 5) It can be just to fill up the space :)
 +> ?
  
 ====How to Obtain Segments from Plain Text?==== ====How to Obtain Segments from Plain Text?====

[ Back to the navigation ] [ Back to the content ]