Differences
This shows you the differences between two versions of the page.
courses:rg:2013:composite-activities [2013/04/23 04:33] machys |
courses:rg:2013:composite-activities [2013/09/29 21:35] (current) machys |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | - After reading first three chapters: | + | - After reading |
- | * list the main parts/ | + | * list the main parts/ |
- | * How is their creation | + | * Is their creation |
- | - Thinking about the scripts | + | - Thinking about the scripts: |
- | * What is the main reason (the biggest advantage) of using scripts? What kind of information it brings? (Hint: page 2, page 8) | + | * What is the main reason (the biggest advantage) of using scripts? What kind of information |
- | * Truth is - authors don't get the " | + | * The authors don't get the " |
- | - In last paragraph of section | + | - In the last paragraph of Section |
- | * Why it works? (=> Why it should work the best?) | + | * Why does it work? (=> Why should |
- | * Have you any idea how to make it different way? | + | * Do you have any idea how to do it differently? |
- | | + | |
+ | |||
+ | ====== Answers ====== | ||
+ | |||
+ | - First set | ||
+ | * list components [[https:// | ||
+ | * dependence of components (the same graph) | ||
+ | - Scripts | ||
+ | * reason?: Cheap source of training data, Many combinations, | ||
+ | * four ways (2x2): words are either 1) used directly from the data or 2) mapped to word classes from WordNet; weights are either 3) simple word frequency or 4) TF*IDF | ||
+ | - There was a discussion about the third set of questions. We are not sure why the authors do that. There was a strongly supported opinion that the authors do a lot of unnecessary work, which is lost by binarization.
+ | - 4th: In their answers, most people nominated the use of TF*IDF when no training data is available as the best idea.
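
To make the contrast between simple word frequency and TF*IDF concrete, here is a minimal sketch. It assumes the common `tf * log(N / df)` variant, which may differ from the exact weighting the authors use; the toy descriptions and the function names `term_frequencies` and `tf_idf` are hypothetical and serve only as illustration.

```python
import math
from collections import Counter


def term_frequencies(docs):
    """Simple word frequency: raw counts of each word per document."""
    return [Counter(doc) for doc in docs]


def tf_idf(docs):
    """TF*IDF per document, using the common tf * log(N / df) variant.

    This is an assumed formula, not necessarily the one used in the paper.
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each word occurs.
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append(
            {word: count * math.log(n_docs / doc_freq[word])
             for word, count in counts.items()}
        )
    return weights


# Hypothetical toy "script" descriptions, already tokenized.
docs = [
    ["open", "the", "door"],
    ["close", "the", "door"],
    ["pour", "the", "water"],
]

counts = term_frequencies(docs)
weights = tf_idf(docs)
```

With raw frequency, "the" counts as much as "open" in the first description. With TF*IDF, a word that occurs in every description gets weight zero, and a word specific to one description gets the highest weight, which is why this weighting needs no annotated training data.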