Differences
This shows you the differences between two versions of the page.
courses:rg:2013:composite-activities [2013/04/23 07:38] popel |
courses:rg:2013:composite-activities [2013/09/29 21:35] (current) machys |
||
---|---|---|---|
Line 1: | Line 1: | ||
- After reading the first three chapters: | - After reading the first three chapters: | ||
* list the main parts/ | * list the main parts/ | ||
- | * Is their creation dependent on each other? | + | * Is their creation dependent on other components? |
- | - Thinking about the scripts | + | - Thinking about the scripts: |
- | * What is the main reason (the biggest advantage) of using scripts? What kind of information it brings? (Hint: page 2, page 8) | + | * What is the main reason (the biggest advantage) of using scripts? What kind of information |
- | * The authors don't get the " | + | * The authors don't get the " |
- | - In last paragraph of Section 3, a method is described that enhances the robustness of the model (binarization of all association weights < | + | - In the last paragraph of Section 3, a method is described that enhances the robustness of the model (binarization of all association weights < |
- | * Why it works? (=> Why it should work the best?) | + | * Why does it work? (=> Why should |
- | * Have you any idea how to make it different? | + | * Do you have any idea how to do it differently? |
- | - Which tools enhanced the tasks //Attribute recognition// | + | |
+ | |||
+ | ====== Answers ====== | ||
+ | |||
+ | - First set | ||
+ | * list components [[https:// | ||
+ | * dependence of components (the same graph) | ||
+ | - Scripts | ||
+ | * reason?: Cheap source of training data, Many combinations, | ||
+ | * four ways (2x2): word choice, either 1) direct use of words from the data or 2) mapping to word classes from WordNet, crossed with weighting, either 3) simple word frequency or 4) TF*IDF | ||
+ | - There was a discussion about the third set of questions. We are not sure why the authors do that. There was a strongly supported opinion that the authors do a lot of unnecessary work, which is lost by binarization. | ||
+ | - 4th: In their answers, the majority of people nominated the use of TF*IDF when there is no training data as the best idea. |
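
The answers above contrast simple word frequency with TF*IDF and mention binarization of the association weights. The following is a minimal sketch of those three ideas on toy data, not the authors' implementation. The activity names, the word lists, the function names, and the threshold value are all hypothetical, and the direction of the thresholding (weights at or above the threshold become 1) is an assumption as well.

```python
import math
from collections import Counter

# Hypothetical toy "scripts": words collected from descriptions of each activity.
scripts = {
    "make_salad": ["cut", "cucumber", "wash", "cucumber", "knife", "bowl"],
    "make_toast": ["cut", "bread", "toaster", "knife", "plate"],
    "boil_egg": ["egg", "pot", "water", "wash", "pot"],
}


def term_frequency(words):
    """Relative frequency of each word within one activity's scripts."""
    counts = Counter(words)
    total = len(words)
    return {word: count / total for word, count in counts.items()}


def tf_idf(scripts):
    """TF*IDF weight of each word for each activity.

    Words that occur in the scripts of many activities get a low weight,
    words specific to one activity get a high weight.
    """
    n_activities = len(scripts)
    # Number of activities whose scripts mention each word.
    doc_freq = Counter()
    for words in scripts.values():
        doc_freq.update(set(words))

    weights = {}
    for activity, words in scripts.items():
        tf = term_frequency(words)
        weights[activity] = {
            word: freq * math.log(n_activities / doc_freq[word])
            for word, freq in tf.items()
        }
    return weights


def binarize(weights, threshold):
    """Map every weight to 1 or 0 (assumed rule: 1 if weight >= threshold)."""
    return {
        activity: {word: 1 if w >= threshold else 0 for word, w in word_weights.items()}
        for activity, word_weights in weights.items()
    }


if __name__ == "__main__":
    weights = tf_idf(scripts)
    binary = binarize(weights, threshold=0.1)  # threshold chosen arbitrarily
    for activity in scripts:
        print(activity, binary[activity])
```

In this toy data, "cut" appears in the scripts of two activities and so receives a lower TF*IDF weight for `make_salad` than "cucumber", which appears only there. After binarization the magnitudes are discarded, which is the loss of information the discussion in the third answer refers to.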