[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision Both sides next revision
courses:mapreduce-tutorial:step-28 [2012/01/29 16:38]
straka
courses:mapreduce-tutorial:step-28 [2012/01/29 16:38]
straka
Line 7: Line 7:
 ===== Exercise 1 ===== ===== Exercise 1 =====
  
-Improve the last [[.:step-27#exercise|inverted index creation exercise]] to+Improve the last [[.:step-27#exercise|inverted index creation exercise]], such that
   - in first job, create a list of unique document names. Number the documents using the order in this list.   - in first job, create a list of unique document names. Number the documents using the order in this list.
   - in second job, create for each word sorted list of ''DocWithOccurences<IntWritable>'', where the document is identified by its number (contrary to the previous exercise, where ''Text'' was used to identify the document).   - in second job, create for each word sorted list of ''DocWithOccurences<IntWritable>'', where the document is identified by its number (contrary to the previous exercise, where ''Text'' was used to identify the document).

[ Back to the navigation ] [ Back to the content ]