Institute of Formal and Applied Linguistics Wiki


Differences

This shows you the differences between two versions of the page.

courses:mapreduce-tutorial [2012/01/25 15:40]
straka
courses:mapreduce-tutorial [2012/01/25 18:38]
straka
Line 22:
   * [[.:mapreduce-tutorial:Step 7]]: Dynamic Hadoop cluster for several computations.

-**From now on, run all examples using a one-machine cluster. Running the scripts locally without any cluster has several disadvantages, most notably having only one reducer per job.**
+From now on, it is best to run MR jobs using a one-machine cluster. Running the scripts locally without any cluster has several disadvantages, most notably having only one reducer per job.

 === MapReduce extended ===
Line 29:
   * [[.:mapreduce-tutorial:Step 10]]: Combiners.
   * [[.:mapreduce-tutorial:Step 11]]: Initialization and cleanup of MR tasks.
-  * [[.:mapreduce-tutorial:Step 12]]: Reducers
-  *  Initialization and cleanup of MR tasks.
-  * Work dir.
+  * [[.:mapreduce-tutorial:Step 12]]: Additional output from mappers and reducers.

 === Advanced MapReduce exercises ===
-  * sorting
-  * N-grams with indexes
-  * K-means
+  * [[.:mapreduce-tutorial:Step 13]]: Sorting
+  * [[.:mapreduce-tutorial:Step 14]]: N-gram language model
+  * [[.:mapreduce-tutorial:Step 15]]: K-means algorithm

 ===== Other =====
   * [[user:majlis:hadoop|Further information]]
  