courses:mapreduce-tutorial [2012/01/27 23:31] straka
=== MapReduce extended ===
From now on, it is best to run MR jobs on a one-machine cluster: create one with ''hadoop-cluster'' for 3 hours (10800 s) and submit jobs using ''-jt cluster_master''. Running the scripts locally without any cluster has several disadvantages, most notably allowing only one reducer per job.
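The cluster workflow described above can be sketched as follows. This is a minimal sketch: only the 3-hour duration and the ''-jt'' flag come from the text; the exact arguments of ''hadoop-cluster'', the ''CLUSTER_MASTER'' address, and the job script invocation are assumptions, so check the script's own usage message.

```shell
# Sketch of the intended session. The argument forms of the course's
# hadoop-cluster script are assumptions, not its documented interface.
DURATION=$((3 * 3600))   # 3 hours = 10800 seconds, as in the note above

# Start a one-machine cluster for that long; it prints the address of
# the cluster master (hypothetical invocation):
#   hadoop-cluster 1 "$DURATION"

# Submit a job to the running cluster instead of executing it locally.
# CLUSTER_MASTER stands for the host:port printed by hadoop-cluster:
#   perl script.pl run -jt "$CLUSTER_MASTER" input output

echo "$DURATION"
```

Submitting with ''-jt'' is what lifts the one-reducer limitation of local runs: the job then uses the cluster's configured number of reducers.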
  * [[.:mapreduce-tutorial:Step 8]]: Multiple mappers, reducers and partitioning.
  * [[.:mapreduce-tutorial:Step 9]]: Hadoop properties.
  * [[.:mapreduce-tutorial:Step 24]]: Mappers, running Java Hadoop jobs.
  * [[.:mapreduce-tutorial:Step 25]]: Reducers, combiners and partitioners.
  * [[.:mapreduce-tutorial:Step 26]]: Job configuration, counters and job context.
  * [[.:mapreduce-tutorial:Step 27]]: Reusing Mapper and Reducer code.