Differences

This shows you the differences between two versions of the page.

--- courses:mapreduce-tutorial [2012/01/25 15:44] straka
+++ courses:mapreduce-tutorial [2012/01/26 18:44] straka
@@ Line 21: / Line 21: @@
   * [[.:mapreduce-tutorial:Step 6]]: Running on cluster.
   * [[.:mapreduce-tutorial:Step 7]]: Dynamic Hadoop cluster for several computations.
-**From now on, run all examples using a one-machine cluster. Running the scripts locally without any cluster has several disadvantages, most notably having only one reducer per job.**
 === MapReduce extended ===
+From now on, it is best to run MR jobs using a one-machine cluster. Running the scripts locally without any cluster has several disadvantages, most notably having only one reducer per job.
   * [[.:mapreduce-tutorial:Step 8]]: Multiple mappers, reducers and partitioning.
   * [[.:mapreduce-tutorial:Step 9]]: Hadoop properties.
-  * [[.:mapreduce-tutorial:Step 10]]: Properties of reducers, combiners.
+  * [[.:mapreduce-tutorial:Step 10]]: Combiners.
-  * [[.:mapreduce-tutorial:Step 11]]: Initialization and cleanup of MR tasks.
+  * [[.:mapreduce-tutorial:Step 11]]: Initialization and cleanup of MR tasks, performance of combiners.
   * [[.:mapreduce-tutorial:Step 12]]: Additional output from mappers and reducers.
 === Advanced MapReduce exercises ===
+Exercises in this section can be done in any order, but it is recommended to try solving all of them. The [[.:mapreduce-tutorial:Perl API|Perl API reference]] may come in handy.
   * [[.:mapreduce-tutorial:Step 13]]: Sorting
   * [[.:mapreduce-tutorial:Step 14]]: N-gram language model
-  * [[.:mapreduce-tutorial:Step 15]]: K-means algorithm
+  * [[.:mapreduce-tutorial:Step 15]]: K-means clustering
+===== Day 2 =====
+Today we will be using the [[http://hadoop.apache.org/common/docs/r1.0.0/api/index.html|Java API]].
+=== Environment ===
+  * [[.:mapreduce-tutorial:Step 21]]: Preparing the environment.
+  * [[.:mapreduce-tutorial:Step 22]]: Optional -- Setting up Eclipse.
 ===== Other =====
   * [[user:majlis:hadoop|Further information]]


Institute of Formal and Applied Linguistics Wiki
