Differences
This shows you the differences between two versions of the page.
Both sides previous revision
Previous revision
Next revision
|
Previous revision
Next revision
Both sides next revision
|
courses:mapreduce-tutorial [2012/01/25 15:44] straka |
courses:mapreduce-tutorial [2012/01/25 21:10] straka |
* [[.:mapreduce-tutorial:Step 7]]: Dynamic Hadoop cluster for several computations. | * [[.:mapreduce-tutorial:Step 7]]: Dynamic Hadoop cluster for several computations. |
| |
**From now on, run all examples using a one-machine cluster. Running the scripts locally without any cluster has several disadvantages, most notably having only one reducer per job.** | From now on, it is best to run MR jobs using a one-machine cluster. Running the scripts locally without any cluster has several disadvantages, most notably having only one reducer per job. |
| |
=== MapReduce extended === | === MapReduce extended === |
* [[.:mapreduce-tutorial:Step 8]]: Multiple mappers, reducers and partitioning. | * [[.:mapreduce-tutorial:Step 8]]: Multiple mappers, reducers and partitioning. |
* [[.:mapreduce-tutorial:Step 9]]: Hadoop properties. | * [[.:mapreduce-tutorial:Step 9]]: Hadoop properties. |
* [[.:mapreduce-tutorial:Step 10]]: Properties of reducers, combiners. | * [[.:mapreduce-tutorial:Step 10]]: Combiners. |
* [[.:mapreduce-tutorial:Step 11]]: Initialization and cleanup of MR tasks. | * [[.:mapreduce-tutorial:Step 11]]: Initialization and cleanup of MR tasks, performance of combiners. |
* [[.:mapreduce-tutorial:Step 12]]: Additional output from mappers and reducers. | * [[.:mapreduce-tutorial:Step 12]]: Additional output from mappers and reducers. |
| |