Both sides previous revision
Previous revision
Next revision
|
Previous revision
Next revision
Both sides next revision
|
courses:mapreduce-tutorial [2012/01/24 08:41] straka |
courses:mapreduce-tutorial [2012/01/25 15:44] straka |
* [[.:mapreduce-tutorial:Step 2]]: Input and output format, testing data. | * [[.:mapreduce-tutorial:Step 2]]: Input and output format, testing data. |
* [[.:mapreduce-tutorial:Step 3]]: Basic mapper. | * [[.:mapreduce-tutorial:Step 3]]: Basic mapper. |
* [[.:mapreduce-tutorial:Step 4]]: Basic reducer. | |
* [[.:mapreduce-tutorial:Step 4]]: Counters. | * [[.:mapreduce-tutorial:Step 4]]: Counters. |
| * [[.:mapreduce-tutorial:Step 5]]: Basic reducer. |
| |
=== Controlling the cluster === | === Controlling the cluster === |
* [[.:mapreduce-tutorial:Step 5]]: Dynamic cluster for one computation. | * [[.:mapreduce-tutorial:Step 6]]: Running on cluster. |
* [[.:mapreduce-tutorial:Step 6]]: Dynamic cluster for several computations. | * [[.:mapreduce-tutorial:Step 7]]: Dynamic Hadoop cluster for several computations. |
| |
| **From now on, run all examples using a one-machine cluster. Running the scripts locally without any cluster has several disadvantages, most notably having only one reducer per job.** |
| |
| === MapReduce extended === |
| * [[.:mapreduce-tutorial:Step 8]]: Multiple mappers, reducers and partitioning. |
| * [[.:mapreduce-tutorial:Step 9]]: Hadoop properties. |
| * [[.:mapreduce-tutorial:Step 10]]: Properties of reducers, combiners. |
| * [[.:mapreduce-tutorial:Step 11]]: Initialization and cleanup of MR tasks. |
| * [[.:mapreduce-tutorial:Step 12]]: Additional output from mappers and reducers. |
| |
| === Advanced MapReduce exercises === |
| * [[.:mapreduce-tutorial:Step 13]]: Sorting |
| * [[.:mapreduce-tutorial:Step 14]]: N-gram language model |
| * [[.:mapreduce-tutorial:Step 15]]: K-means algorithm |
| |
===== Other ===== | ===== Other ===== |
* [[user:majlis:hadoop|Further information]] | * [[user:majlis:hadoop|Further information]] |
| |