Both sides previous revision
Previous revision
Next revision
|
Previous revision
Next revision
Both sides next revision
|
courses:mapreduce-tutorial [2012/01/29 21:50] straka [xmlrpc dokuvimki edit] |
courses:mapreduce-tutorial [2012/01/31 12:44] straka |
| |
=== MapReduce extended === | === MapReduce extended === |
From now on, it is best to run MR jobs using a one-machine cluster -- create a one-machine cluster using ''hadoop-cluster'' for 3h (10800s) and run jobs using ''-jt cluster_master''. Running the scripts locally without any cluster has several disadvantages, most notably having only one reducer per job. | |
* [[.:mapreduce-tutorial:Step 8]]: Multiple mappers, reducers and partitioning. | * [[.:mapreduce-tutorial:Step 8]]: Multiple mappers, reducers and partitioning. |
* [[.:mapreduce-tutorial:Step 9]]: Hadoop properties. | * [[.:mapreduce-tutorial:Step 9]]: Hadoop properties. |
=== Java Hadoop basics ==== | === Java Hadoop basics ==== |
* [[.:mapreduce-tutorial:Step 23]]: Predefined formats and types. | * [[.:mapreduce-tutorial:Step 23]]: Predefined formats and types. |
* [[.:mapreduce-tutorial:Step 24]]: Mappers, running Java Hadoop jobs. | * [[.:mapreduce-tutorial:Step 24]]: Mappers, running Java Hadoop jobs, counters. |
* [[.:mapreduce-tutorial:Step 25]]: Reducers, combiners and partitioners. | * [[.:mapreduce-tutorial:Step 25]]: Reducers, combiners and partitioners. |
* [[.:mapreduce-tutorial:Step 26]]: Counters, compression and job configuration. | * [[.:mapreduce-tutorial:Step 26]]: Compression and job configuration. |
| |
=== Advanced topics === | === Advanced topics === |
* [[.:mapreduce-tutorial:Step 27]]: Custom data types. | * [[.:mapreduce-tutorial:Step 27]]: Custom data types. |
* [[.:mapreduce-tutorial:Step 28]]: Running multiple Hadoop jobs in one class. | * [[.:mapreduce-tutorial:Step 28]]: Running multiple Hadoop jobs in one class. |
* [[.:mapreduce-tutorial:Step 29]]: Custom input formats. | * [[.:mapreduce-tutorial:Step 29]]: Custom sorting and grouping comparators. |
| * [[.:mapreduce-tutorial:Step 30]]: Custom input formats. |
| |
=== Beyond MapReduce === | === Beyond MapReduce === |
* [[.:mapreduce-tutorial:Step 30]]: Implementing iterative MapReduce jobs faster using All-Reduce | * [[.:mapreduce-tutorial:Step 31]]: Implementing iterative MapReduce jobs faster using All-Reduce. |
| |
===== Other ===== | ===== Other ===== |
* [[user:majlis:hadoop|Further information]] | * [[user:majlis:hadoop|Further information]] |
| |