Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
courses:mapreduce-tutorial [2012/01/25 21:10] straka |
courses:mapreduce-tutorial [2012/01/28 00:35] straka |
||
---|---|---|---|
Line 21: | Line 21: | ||
* [[.: | * [[.: | ||
* [[.: | * [[.: | ||
- | |||
- | From now on, it is best to run MR jobs using a one-machine cluster. Running the scripts locally without any cluster has several disadvantages, | ||
=== MapReduce extended === | === MapReduce extended === | ||
+ | From now on, it is best to run MR jobs using a one-machine cluster -- create a one-machine cluster using '' | ||
* [[.: | * [[.: | ||
* [[.: | * [[.: | ||
Line 32: | Line 31: | ||
=== Advanced MapReduce exercises === | === Advanced MapReduce exercises === | ||
+ | Exercises in this section can be made in any order, but it is recommended to try solving all of them. The [[.: | ||
* [[.: | * [[.: | ||
* [[.: | * [[.: | ||
- | * [[.: | + | * [[.: |
+ | |||
+ | ===== Day 2 ===== | ||
+ | |||
+ | Today we will be using the [[http:// | ||
+ | |||
+ | === Environment === | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | === Java Hadoop basics ==== | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | === Custom data types and formats === | ||
+ | * Custom data type -- Pair<A, B>, BerIntWritable. | ||
+ | * Custom input format -- WholeFile and WholeFileAsPath | ||
+ | |||
+ | === Exercises === | ||
+ | * Inverted index. | ||
+ | * [[.: | ||
===== Other ===== | ===== Other ===== | ||
* [[user: | * [[user: | ||