Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
courses:mapreduce-tutorial [2012/01/25 22:20] straka |
courses:mapreduce-tutorial [2012/01/28 00:35] straka |
||
---|---|---|---|
Line 23: | Line 23: | ||
=== MapReduce extended === | === MapReduce extended === | ||
- | From now on, it is best to run MR jobs using a one-machine cluster. Running the scripts locally without any cluster has several disadvantages, | + | From now on, it is best to run MR jobs using a one-machine cluster |
* [[.: | * [[.: | ||
* [[.: | * [[.: | ||
Line 35: | Line 35: | ||
* [[.: | * [[.: | ||
* [[.: | * [[.: | ||
+ | |||
+ | ===== Day 2 ===== | ||
+ | |||
+ | Today we will be using the [[http:// | ||
+ | |||
+ | === Environment === | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | === Java Hadoop basics ==== | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | === Custom data types and formats === | ||
+ | * Custom data type -- Pair<A, B>, BerIntWritable. | ||
+ | * Custom input format -- WholeFile and WholeFileAsPath | ||
+ | |||
+ | === Exercises === | ||
+ | * Inverted index. | ||
+ | * [[.: | ||
===== Other ===== | ===== Other ===== | ||
* [[user: | * [[user: | ||