Differences
This shows you the differences between two versions of the page.
courses:mapreduce-tutorial [2012/01/26 18:26] straka |
courses:mapreduce-tutorial [2012/01/28 00:35] straka |
||
---|---|---|---|
Line 23: | Line 23: | ||
=== MapReduce extended === | === MapReduce extended === | ||
- | From now on, it is best to run MR jobs using a one-machine cluster. Running the scripts locally without any cluster has several disadvantages, | + | From now on, it is best to run MR jobs using a one-machine cluster |
* [[.: | * [[.: | ||
* [[.: | * [[.: | ||
Line 38: | Line 38: | ||
===== Day 2 ===== | ===== Day 2 ===== | ||
- | Today we will be using the Java API. | + | Today we will be using the [[http:// |
- | ===Environment=== | + | === Environment === |
- | * | + | * [[.: |
+ | * [[.: | ||
+ | |||
+ | === Java Hadoop basics === |
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | === Custom data types and formats === | ||
+ | * Custom data type -- Pair<A, B>, BerIntWritable. | ||
+ | * Custom input format -- WholeFile and WholeFileAsPath. |
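To illustrate what a custom data type such as BerIntWritable might involve, here is a minimal sketch. The tutorial's actual class is not shown in this revision, so everything here is an assumption: the class name `BerIntSketch` is hypothetical, "BER" is taken to mean the base-128 compressed integer format (7 bits per byte, most significant group first, high bit set on every byte except the last), and the `write`/`readFields` pair only mirrors the shape of Hadoop's `Writable` interface, without depending on Hadoop so that the example runs with the JDK alone.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical stand-in for a BER-compressed integer type. It is NOT the
// tutorial's BerIntWritable; it only mimics the write/readFields method
// pair of Hadoop's Writable using plain java.io types.
class BerIntSketch {
    private int value;

    BerIntSketch() {}

    BerIntSketch(int value) { set(value); }

    void set(int value) {
        if (value < 0) {
            throw new IllegalArgumentException("negative values are not supported");
        }
        this.value = value;
    }

    int get() { return value; }

    // Serialize: 7 bits per byte, most significant group first,
    // high bit set on every byte except the last one.
    void write(DataOutput out) throws IOException {
        int groups = 1;
        for (int v = value >>> 7; v != 0; v >>>= 7) {
            groups++;
        }
        for (int i = groups - 1; i > 0; i--) {
            out.writeByte(((value >>> (7 * i)) & 0x7F) | 0x80);
        }
        out.writeByte(value & 0x7F);
    }

    // Deserialize: keep reading bytes while the continuation bit is set.
    void readFields(DataInput in) throws IOException {
        int result = 0;
        int b;
        do {
            b = in.readUnsignedByte();
            result = (result << 7) | (b & 0x7F);
        } while ((b & 0x80) != 0);
        value = result;
    }

    // Convenience helpers for trying the encoding in memory.
    static byte[] encode(int value) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            new BerIntSketch(value).write(new DataOutputStream(bytes));
            return bytes.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    static int decode(byte[] data) {
        try {
            BerIntSketch result = new BerIntSketch();
            result.readFields(new DataInputStream(new ByteArrayInputStream(data)));
            return result.get();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        byte[] encoded = encode(300);
        // prints "2 bytes, decoded 300"
        System.out.println(encoded.length + " bytes, decoded " + decode(encoded));
    }
}
```

A real Hadoop key type would additionally implement `WritableComparable`, so that keys can be sorted during the shuffle; that part is omitted here.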
+ | |||
+ | === Exercises === | ||
+ | * Inverted index. | ||
+ | * [[.: | ||
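As a starting point for the inverted index exercise, the following sketch simulates the map and reduce phases in memory with plain Java collections. It is not written against the Hadoop API, and the exercise's exact requirements are not given in this revision; the class name `InvertedIndexSketch`, the tokenization rule (lowercase, split on non-word characters) and the output shape (word mapped to a sorted set of document ids) are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical in-memory illustration of an inverted index built in two
// phases; a Hadoop job would place these phases in a Mapper and a Reducer.
class InvertedIndexSketch {
    // Map phase: emit one (word, documentId) pair per word occurrence.
    static List<String[]> map(String docId, String text) {
        List<String[]> pairs = new ArrayList<>();
        for (String word : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!word.isEmpty()) {
                pairs.add(new String[] {word, docId});
            }
        }
        return pairs;
    }

    // Shuffle and reduce phase: group the pairs by word and collect the
    // distinct document ids for each word in sorted order.
    static Map<String, TreeSet<String>> reduce(List<String[]> pairs) {
        Map<String, TreeSet<String>> index = new TreeMap<>();
        for (String[] pair : pairs) {
            index.computeIfAbsent(pair[0], k -> new TreeSet<>()).add(pair[1]);
        }
        return index;
    }

    // Run both phases over a collection of (documentId, text) entries.
    static Map<String, TreeSet<String>> build(Map<String, String> docs) {
        List<String[]> pairs = new ArrayList<>();
        for (Map.Entry<String, String> doc : docs.entrySet()) {
            pairs.addAll(map(doc.getKey(), doc.getValue()));
        }
        return reduce(pairs);
    }

    public static void main(String[] args) {
        Map<String, String> docs = new TreeMap<>();
        docs.put("d1", "Hadoop runs MapReduce jobs");
        docs.put("d2", "MapReduce jobs use mappers and reducers");
        System.out.println(build(docs));
    }
}
```

In a real job the grouping done by `reduce` here happens in the framework's shuffle, so the reducer only sees one word together with all of its document ids.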
===== Other ===== | ===== Other ===== | ||
* [[user: | * [[user: | ||