Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
courses:mapreduce-tutorial [2012/01/23 20:56] straka |
courses:mapreduce-tutorial [2012/01/28 00:35] straka |
||
---|---|---|---|
Line 8: | Line 8: | ||
===== Day 1 ===== | ===== Day 1 ===== | ||
- | Today we will be using the Perl API. | + | Today we will be using the [[.: |
- | * [[.: | + | === Environment === |
+ | * [[.: | ||
+ | |||
+ | === MapReduce basics === | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | === Controlling the cluster === | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | === MapReduce extended === | ||
+ | From now on, it is best to run MR jobs using a one-machine cluster -- create a one-machine cluster using '' | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | === Advanced MapReduce exercises === | ||
+ | Exercises in this section can be made in any order, but it is recommended to try solving all of them. The [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | ===== Day 2 ===== | ||
+ | |||
+ | Today we will be using the [[http:// | ||
+ | |||
+ | === Environment === | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | === Java Hadoop basics ==== | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | * [[.: | ||
+ | |||
+ | === Custom data types and formats === | ||
+ | * Custom data type -- Pair<A, B>, BerIntWritable. | ||
+ | * Custom input format -- WholeFile and WholeFileAsPath | ||
+ | |||
+ | === Exercises === | ||
+ | * Inverted index. | ||
+ | * [[.: | ||
===== Other ===== | ===== Other ===== | ||
* [[user: | * [[user: | ||