Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
courses:mapreduce-tutorial:step-3 [2012/01/24 19:19] straka |
courses:mapreduce-tutorial:step-3 [2012/01/28 12:05] majlis Scripts were reuploaded. |
||
---|---|---|---|
Line 1: | Line 1: | ||
====== MapReduce Tutorial : Basic mapper ====== | ====== MapReduce Tutorial : Basic mapper ====== | ||
- | The simplest | + | The simplest |
+ | |||
+ | The Hadoop framework silently handles failures. If a mapper task fails, another is executed and the input of the failed attempt is discarded. | ||
===== Example Perl mapper ===== | ===== Example Perl mapper ===== | ||
- | <code perl> | + | <file perl> |
# | # | ||
Line 28: | Line 30: | ||
$runner-> | $runner-> | ||
- | </code> | + | </file> |
The values '' | The values '' | ||
- | Resulting script can be executed locally | + | Resulting script can be executed locally |
perl script.pl run input_directory output_directory | perl script.pl run input_directory output_directory | ||
All files in input_directory are processes. The output_directory must not exist. | All files in input_directory are processes. The output_directory must not exist. | ||
+ | |||
===== Exercise ===== | ===== Exercise ===== | ||
- | Try running a MR job on ''/ | + | To check that your Hadoop environment works, try running a MR job on ''/ |
- | articles with names beginning with a (ignoring the case). | + | wget --no-check-certificate ' |
+ | rm -rf step-3-out-ex; | ||
+ | less step-3-out-ex/ | ||
+ | |||
+ | ==== Solution ==== | ||
+ | You can also download the solution {{: | ||
+ | wget --no-check-certificate ' | ||
+ | rm -rf step-3-out-sol; | ||
+ | less step-3-out-sol/ | ||
+ | |||
+ | ---- | ||
+ | < | ||
+ | <table style=" | ||
+ | <tr> | ||
+ | <td style=" | ||
+ | <td style=" | ||
+ | <td style=" | ||
+ | </tr> | ||
+ | </ | ||
+ | </ |