Differences
This shows you the differences between two versions of the page.
courses:mapreduce-tutorial:step-9 [2012/01/25 15:56] straka |
courses:mapreduce-tutorial:step-9 [2012/01/31 09:42] (current) straka Change Perl commandline syntax. |
||
---|---|---|---|
Line 3: | Line 3: | ||
So far we have controlled Hadoop jobs through the Perl API, which is quite limited. | So far we have controlled Hadoop jobs through the Perl API, which is quite limited. | ||
- | Hadoop itself uses many configuration options. | + | Hadoop itself uses many configuration options. |
- | perl script.pl | + | perl script.pl [-jt cluster_master | -c cluster_size [-w sec_to_wait]] [-r number_of_reducers] [-Dname=value -Dname=value ...] input output_path |
- | Mind that the order of options matters -- the '' | + | Mind that the order of options matters -- the '' |
+ | Every Hadoop option has a read-only default. These defaults are overridden by cluster-specific options, which are in turn overridden by job-specific options given on the command line (or set using the Java API). | ||
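The three-level precedence described above (read-only defaults, then cluster-specific options, then job-specific options) can be illustrated with a minimal sketch. This is not how Hadoop or the Perl API resolves options internally; it only models the lookup order. The option names and values (`example.replication`, `example.sort.mb`) and the helper `parse_d_options` are hypothetical and chosen purely for illustration.

```python
from collections import ChainMap


def parse_d_options(args):
    """Collect -Dname=value arguments into a dict.

    Arguments that do not look like -Dname=value (for example the
    input and output paths) are ignored. If a name is given twice,
    the later value wins.
    """
    options = {}
    for arg in args:
        if arg.startswith("-D") and "=" in arg:
            # Split on the first '=' only, so values may contain '='.
            name, value = arg[2:].split("=", 1)
            options[name] = value
    return options


# Hypothetical option names and values, for illustration only.
defaults = {"example.replication": "3", "example.sort.mb": "100"}
cluster = {"example.sort.mb": "200"}
job = parse_d_options(["-Dexample.replication=1", "input", "output_path"])

# ChainMap looks keys up from left to right, so the job layer is
# consulted first, then the cluster layer, then the defaults.
effective = ChainMap(job, cluster, defaults)

for name in sorted(effective):
    print(name, "=", effective[name])
```

In this sketch, `example.replication` resolves to the job-level value and `example.sort.mb` to the cluster-level value, because each lookup stops at the first layer that defines the name.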
+ | ===== A brief list of Hadoop options ===== | ||
+ | ^ Hadoop option ^ Default value ^ Description ^ | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | A more complete (but not exhaustive) list can be found [[http:// | ||
+ | |||
+ | ===== Mapping of Perl options to Hadoop ===== | ||
+ | ^ Perl options ^ Hadoop options ^ | ||
+ | | no options \\ (running locally) | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | |||