Differences
This shows you the differences between two versions of the page.
courses:mapreduce-tutorial:step-9 [2012/01/25 15:46] straka created |
courses:mapreduce-tutorial:step-9 [2012/01/31 09:42] (current) straka Change Perl commandline syntax. |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== MapReduce Tutorial : ====== | + | ====== MapReduce Tutorial : Hadoop properties ====== |
+ | |||
+ | We have controlled the Hadoop jobs using the Perl API so far, which is quite limited. | ||
+ | |||
+ | Hadoop itself uses many configuration options. The options can be set on the command line using the '' | ||
+ | perl script.pl [-jt cluster_master | -c cluster_size [-w sec_to_wait]] [-r number_of_reducers] [-Dname=value -Dname=value ...] input output_path | ||
+ | Mind that the order of options matters -- the '' | ||
+ | |||
+ | Every Hadoop option has a read-only default. These defaults are overridden by cluster-specific options. Finally, both are overridden by job-specific options given on the command line (or set using the Java API). | ||
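The three layers of precedence described above can be pictured as a layered lookup. The following is only a minimal sketch in Python, not Hadoop's actual implementation: the option names (`example.option.a`, `example.option.b`), their values, and the helper `parse_job_options` are hypothetical and chosen purely for illustration. Only the `-Dname=value` syntax is taken from the command line shown above.

```python
from collections import ChainMap


def parse_job_options(args):
    """Collect -Dname=value arguments into a dict.

    In this sketch a later occurrence of the same name replaces an earlier one.
    """
    options = {}
    for arg in args:
        if arg.startswith("-D") and "=" in arg:
            name, value = arg[2:].split("=", 1)
            options[name] = value
    return options


# Hypothetical option names and values, used only to illustrate precedence.
defaults = {"example.option.a": "1", "example.option.b": "x"}
cluster_specific = {"example.option.a": "4"}
job_specific = parse_job_options(["-Dexample.option.b=y", "input", "output_path"])

# ChainMap searches its maps from left to right, so the first layer that
# defines an option wins: job-specific, then cluster-specific, then defaults.
effective = ChainMap(job_specific, cluster_specific, defaults)


def resolve(name):
    """Return the effective value of an option after applying all layers."""
    return effective[name]


print(resolve("example.option.a"))
print(resolve("example.option.b"))
```

Here `example.option.a` resolves to the cluster-specific value because no job-specific value was given, while `example.option.b` resolves to the value passed with `-D`.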
+ | |||
+ | ===== A brief list of Hadoop options ===== | ||
+ | ^ Hadoop option ^ Default value ^ Description ^ | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | |||
+ | A more complete (but not exhaustive) list can be found [[http:// | ||
+ | |||
+ | ===== Mapping of Perl options to Hadoop ===== | ||
+ | ^ Perl options ^ Hadoop options ^ | ||
+ | | no options \\ (running locally) | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | | '' | ||
+ | |||