[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
courses:mapreduce-tutorial:step-7 [2012/01/31 09:41]
straka Change Perl commandline syntax.
courses:mapreduce-tutorial:step-7 [2013/02/08 14:36] (current)
popel Milan improved our Hadoop
Line 4: Line 4:
  
 A cluster can be created using A cluster can be created using
-  /net/projects/hadoop/bin/hadoop-cluster -c number_of_machines -w sec_to_run_the_cluster_for+  /net/projects/hadoop/bin/hadoop-cluster -c number_of_machines -w sec_to_wait_after_all_jobs_completed
 The syntax is the same as in ''perl script.pl run''. The syntax is the same as in ''perl script.pl run''.
  
Line 12: Line 12:
 Running cluster is identified by its master. When running a Hadoop job using Perl API, existing cluster can be used by Running cluster is identified by its master. When running a Hadoop job using Perl API, existing cluster can be used by
   perl script.pl -jt cluster_master:9001 ...   perl script.pl -jt cluster_master:9001 ...
 +
 +===== Running Hadoop jobs from now on =====
 +
 +From now on, it is best to run MR jobs using a one-machine cluster -- create a one-machine cluster using ''hadoop-cluster'' for 3h (10800s) and run jobs using ''-jt cluster_master''. Running the scripts locally without any cluster has several disadvantages, most notably having only one reducer per job. 
  
 ===== Example ===== ===== Example =====

[ Back to the navigation ] [ Back to the content ]