
Institute of Formal and Applied Linguistics Wiki



Differences

This shows you the differences between two versions of the page.

courses:mapreduce-tutorial:running-jobs [2012/02/05 21:14]
straka
courses:mapreduce-tutorial:running-jobs [2012/02/05 21:19]
straka
Line 5: Line 5:
 The output of a Hadoop job must be a directory that does not exist yet.
  
-===== Run Perl jobs =====
+===== Running jobs =====
 + 
 +| ^ Command ^ 
 +^ Run Perl script ''script.pl'' | ''perl script.pl'' |
 + 
 +  * When running Perl script ''script.pl'', use  
 Choosing mode of operation:
 | ^ Command ^
Line 15: Line 21:
 Specifying number of mappers and reducers:
 | ^ Command ^
-^ Run using //R// reducers \\ (//R//>1 not working when running locally)| ''perl -r R script.pl input output'' |
+^ Run using //R// reducers \\ (//R//>1 not working when running locally)| ''perl script.pl -r R input output'' |
 ^ Run using //M// mappers | ''perl script.pl `/net/projects/hadoop/bin/compute-splitsize input M` input output'' |
 +
 +===== Run Java jobs =====
 +Choosing mode of operation:
 +| ^ Command ^
 +^ Run locally | ''/net/projects/hadoop/bin/hadoop job.jar input output'' |
 +^ Run using specified jobtracker | ''/net/projects/hadoop/bin/hadoop job.jar -jt jobtracker:port input output'' |
 +^ Run job in dedicated cluster | ''/net/projects/hadoop/bin/hadoop job.jar -c number_of_machines input output'' |
 +^ Run job in dedicated cluster and after it finishes, \\ wait for //W// seconds before stopping the cluster | ''/net/projects/hadoop/bin/hadoop job.jar -c number_of_machines -w W_seconds input output'' |
 +
 +Specifying number of mappers and reducers:
 +| ^ Command ^
 +^ Run using //R// reducers \\ (//R//>1 not working when running locally)| ''/net/projects/hadoop/bin/hadoop job.jar -r R input output'' |
 +^ Run using //M// mappers | ''/net/projects/hadoop/bin/hadoop job.jar `/net/projects/hadoop/bin/compute-splitsize input M` input output'' |
 +
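
The rules above (the output directory must not exist, and options such as ''-c'' and ''-w'' go between ''job.jar'' and ''input output'') can be sketched as a small shell wrapper. This is only an illustration under stated assumptions: the helper names ''build_java_cmd'' and ''output_is_free'' are hypothetical and not part of the tutorial, and the existence check only makes sense when the output path is on a local filesystem.

```shell
#!/bin/sh
# Launcher path as given in the Java jobs table.
HADOOP=/net/projects/hadoop/bin/hadoop

# Hypothetical helper: print the command line for a Java job.
# Usage: build_java_cmd JAR INPUT OUTPUT [MACHINES [WAIT_SECONDS]]
build_java_cmd() {
    jar=$1; input=$2; output=$3; machines=${4:-}; wait_s=${5:-}
    opts=""
    # -c requests a dedicated cluster of the given number of machines.
    if [ -n "$machines" ]; then opts="$opts -c $machines"; fi
    # -w keeps the cluster alive for that many seconds after the job ends.
    if [ -n "$wait_s" ]; then opts="$opts -w $wait_s"; fi
    printf '%s %s%s %s %s\n' "$HADOOP" "$jar" "$opts" "$input" "$output"
}

# Hypothetical guard: succeeds only if the output path does not exist yet.
# Assumption: the path is on a local filesystem visible to this shell.
output_is_free() {
    [ ! -e "$1" ]
}

# Demo: print the command only when the output directory is still free.
demo_dir=$(mktemp -d)
if output_is_free "$demo_dir/output"; then
    build_java_cmd job.jar input "$demo_dir/output" 3 600
else
    echo "output directory already exists" >&2
fi
rmdir "$demo_dir"
```

The wrapper only prints the command instead of running it, so it can be tried without access to the cluster.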
