[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision Both sides next revision
courses:mapreduce-tutorial:step-24 [2012/01/27 21:41]
straka
courses:mapreduce-tutorial:step-24 [2012/01/27 21:47]
straka
Line 3: Line 3:
 We start by going through a simple Hadoop job with Mapper only. We start by going through a simple Hadoop job with Mapper only.
  
-A mapper which processes (key, value) pairs of types (Kin, Vin) and produces (key, value) pairs of types (Kout, Vout) must be a subclass of [[http://hadoop.apache.org/common/docs/r1.0.0/api/org/apache/hadoop/mapreduce/Mapper.html|Mapper<Kin, Vin, Kout, Vout>]]. In our case, ''TheMapper'' is subclass of ''Mapper<Text, Text, Text, Text>''.+A mapper which processes (key, value) pairs of types (Kin, Vin) and produces (key, value) pairs of types (Kout, Vout) must be a subclass of [[http://hadoop.apache.org/common/docs/r1.0.0/api/org/apache/hadoop/mapreduce/Mapper.html|Mapper<Kin, Vin, Kout, Vout>]]. In our case, the mapper is subclass of ''Mapper<Text, Text, Text, Text>''.
  
 +The mapper must define a ''map'' method and may provide ''setup'' and ''context'' method:
 +<code java>
 +  public static class TheMapper extends Mapper<Text, Text, Text, Text>{
 +    public void setup(Context context) throws IOException, InterruptedException {}
  
-http://hadoop.apache.org/common/docs/r1.0.0/api/org/apache/hadoop/mapreduce/Mapper.html+    public void map(Text key, Text value, Context context) throws IOException, InterruptedException {}
  
- The Mapper outputs only keys starting with ''A''.+    public void cleanup(Context context) throws IOException, InterruptedException {} 
 +  
 +</code> 
 + 
 +Outputting (key, value) pairs is performed using the [[http://hadoop.apache.org/common/docs/r1.0.0/api/org/apache/hadoop/mapreduce/MapContext.html|MapContext<Kin, Vin, Kout, Vout>]] (the ''Context'' is an abbreviation for this type).
  
 <file java MapperOnlyHadoopJob.java> <file java MapperOnlyHadoopJob.java>

[ Back to the navigation ] [ Back to the content ]