[ Skip to the content ]

Institute of Formal and Applied Linguistics Wiki


[ Back to the navigation ]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
courses:mapreduce-tutorial [2012/01/28 00:34]
straka
courses:mapreduce-tutorial [2012/01/29 21:50]
straka [xmlrpc dokuvimki edit]
Line 32: Line 32:
 === Advanced MapReduce exercises === === Advanced MapReduce exercises ===
 Exercises in this section can be made in any order, but it is recommended to try solving all of them. The [[.:mapreduce-tutorial:Perl API|Perl API reference]] may come handy. Exercises in this section can be made in any order, but it is recommended to try solving all of them. The [[.:mapreduce-tutorial:Perl API|Perl API reference]] may come handy.
-  * [[.:mapreduce-tutorial:Step 13]]: Sorting +  * [[.:mapreduce-tutorial:Step 13]]: Sorting. 
-  * [[.:mapreduce-tutorial:Step 14]]: N-gram language model +  * [[.:mapreduce-tutorial:Step 14]]: N-gram language model. 
-  * [[.:mapreduce-tutorial:Step 15]]: K-means clustering+  * [[.:mapreduce-tutorial:Step 15]]: K-means clustering.
  
 ===== Day 2 ===== ===== Day 2 =====
Line 48: Line 48:
   * [[.:mapreduce-tutorial:Step 24]]: Mappers, running Java Hadoop jobs.   * [[.:mapreduce-tutorial:Step 24]]: Mappers, running Java Hadoop jobs.
   * [[.:mapreduce-tutorial:Step 25]]: Reducers, combiners and partitioners.   * [[.:mapreduce-tutorial:Step 25]]: Reducers, combiners and partitioners.
-  * [[.:mapreduce-tutorial:Step 26]]: Counters and job configuration.+  * [[.:mapreduce-tutorial:Step 26]]: Counters, compression and job configuration.
  
-=== Custom data types and formats === +=== Advanced topics === 
-  * Custom data type -Pair<A, B>, BerIntWritable+  * [[.:mapreduce-tutorial:Step 27]]: Custom data types. 
-  * Custom input format -- WholeFile and WholeFileAsPath+  * [[.:mapreduce-tutorial:Step 28]]: Running multiple Hadoop jobs in one class
 +  * [[.:mapreduce-tutorial:Step 29]]: Custom input formats.
  
-=== Exercises === +=== Beyond MapReduce === 
-  * Inverted index. +  * [[.:mapreduce-tutorial:Step 30]]: Implementing iterative MapReduce jobs faster using All-Reduce.
-  * Is [[.:mapreduce-tutorial:Step 13]], [[.:mapreduce-tutorial:Step 14]] and [[.:mapreduce-tutorial:Step 15]] enough?+
  
 ===== Other ===== ===== Other =====
   * [[user:majlis:hadoop|Further information]]   * [[user:majlis:hadoop|Further information]]
  

[ Back to the navigation ] [ Back to the content ]