Differences

This shows you the differences between two versions of the page.

--- courses:mapreduce-tutorial:step-31 [2012/02/06 08:18]
straka
+++ courses:mapreduce-tutorial:step-31 [2012/02/06 08:21]
straka
@@ Line 31: / Line 31: @@
 It is crucial that all the mappers run simultaneously. This can be achieved using the ''/net/projects/hadoop/bin/compute-splitsize'' script: for given Hadoop input and requested number of mappers, it computes the appropriate splitsize.
+When the computation finishes, only one of the mappers should print the results, as all of them have the same results. For simplicity, the ''cooperate'' method has ''boolean shouldWrite'' argument, which is set in exactly one mapper.
 ===== Example =====
-This example reads the keys of ''/net/projects/hadoop/examples/inputs/numbers-small/numbers.txt'', computes the sum of all the keys and print it:
+This example reads the keys of ''/net/projects/hadoop/examples/inputs/numbers-small'', computes the sum of all the keys and print it:
 <code java Sum.java>
 import org.apache.hadoop.mapreduce.*;
@@ Line 108: / Line 110: @@
   less step-31-out/part-*
+===== Exercise 1 =====
+Implement an AllReduce job on

Institute of Formal and Applied Linguistics Wiki