Differences

This shows you the differences between two versions of the page.

--- courses:mapreduce-tutorial:step-8 [2012/01/25 15:29]
straka
+++ courses:mapreduce-tutorial:step-8 [2012/01/25 18:37]
straka
@@ Line 12: / Line 12: @@
 To use multiple reducers, the MR job must be executed by a cluster (even with one computer), not locally. The number of reducers is specified by ''-r'' flag:
-  perl script.pl [-j cluster_master | -c cluster_size [-w sec_to_wait]] [-r number_of_reducers]
+  perl script.pl run [-jt cluster_master | -c cluster_size [-w sec_to_wait]] [-r number_of_reducers]
 ==== Partitioning ====
@@ Line 44: / Line 44: @@
 A MR job must have a reducer if it specifies a partitioner. Also, the partitioner is not called if there is only one reducer.
+===== The order of keys during reduce =====
+It is guaranteed that every reducer processes the keys in //ascending order//.
+On the other hand, when processing one key, the order of its values is undefined.
 ===== Example =====
-Run a MR job on '/home/straka/wiki/cs-text-medium', which creates a list of unique article names and at the same time list of unique words.
+Run a MR job on '/home/straka/wiki/cs-text-medium', which creates an ascending list of unique article names and at the same time an ascending list of unique words.
 {{:courses:mapreduce-tutorial:step-8-solution.txt|Solution.pl}}

Institute of Formal and Applied Linguistics Wiki