Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
courses:mapreduce-tutorial:step-13 [2012/01/31 15:52] straka |
courses:mapreduce-tutorial:step-13 [2012/01/31 15:54] (current) straka |
||
---|---|---|---|
Line 8: | Line 8: | ||
You can assume that the integers are uniformly distributed. | You can assume that the integers are uniformly distributed. | ||
- | Your task is to sort these data, comparing the key numerically and not lexicographically. | + | Your task is to sort these data, comparing the key numerically and not lexicographically. |
Your solution should work for TBs of data. For that reason, you must use multiple reducers. If your job is executed using //r// reducers, the output consists of //r// files, which when concatenated would produce sorted (key, value) pairs. In other words, each of the output files contains sorted (integer, data) pairs and all keys in one file are either smaller or larger than in other file. Your solution should work for any value //r// -- this value is given to [[.: | Your solution should work for TBs of data. For that reason, you must use multiple reducers. If your job is executed using //r// reducers, the output consists of //r// files, which when concatenated would produce sorted (key, value) pairs. In other words, each of the output files contains sorted (integer, data) pairs and all keys in one file are either smaller or larger than in other file. Your solution should work for any value //r// -- this value is given to [[.: |