
Institute of Formal and Applied Linguistics Wiki



Differences

This shows you the differences between two versions of the page.

courses:mapreduce-tutorial:step-15 [2012/01/26 00:06]
straka
courses:mapreduce-tutorial:step-15 [2012/01/26 00:11]
straka
Line 6: Line 6:
 | ''/home/straka/hadoop/example-inputs/points-medium'' | 100000 | 100 | 100 |
 | ''/home/straka/hadoop/example-inputs/points-large'' | 500000 | 200 | 200 |
 +
 +When dealing with iterative algorithms, each iteration is usually implemented as one Hadoop job. The Hadoop input_path contains the input data, and each mapper additionally reads the current cluster centers. The reducers aggregate the data and output the new cluster centers. A controlling script executes the Hadoop jobs and stops iterating once the algorithm converges.
 +
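The iteration scheme described in the added paragraph can be sketched as a small in-memory simulation. This is not Hadoop code and not the tutorial's own implementation: it assumes a k-means-style update (nearest center, then mean of the assigned points), and the names ''map_phase'', ''reduce_phase'', ''run'', ''tolerance'' and ''max_iterations'' are hypothetical. Each pass through the loop stands in for one Hadoop job.

```python
import math


def nearest(point, centers):
    """Return the index of the center closest to point (squared Euclidean distance)."""
    return min(
        range(len(centers)),
        key=lambda i: sum((p - c) ** 2 for p, c in zip(point, centers[i])),
    )


def map_phase(points, centers):
    """Stand-in for the mappers: emit (cluster index, point) for every input point."""
    for point in points:
        yield nearest(point, centers), point


def reduce_phase(pairs, centers):
    """Stand-in for the reducers: average each cluster's points into a new center."""
    groups = {}
    for index, point in pairs:
        groups.setdefault(index, []).append(point)
    new_centers = []
    for i, old_center in enumerate(centers):
        members = groups.get(i)
        if not members:
            # Assumption: a cluster that received no points keeps its old center.
            new_centers.append(tuple(old_center))
        else:
            new_centers.append(tuple(sum(col) / len(members) for col in zip(*members)))
    return new_centers


def run(points, centers, tolerance=1e-6, max_iterations=100):
    """Stand-in for the controlling script: one simulated job per iteration."""
    for iteration in range(1, max_iterations + 1):
        new_centers = reduce_phase(map_phase(points, centers), centers)
        # Assumed convergence test: the largest center movement is below tolerance.
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift <= tolerance:
            break
    return centers, iteration


if __name__ == "__main__":
    points = [(0, 0), (0, 2), (10, 10), (10, 12)]
    print(run(points, [(0, 0), (10, 10)]))
```

In a real setup the controlling script would launch a Hadoop job instead of calling ''reduce_phase'' directly, and would compare the cluster centers written by the reducers with those of the previous iteration. The convergence test used here (maximum center movement below a threshold) is only one possible choice.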
