Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Last revision Both sides next revision | ||
courses:mapreduce-tutorial:step-15 [2012/01/26 00:11] straka |
courses:mapreduce-tutorial:step-15 [2012/01/28 23:23] majlis Added links to previous and next chapter. |
||
---|---|---|---|
Line 3: | Line 3: | ||
Implement the [[http:// | Implement the [[http:// | ||
^ Path ^ Number of points ^ Number of dimensions ^ Number of clusters ^ | ^ Path ^ Number of points ^ Number of dimensions ^ Number of clusters ^ | ||
- | | ''/ | + | | ''/ |
- | | ''/ | + | | ''/ |
- | | ''/ | + | | ''/ |
When dealing with iterative algorithms, each iteration is usually implemented as one Hadoop job. The Hadoop input_path contains the input data and each mapper also reads the current clusters. The reducers are used to aggregate the data and output new cluster centers. A controlling script is taking care of executing Hadoop jobs and stopping the iteration when the algorithm converges. | When dealing with iterative algorithms, each iteration is usually implemented as one Hadoop job. The Hadoop input_path contains the input data and each mapper also reads the current clusters. The reducers are used to aggregate the data and output new cluster centers. A controlling script is taking care of executing Hadoop jobs and stopping the iteration when the algorithm converges. | ||
+ | ---- | ||
+ | |||
+ | < | ||
+ | <table style=" | ||
+ | <tr> | ||
+ | <td style=" | ||
+ | <td style=" | ||
+ | <td style=" | ||
+ | </tr> | ||
+ | </ | ||
+ | </ |