Institute of Formal and Applied Linguistics Wiki



MapReduce Tutorial: K-means clustering

Implement the K-means clustering algorithm. You can use the following data:

Path | Number of points | Number of dimensions | Number of clusters
/home/straka/hadoop/example-inputs/points-small | 10000 | 50 | 50
/home/straka/hadoop/example-inputs/points-medium | 100000 | 100 | 100
/home/straka/hadoop/example-inputs/points-large | 500000 | 200 | 200
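
As a starting point, one K-means iteration can be phrased as a map step (assign each point to its nearest center) followed by a reduce step (average the points assigned to each center). The sketch below is plain Python that runs in memory, not Hadoop code. It assumes the points have already been parsed into lists of floats, since the format of the input files is not described here. All function names (nearest, map_point, combine, kmeans_iteration, kmeans) are hypothetical and chosen only for illustration.

```python
import math


def nearest(point, centers):
    """Return the index of the center closest to point (squared Euclidean distance)."""
    best, best_dist = 0, math.inf
    for i, center in enumerate(centers):
        d = sum((p - q) ** 2 for p, q in zip(point, center))
        if d < best_dist:
            best, best_dist = i, d
    return best


def map_point(point, centers):
    """Map step: emit (cluster index, (point, 1)) for a single input point."""
    return nearest(point, centers), (list(point), 1)


def combine(a, b):
    """Combiner/reducer step: add two (vector sum, count) pairs."""
    (sum_a, n_a), (sum_b, n_b) = a, b
    return [x + y for x, y in zip(sum_a, sum_b)], n_a + n_b


def kmeans_iteration(points, centers):
    """Run one map + reduce pass and return the updated centers."""
    partial = {}
    for point in points:
        key, value = map_point(point, centers)
        partial[key] = combine(partial[key], value) if key in partial else value
    # Assumption: a center that received no points keeps its old position.
    new_centers = [list(c) for c in centers]
    for key, (vec_sum, count) in partial.items():
        new_centers[key] = [x / count for x in vec_sum]
    return new_centers


def kmeans(points, centers, max_iter=20, tol=1e-9):
    """Repeat iterations until no center moves by more than tol, or max_iter is reached."""
    for _ in range(max_iter):
        updated = kmeans_iteration(points, centers)
        shift = max(math.dist(a, b) for a, b in zip(centers, updated))
        centers = updated
        if shift <= tol:
            break
    return centers
```

Because combine is associative and commutative, the same function could serve as both combiner and reducer, so each mapper would only need to send one (sum, count) pair per cluster rather than every point. In a MapReduce setting, each call to kmeans_iteration would correspond to one job, with the current centers distributed to all mappers.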
