kMeans-Canopy

Last edited by mike@mbowles.com 11 years, 4 months ago

References on k-means clustering:

     wikipedia: http://en.wikipedia.org/wiki/K-means_clustering

     visualizations: http://siebn.de/other/yakmeans/


     slides from Tan et al., Introduction to Data Mining: chap8_basic_cluster_analysis.ppt


How to map-reduce k-Means
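
One common way to express a single k-means iteration as map and reduce steps is sketched below. This is an illustrative sketch, not necessarily the formulation in the linked material. The function names (kmeans_map, kmeans_reduce, kmeans_iteration) are hypothetical, and an in-memory dictionary stands in for the shuffle phase of a real map-reduce framework.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Sequence, Tuple

Point = Tuple[float, ...]


def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_map(point: Point, centroids: Sequence[Point]) -> Tuple[int, Tuple[Point, int]]:
    """Map step: emit (index of nearest centroid, (point, 1))."""
    nearest = min(range(len(centroids)), key=lambda i: sq_dist(point, centroids[i]))
    return nearest, (point, 1)


def kmeans_reduce(values: Iterable[Tuple[Point, int]]) -> Point:
    """Reduce step: sum the vectors and counts for one key, then average.

    Because each value is a (vector sum, count) pair, the same summation
    could also run as a combiner on partial results.
    """
    total: List[float] = []
    count = 0
    for vec, n in values:
        total = list(vec) if not total else [t + v for t, v in zip(total, vec)]
        count += n
    return tuple(t / count for t in total)


def kmeans_iteration(points: Iterable[Point], centroids: Sequence[Point]) -> List[Point]:
    """Run one map / shuffle / reduce pass and return the updated centroids."""
    groups: Dict[int, List[Tuple[Point, int]]] = defaultdict(list)
    for p in points:
        key, value = kmeans_map(p, centroids)
        groups[key].append(value)  # stands in for the framework's shuffle
    new_centroids = list(centroids)  # a centroid with no points keeps its old position
    for key, values in groups.items():
        new_centroids[key] = kmeans_reduce(values)
    return new_centroids
```

In practice the iteration would be repeated, feeding each pass's centroids into the next, until the centroids stop moving or an iteration limit is reached.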




References on canopy clustering:

     original paper on canopy clustering: canopy-kdd00.pdf

    Mahout entry on canopy clustering: https://cwiki.apache.org/confluence/display/MAHOUT/Canopy+Clustering


How to get a random sample of k elements from a streaming input of unknown length:

     Slides from Stanford course on big data mining: 15-streams.pdf (see slides 13-15)
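
The standard technique for this problem is reservoir sampling. A minimal sketch follows, assuming the basic form of the algorithm (often called Algorithm R). The referenced slides may present a variant. The function name reservoir_sample and its rng parameter are hypothetical choices for illustration.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(stream: Iterable[T], k: int,
                     rng: Optional[random.Random] = None) -> List[T]:
    """Return k items chosen uniformly at random from a stream of unknown length.

    Uses O(k) memory and a single pass. If the stream holds fewer than k
    items, all of them are returned.
    """
    rng = rng or random.Random()
    reservoir: List[T] = []
    for n, item in enumerate(stream, start=1):
        if n <= k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Keep the n-th item with probability k/n by drawing a slot in [0, n).
            j = rng.randrange(n)
            if j < k:
                reservoir[j] = item
    return reservoir
```

The n-th item replaces a uniformly chosen reservoir slot with probability k/n, so every item seen so far remains in the sample with equal probability k/n. Passing a seeded random.Random makes a run reproducible.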




