Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement the K-means clustering algorithm #22

Open
novoselrok opened this issue Jun 5, 2018 · 2 comments
Open

Implement the K-means clustering algorithm #22

novoselrok opened this issue Jun 5, 2018 · 2 comments
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@novoselrok
Copy link
Collaborator

Add k-means[1] clustering to doddle-model.

  • Select the algorithm implementation (Lloyd's, Hartigan-Wong, ...)
  • Implement initialisation methods (Random, k-means++)
  • Include parallelization

[1] https://en.wikipedia.org/wiki/K-means_clustering

@inejc inejc added the enhancement New feature or request label Jun 8, 2018
@inejc
Copy link
Member

inejc commented Jun 9, 2018

scikit-learn uses Lloyd's algorithm with k-means++ initialization and caching + the triangle inequality. Based on Wikipedia, Hartigan-Wong's method is more sophisticated and more computationally expensive. Will look more into this.

@Kamuzeriz
Copy link

K++

@novoselrok novoselrok mentioned this issue Jun 19, 2018
5 tasks
@inejc inejc added the help wanted Extra attention is needed label Dec 1, 2018
@inejc inejc changed the title K-means clustering Implement the K-means clustering algorithm Dec 1, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

3 participants