[scikit-learn] Adding Quickshift clustering algorithm

Heinrich Jiang Sun, 07 Oct 2018 21:04:49 -0700

Hello,
I'm a researcher at Google Research and I am writing to initiate discussion
about adding Quickshift as well as a variant of it as part of
scikit-learn's set of clustering algorithms.


This somewhat recent algorithm was designed as a faster alternative to Mean
Shift and has been used extensively in computer vision (and already part of
scikit-image). The method was published independently in these papers
[1,2]. [1] has 600 citations and [2] has 1300 citations.

[1] Vedaldi, Andrea, and Stefano Soatto. "Quick shift and kernel methods
for mode seeking." *European Conference on Computer Vision*. Springer,
Berlin, Heidelberg, 2008.
[2] Rodriguez, Alex, and Alessandro Laio. "Clustering by fast search and
find of density peaks." *Science* 344.6191 (2014): 1492-1496.

In addition to Quickshift, I also propose a variant called Quickshift++,
which is Quickshift with an additional hyperparameter. We showed in [3]
that this substantially improved performance over Quickshift as well as
other clustering algorithms implemented in sklearn on benchmark datasets.
(i.e. Figure 9 in https://arxiv.org/abs/1805.07909) and was published at
ICML 2018.

[3] Jiang, Heinrich, Jennifer Jang, and Samory Kpotufe. "Quickshift++:
Provably Good Initializations for Sample-Based Mean Shift." ICML 2018

We have an implementation here (https://github.com/google/quickshift).

Best,
Heinrich

_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

[scikit-learn] Adding Quickshift clustering algorithm

Reply via email to