[Scikit-learn-general] BIRCH - Testing datasets

Dženan Softić Mon, 30 Nov 2015 12:08:15 -0800

Hi,

I am trying to test BIRCH with the original datasets found here:
https://cs.joensuu.fi/sipu/datasets/
(100K points, 100 clusters)


The problem is setting the threshold. I need to set it above 10 000 to get
decent results. That is very weird because on BIRCH example (
http://scikit-learn.org/stable/auto_examples/cluster/plot_birch_vs_minibatchkmeans.html),
similar dataset has been produced, and with threshold set to 0.0 - 2.0
normal results could be obtained.

I thought there was something wrong with the dataset itself, but then I
found on BIRCH issues that it was actually used for testing during the
development:(https://gist.github.com/MechCoder/16f121698ccd50568c2a)

Am I doing something wrong here?

Thanks,
Dzeno

------------------------------------------------------------------------------
Go from Idea to Many App Stores Faster with Intel(R) XDK
Give your users amazing mobile app experiences with Intel(R) XDK.
Use one codebase in this all-in-one HTML5 development environment.
Design, debug & build mobile apps & 2D/3D high-impact games for multiple OSs.
http://pubads.g.doubleclick.net/gampad/clk?id=254741911&iu=/4140

_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

[Scikit-learn-general] BIRCH - Testing datasets

Reply via email to