I am using Birch's partial_fit because the dataset is too large to load into memory, so I cluster the data batch by batch.
For example, I have 50000 samples and each batch contains 1000 samples. I found that the clustering result is better when each batch also includes part of the previous batch than when it contains only new samples. So I would like to know how partial_fit works.
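
For reference, the setup looks roughly like the sketch below. The synthetic blobs, the make_batch helper, and the parameter values (n_clusters=3, threshold=0.5) are placeholders for illustration only, not my real data or settings. As far as I can tell, each partial_fit call inserts the new samples into the CF tree built by earlier calls instead of starting over, and then re-runs the global clustering step on the subcluster centroids.

```python
import numpy as np
from sklearn.cluster import Birch

rng = np.random.default_rng(0)

# Placeholder data source: three well-separated 2-D blobs (an assumption,
# standing in for whatever loads one batch of the real dataset from disk).
centers = np.array([[0.0, 0.0], [10.0, 10.0], [-10.0, 10.0]])

def make_batch(n):
    """Return n synthetic samples drawn around the placeholder centers."""
    idx = rng.integers(0, len(centers), size=n)
    return centers[idx] + rng.normal(scale=0.5, size=(n, 2))

n_samples, batch_size = 50000, 1000

# One model object is reused for every batch so the tree keeps growing.
model = Birch(n_clusters=3, threshold=0.5)
for _ in range(n_samples // batch_size):
    model.partial_fit(make_batch(batch_size))

# Assign clusters to unseen samples using the tree built from all batches.
new_labels = model.predict(make_batch(10))
```

Here no sample is fed twice. In my real runs the only difference is that each batch also carries over some samples from the previous one.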
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn