Hi All,
I am doing multi-label classification, for which I am using LabelBinarizer. I
am dealing with more than 6M data items, and each data item has a minimum of 1
and a maximum of 5 labels. The number of unique labels is more than 42K. When I
try to binarize the labels, I get "ValueError: array is too big", which is to be
expected for a dense array of this size.
Are there any alternatives for classifying such a large amount of
multi-labelled data?
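
To put numbers on the problem, here is a rough back-of-the-envelope sketch of the dense indicator matrix size versus a sparse (CSR-style) layout. The counts are the lower bounds quoted above. The byte widths (8-byte or 1-byte cells, 4-byte indices) are assumptions for illustration, not what any particular library necessarily allocates.

```python
# Rough size estimate for the label indicator matrix.
# Counts are lower bounds from the message; byte widths are assumptions.
n_samples = 6_000_000          # "more than 6M data items"
n_labels = 42_000              # "more than 42K" unique labels
max_labels_per_sample = 5      # each item has between 1 and 5 labels

# Dense layout: one cell per (sample, label) pair.
dense_cells = n_samples * n_labels
dense_bytes_int64 = dense_cells * 8   # assuming 8-byte integer cells
dense_bytes_uint8 = dense_cells * 1   # assuming 1-byte cells

# Sparse CSR-style layout: store only the nonzero cells.
# Per nonzero: 1-byte value + 4-byte column index (assumed widths),
# plus one 4-byte row pointer per sample and one extra at the end.
max_nonzeros = n_samples * max_labels_per_sample
csr_bytes = max_nonzeros * (1 + 4) + (n_samples + 1) * 4

print(f"dense cells:        {dense_cells:,}")
print(f"dense, 8-byte cell: {dense_bytes_int64 / 1e12:.2f} TB")
print(f"dense, 1-byte cell: {dense_bytes_uint8 / 1e9:.0f} GB")
print(f"max nonzeros:       {max_nonzeros:,}")
print(f"CSR upper bound:    {csr_bytes / 1e6:.0f} MB")
```

Under these assumptions the dense array needs hundreds of gigabytes at best, while a sparse representation of the same labels stays in the low hundreds of megabytes. scikit-learn releases that ship `MultiLabelBinarizer` accept `sparse_output=True` to produce such a sparse matrix; whether a given classifier accepts a sparse label matrix depends on the estimator and version.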
Regards,
Mahendra Kariya
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general