Hi All,

I am doing multi-label classification, for which I am using LabelBinarizer. I 
am dealing with more than 6M data items, and each data item has a minimum of 1 
and a maximum of 5 labels. The number of unique labels is more than 42K. When I 
try to binarize the labels, I get ValueError: array is too big, which is 
understandable given the size of the dense output.
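
As a rough back-of-the-envelope check of why the dense output cannot fit in 
memory, here is a small sketch. It uses the lower bounds quoted above (6M items, 
42K labels) and assumes 8 bytes per entry; the entry size is an assumption, and 
the actual dtype may differ.

```python
# Rough size estimate for a dense (n_samples x n_labels) indicator matrix.
# The counts are the lower bounds from the message; bytes_per_entry is an
# assumption (8 bytes, e.g. a 64-bit integer), not a confirmed detail.
n_samples = 6_000_000
n_labels = 42_000
max_labels_per_sample = 5
bytes_per_entry = 8

dense_entries = n_samples * n_labels
dense_bytes = dense_entries * bytes_per_entry

# Each row has at most 5 non-zero entries, so this bounds the non-zeros.
max_nonzeros = n_samples * max_labels_per_sample
nonzero_fraction = max_nonzeros / dense_entries

print(f"dense entries: {dense_entries:,}")
print(f"dense size: {dense_bytes / 1e12:.2f} TB")
print(f"non-zero entries: at most {max_nonzeros:,}")
print(f"non-zero fraction: at most {nonzero_fraction:.6%}")
```

Since each item has at most 5 of the 42K labels, nearly all entries of the dense 
array would be zeros.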


Are there any alternatives for classifying such a large amount of 
multi-labelled data?
 

Regards,
Mahendra Kariya
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
