On 11/06/2013 05:00 PM, Jim wrote:
>> Then you don't need a OneVsRestClassifier as OvR is the default
>> strategy for SGD. You do need to put a maximum on the number of
>> classes before you start learning, though.
>   I see. Thank you for the advice. This was initial novice iteration of the
>   solution and needs improvement of course. In terms of which, in order to
>   keep the behaviour of the classifier consistent, instead of a single
>   classifier with thousands of categories wouldn't it be better
>   to build an ensemble of classifiers with a hierarchy(similar dataset, less
>   collision since the noise from the other classes is reduced and
>   I can define the solution-flow custom to the category chosen) say,
>   
>                      Classifiers
>                          |
>      ----------------------------------------
>      |              |       -------         |
>    Type A         Type B                   Type N (total types = 5 or 6)
>      |
>      |              --
>    -----------------------------------
>    |            |        --------    |
>   Category A  Category B          Category N (N = atmost 1000)
>
I would venture that which one is better would depend on the nature of 
your data.
Do you know the number of types beforehand? And do all types have 1000 
categories?

------------------------------------------------------------------------------
November Webinars for C, C++, Fortran Developers
Accelerate application performance with scalable programming models. Explore
techniques for threading, error checking, porting, and tuning. Get the most 
from the latest Intel processors and coprocessors. See abstracts and register
http://pubads.g.doubleclick.net/gampad/clk?id=60136231&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to