This is incredible awesome.  Thanks!

On Fri, 14 Aug 2015 at 17:54 Jeffrey Blackburne <jblackbu...@gmail.com>
wrote:

> If you want to use a tree-based classifier, there is an experimental
> branch that can handle categorical feature data.
>
> https://github.com/scikit-learn/scikit-learn/pull/4899
>
> > On Aug 14, 2015, at 7:03 AM, Andreas Mueller <t3k...@gmail.com> wrote:
> >
> > Why do you think one-hot will be an "explosion"?
> > In your example, the vector would be length 8 (if there are values from
> a to f, that is, you gave the largest possible sets).
> >
> >
> > On 08/14/2015 09:01 AM, federico vaggi wrote:
> >> Hi,
> >>
> >> Simple example:
> >>
> >> Let's say that I have a binary classification task, and my input vector
> consists of two disjunct sects of categorical variables - something like:
> >>
> >> X1 = {'a', 'b', 'c', 'd'} and X2 = {'e', 'd', 'b', 'f'}
> >>
> >> The order within the sets does not matter (obviously), but it matters
> that the elements of X1 are conceptually separate from those of X2.
> >>
> >> All the categorical variables come from the same set.
> >>
> >> Is there a clever encoding that:
> >>
> >> - Emphasizes that order within each set does not matter
> >> - Avoids explosion with one-hot encoding everything?
> >>
> >> Federico
> >>
> >>
> >>
> ------------------------------------------------------------------------------
> >>
> >>
> >>
> >> _______________________________________________
> >> Scikit-learn-general mailing list
> >>
> >> Scikit-learn-general@lists.sourceforge.net
> >> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
> >
> >
> ------------------------------------------------------------------------------
> > _______________________________________________
> > Scikit-learn-general mailing list
> > Scikit-learn-general@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>
>
>
> ------------------------------------------------------------------------------
> _______________________________________________
> Scikit-learn-general mailing list
> Scikit-learn-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>
------------------------------------------------------------------------------
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to