Hi Andy,

In this competition, it helped me get to 1st place for a bit, and the
current 1st place holder is also using this.
https://www.kaggle.com/c/sf-crime/forums/t/15836/predicting-crime-categories-with-address-featurization-and-neural-nets

I am pretty sure that the data is public for this specific case.

I've implemented it in pandas, but the code can definitely use some cleanup
and speed up.

One of the issues that I would like to clarify a bit better is the data
leakage issue, that can be dealt with by adding a bit of laplacian noise.

Cheers,

CFP

On Fri, Sep 18, 2015 at 8:39 PM, Andy <t3k...@gmail.com> wrote:

> Hi.
> I think it would be great to add this transformation, it has been widely
> advertised by Owen.
> It would be great to have some public datasets (well-established
> preferable) to demonstrate the usefulness.
> The sf-crime is public, so that would be a good first step.
> Can you provide a full-fledged example?
>
> Cheers,
> Andy
>
>
> On 09/17/2015 07:41 PM, Christos F. Papadopoulos wrote:
>
> Hi all!
>
> Are there any plans to implement something like a countfeaturizer?
> https://msdn.microsoft.com/en-us/library/azure/dn913056.aspx
>
> It is a generic method for feature extraction that encodes multi-valued
> features based on classification
> count data.
> I've found it very useful, for example on this kaggle competition:
> https://www.kaggle.com/c/sf-crime
>
>
> Thank you!
>
>
> ------------------------------------------------------------------------------
>
>
>
> _______________________________________________
> Scikit-learn-general mailing 
> listScikit-learn-general@lists.sourceforge.nethttps://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>
>
>
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> Scikit-learn-general mailing list
> Scikit-learn-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>
>
------------------------------------------------------------------------------
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to