Re: [Scikit-learn-general] higher accuracy with non scaled data

Michael Eickenberg Tue, 08 Jul 2014 07:01:46 -0700

That totally depends on your data. Here it looks like you are scaling down
a feature that captures a lot of the variation you are looking for, thus
making it less important with respect to the other features in the
euclidean distance. You could try selecting important features beforehand.
But they may be non-coordinate directions in your feature space as well.
Michael



On Tue, Jul 8, 2014 at 3:56 PM, Sheila the angel <[email protected]>
wrote:

> While using Nearest Neighbors Classification, I am getting higher
> cross-validation accuracy with raw data (without scaling) compare to scaled
> data (using preprocessing.scale) .
>
> Is this normal?
> When should one scale the data?
>
>
> Thanks
> --
> Sheila
>
>
> ------------------------------------------------------------------------------
> Open source business process management suite built on Java and Eclipse
> Turn processes into business applications with Bonita BPM Community Edition
> Quickly connect people, data, and systems into organized workflows
> Winner of BOSSIE, CODIE, OW2 and Gartner awards
> http://p.sf.net/sfu/Bonitasoft
> _______________________________________________
> Scikit-learn-general mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>
>

------------------------------------------------------------------------------
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft

_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Re: [Scikit-learn-general] higher accuracy with non scaled data

Reply via email to