That totally depends on your data. Here it looks like you are scaling down
a feature that captures a lot of the variation you are looking for, thus
making it less important with respect to the other features in the
euclidean distance. You could try selecting important features beforehand.
But they may be non-coordinate directions in your feature space as well.
Michael
On Tue, Jul 8, 2014 at 3:56 PM, Sheila the angel <from.d.pu...@gmail.com>
wrote:
> While using Nearest Neighbors Classification, I am getting higher
> cross-validation accuracy with raw data (without scaling) compare to scaled
> data (using preprocessing.scale) .
>
> Is this normal?
> When should one scale the data?
>
>
> Thanks
> --
> Sheila
>
>
> ------------------------------------------------------------------------------
> Open source business process management suite built on Java and Eclipse
> Turn processes into business applications with Bonita BPM Community Edition
> Quickly connect people, data, and systems into organized workflows
> Winner of BOSSIE, CODIE, OW2 and Gartner awards
> http://p.sf.net/sfu/Bonitasoft
> _______________________________________________
> Scikit-learn-general mailing list
> Scikit-learn-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>
>
------------------------------------------------------------------------------
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general