2013/3/28 Mathieu Blondel <[email protected]>:
> Encoding missing values with np.nan doesn't scale to very
> high-dimensional problems with mostly missing values.
> Personally, for encoding missing data, I just use sparse matrices.
> Values which are actually zero can be stored explicitly in the .data
> attribute.

+1 for not storing missing values, but will scipy.sparse matrices work
correctly when .data has zeros, and will conversion between formats
retain them?
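
One way to check this is a small experiment; the following is only a sketch, and the outcome may depend on the SciPy version and on which operations are applied afterwards. The toy matrix and variable names are made up for illustration. It builds a CSR matrix with an explicitly stored zero, counts the stored entries after converting to CSC and COO, and then calls eliminate_zeros() to show the one call that is documented to drop such entries.

```python
import numpy as np
from scipy import sparse

# Hypothetical toy data: row 0 stores an observed zero at column 1 and an
# observed 3.0 at column 2. Row 1 stores nothing, so under the convention
# discussed above every entry in it counts as missing.
data = np.array([0.0, 3.0])
indices = np.array([1, 2])
indptr = np.array([0, 2, 2])
X = sparse.csr_matrix((data, indices, indptr), shape=(2, 4))

# nnz counts stored entries, including explicitly stored zeros.
stored_csr = X.nnz
stored_csc = X.tocsc().nnz
stored_coo = X.tocoo().nnz

# eliminate_zeros() removes explicitly stored zeros in place, which would
# turn the observed zero into a "missing" entry under this convention.
Y = X.copy()
Y.eliminate_zeros()
stored_after_eliminate = Y.nnz

print(stored_csr, stored_csc, stored_coo, stored_after_eliminate)
```

This only covers CSR, CSC and COO; other formats and arithmetic on the matrices would need to be checked separately before relying on the convention.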

-- 
Lars Buitinck
Scientific programmer, ILPS
University of Amsterdam

_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
