2013/3/28 Mathieu Blondel <[email protected]>:
> Encoding missing values with np.nan doesn't scale to very
> high-dimensional problems with mostly missing values.
> Personally, for encoding missing data, I just use sparse matrices.
> Values which are actually zero can be stored explicitly in the .data
> attribute.

+1 for not storing missing values, but will scipy.sparse matrices work
correctly when .data has zeros, and will conversion between formats
retain them?

-- 
Lars Buitinck
Scientific programmer, ILPS
University of Amsterdam

_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
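
One way to check the question about explicit zeros empirically is the minimal sketch below. It assumes a reasonably recent SciPy (behaviour in the 2013 releases may have differed), and the 3x4 matrix and the stored() helper are made up for illustration. It covers only the CSR, CSC and COO formats. Element-wise arithmetic and other formats may treat explicit zeros differently and are not tested here.

```python
import numpy as np
import scipy.sparse as sp

# Hypothetical data: a stored entry is an observed value,
# an absent entry is a missing value.
rows = np.array([0, 0, 1, 2])
cols = np.array([0, 2, 1, 3])
vals = np.array([1.5, 0.0, 0.0, 2.0])  # two observed values are genuinely zero

X = sp.csr_matrix((vals, (rows, cols)), shape=(3, 4))


def stored(m):
    """Return the explicitly stored (row, col, value) triples, sorted."""
    coo = m.tocoo()
    return sorted(zip(coo.row.tolist(), coo.col.tolist(), coo.data.tolist()))


# Round trip CSR -> CSC -> COO -> CSR and compare the stored entries.
roundtrip = X.tocsc().tocoo().tocsr()
print(X.nnz, roundtrip.nnz)            # nnz counts stored entries, zeros included
print(stored(roundtrip) == stored(X))

# eliminate_zeros() drops the explicit zeros in place, so calling it
# turns "observed zero" into "missing" under this encoding.
pruned = X.copy()
pruned.eliminate_zeros()
print(pruned.nnz)
```

Converting to a dense array with toarray() loses the distinction, since stored zeros and absent entries both come out as 0.0.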
