2013/3/22 Albert Kottke <albert.kot...@gmail.com>: > My biggest question is forming the data into the X matrix (n_samples, > n_features). The approach you describe would cluster based on thickness and > velocity without consideration of the relationship between adjacent layers. > Initially, I want to try to cluster based on change in Vs with depth. In > doing so, it is important that layer sequence is considered. Eventually, I > might want to consider other aspects, but I think this (Vs and depth) will > give me a good understanding of what is possible.
I'm not sure what exactly you're trying to say, but in general, if you want to cluster based on some feature, then you should add a column with the feature's value to your X. So, switching to a completely fictitious problem, if you'd have to features i and j in your dataset as it's stored on disk, but you're actually interested in log(x[i]) / cos(x[j])² (again, I'm making this up), then just compute that quantity for all your samples and append the result as a column to your dataset. -- Lars Buitinck Scientific programmer, ILPS University of Amsterdam ------------------------------------------------------------------------------ Everyone hates slow websites. So do we. Make your web apps faster with AppDynamics Download AppDynamics Lite for free today: http://p.sf.net/sfu/appdyn_d2d_mar _______________________________________________ Scikit-learn-general mailing list Scikit-learn-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/scikit-learn-general