2013/3/22 Albert Kottke <albert.kot...@gmail.com>:
> My biggest question is forming the data into the X matrix (n_samples,
> n_features). The approach you describe would cluster based on thickness and
> velocity without consideration of the relationship between adjacent layers.
> Initially, I want to try to cluster based on change in Vs with depth. In
> doing so, it is important that layer sequence is considered. Eventually, I
> might want to consider other aspects, but I think this (Vs and depth) will
> give me a good understanding of what is possible.

I'm not sure what exactly you're trying to say, but in general, if you
want to cluster based on some feature, then you should add a column
with the feature's value to your X.

So, switching to a completely fictitious problem, if you'd have to
features i and j in your dataset as it's stored on disk, but you're
actually interested in log(x[i]) / cos(x[j])² (again, I'm making this
up), then just compute that quantity for all your samples and append
the result as a column to your dataset.

-- 
Lars Buitinck
Scientific programmer, ILPS
University of Amsterdam

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_mar
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to