On Tue, Oct 2, 2012 at 7:48 AM, Dan Stowell <[email protected]> wrote: > > Hi all, > > I'm using the GMM class as part of a larger system, and something is > misbehaving. Can someone confirm please: the results of using GMM.fit() > shouldn't have a strong dependence on the data ranges, should they? For > example, if one variable has a range 0-1000, while the other has a range > 0-1, that difference shouldn't have much bearing?
This dependence is expected, and the variable with a range 0-1000 will dominate all others in your model unless you use a full covariance matrix, and even then you should expect some bias. In general it's good to mean-center and normalize everything before fitting a mixture model. -- - Alexandre ------------------------------------------------------------------------------ Don't let slow site performance ruin your business. Deploy New Relic APM Deploy New Relic app performance management and know exactly what is happening inside your Ruby, Python, PHP, Java, and .NET app Try New Relic at no cost today and get our sweet Data Nerd shirt too! http://p.sf.net/sfu/newrelic-dev2dev _______________________________________________ Scikit-learn-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
