Gael Varoquaux <gael.varoqu...@normalesup.org>
wrote:

> One problem that we have here, is that I don't believe that any of the
> current core developers are experts of GMMs. Thus unless you have
> good experience with GMMs, or find yourself a mentor who has, I don't
> believe that this is a good GSOC project to follow.

Not a core developer though, nor an expert, but I have coded a fair amount
of GMMs for the purpose of automated spike sorting. I thought about
submitting CEM, CEM2 and Gaussian hierarchical clustering to scikit-learn,
but I have never really had the time. In my experience, CEM2 to optimize
the MML score is the most reliable and numerically stabile method for
fitting GMMs (cf. Figuereido & Jain). The most common formulation of the EM
algorithm for fitting GMMs with ML is particularly unstabile, as it tends
to converge towards singularities and then blow up. CEM2 is rather robust
against that. I also have code to generate random samples from GMMs (which
I needed for some strange purpose I'm not even sure I remember. I think i
wanted to Monte Carlo integrate to find the KL divergence between two of
them, or something...) I'm also working on a new way to fit GMMs that I
think might find the globally best solution, not just a local optima, but I
still have to prove it rigourously. The paper has been sitting half written
on my desk for ages... :-)

And while we are talking about mixture modelling: Would nnclean (cf. Byers
& Raftery) be of any interest to scikit-learn? I have an extemely fast
kd-tree based implementation of it. 

Sturla


------------------------------------------------------------------------------
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231&iu=/4140/ostg.clktrk
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to