+1 in general
+0 on this patch

I find this sort of reference implementation to be most helpful in understanding the semantics of the algorithms. A lot of the notation in the defining papers is, well, challenging, and a nice clear implementation without the M/R parallelization to chop it up into small pieces is a useful learning tool. I would actually suggest such a reference implementation be a requirement. If this patch had a couple of unit tests I would be +1 on it too.

Jeff

-----Original Message-----
From: Grant Ingersoll [mailto:[EMAIL PROTECTED]
Sent: Thursday, February 14, 2008 5:52 AM
To: [email protected]
Subject: Community development was Re: [jira] Updated: (MAHOUT-4) Simple prototype for Expectation Maximization (EM)

Hi Ankur,

Thanks for the contribution! This brings up an interesting community point. Would people rather we commit patches earlier, even though they aren't parallelized yet or completely ready to go? I know once the project is more mature, I wouldn't favor this, but I am wondering if it will help grease the wheels, so to speak, in the early stages if people can just go checkout the code and work on it, versus having to go find patches?
