Non-parallel versions of algorithms are very important. In a real data-mining situation, it is likely that some steps will be done using serial computation and some using parallel computation.
On 2/14/08 5:51 AM, "Grant Ingersoll" <[EMAIL PROTECTED]> wrote: > Hi Ankur, > > Thanks for the contribution! > > This brings up an interesting community point. Would people rather we > commit patches earlier, even though they aren't parallelized yet or > completely ready to go? I know once the project is more mature, I > wouldn't favor this, but I am wondering if it will help grease the > wheels, so to speak, in the early stages if people can just go > checkout the code and work on it, versus having to go find patches? > > Also, how do people want to handle creating examples? For instance, > it probably would be useful to have some simple examples using > publicly available datasets for our algorithms? I don't know that > they belong in the core library (although, maybe the do) but > definitely could be shipped as examples/tutorials/contrib. For > instance, I can see these kinds of things going along way to getting > us into students hands who are learning ML, etc. which should also > help build the community, etc. > > -Grant > > On Feb 13, 2008, at 7:55 AM, Ankur (JIRA) wrote: > >> >> [ >> https://issues.apache.org/jira/browse/MAHOUT-4?page=com.atlassian.jira.plugin >> .system.issuetabpanels:all-tabpanel >> ] >> >> Ankur updated MAHOUT-4: >> ----------------------- >> >> Attachment: PLSI_EM.patch >> >> Here is the prototype implementation of of Probabilistic Latent >> Semantic Indexing (PLSI) that uses Expectation Maximization. Please >> refer to javadoc comments for explanation. >> >> Feel free to experiment with the code and have fun :-) >> >>> Simple prototype for Expectation Maximization (EM) >>> -------------------------------------------------- >>> >>> Key: MAHOUT-4 >>> URL: https://issues.apache.org/jira/browse/MAHOUT-4 >>> Project: Mahout >>> Issue Type: New Feature >>> Reporter: Ankur >>> Attachments: PLSI_EM.patch >>> >>> >>> Create a simple prototype implementing Expectation Maximization - >>> EM that demonstrates the algorithm functionality given a set of >>> (user, click-url) data. >>> The prototype should be functionally complete and should serve as a >>> basis for the Map-Reduce version of the EM algorithm. >> >> -- >> This message is automatically generated by JIRA. >> - >> You can reply to this email to add a comment to the issue online. >> > >
