Non-parallel versions of algorithms are very important.  In a real
data-mining situation, it is likely that some steps will be done using
serial computation and some using parallel computation.


On 2/14/08 5:51 AM, "Grant Ingersoll" <[EMAIL PROTECTED]> wrote:

> Hi Ankur,
> 
> Thanks for the contribution!
> 
> This brings up an interesting community point.  Would people rather we
> commit patches earlier, even though they aren't parallelized yet or
> completely ready to go?  I know once the project is more mature, I
> wouldn't favor this, but I am wondering if it will help grease the
> wheels, so to speak, in the early stages if people can just go
> checkout the code and work on it, versus having to go find patches?
> 
> Also, how do people want to handle creating examples?  For instance,
> it probably would be useful to have some simple examples using
> publicly available datasets for our algorithms?  I don't know that
> they belong in the core library (although, maybe the do) but
> definitely could be shipped as examples/tutorials/contrib.  For
> instance, I can see these kinds of things going along way to getting
> us into students hands who are learning ML, etc. which should also
> help build the community, etc.
> 
> -Grant
> 
> On Feb 13, 2008, at 7:55 AM, Ankur (JIRA) wrote:
> 
>> 
>>     [ 
>> https://issues.apache.org/jira/browse/MAHOUT-4?page=com.atlassian.jira.plugin
>> .system.issuetabpanels:all-tabpanel
>>  ]
>> 
>> Ankur updated MAHOUT-4:
>> -----------------------
>> 
>>    Attachment: PLSI_EM.patch
>> 
>> Here is the prototype implementation of of Probabilistic Latent
>> Semantic Indexing (PLSI) that uses Expectation Maximization.  Please
>> refer to javadoc comments for explanation.
>> 
>> Feel free to experiment with the code and have fun :-)
>> 
>>> Simple prototype for Expectation Maximization (EM)
>>> --------------------------------------------------
>>> 
>>>                Key: MAHOUT-4
>>>                URL: https://issues.apache.org/jira/browse/MAHOUT-4
>>>            Project: Mahout
>>>         Issue Type: New Feature
>>>           Reporter: Ankur
>>>        Attachments: PLSI_EM.patch
>>> 
>>> 
>>> Create a simple prototype implementing Expectation Maximization -
>>> EM that demonstrates the algorithm functionality given a set of
>>> (user, click-url) data.
>>> The prototype should be functionally complete and should serve as a
>>> basis for the Map-Reduce version of the EM algorithm.
>> 
>> -- 
>> This message is automatically generated by JIRA.
>> -
>> You can reply to this email to add a comment to the issue online.
>> 
> 
> 

Reply via email to