On Tuesday 17 February 2009, Grant Ingersoll wrote:
> For ideas on what we need, see:  http://cwiki.apache.org/MAHOUT.  To
> name a few:  SVM, categorization algs, large scale graph ranking
> tools, maximum entropy implementation, collaborative filtering
> improvements (Sean?)

To name a few more: Algorithms for learning from sequential data (e.g. 
identifying named entities in an incoming stream of text), algorithms for 
learning rankings of items are also interesting.

If you plan to use Mahout as your platform of one of the various data mining-, 
machine learning- or information retrieval challenges feel free to submit 
your plan as GSoC proposal.


> !!!!!!!!
> For applicants, some things to keep in mind:
>
> It's very important applicants demonstrate they are capable of working
> and discussing ideas on the mahout-dev list during the application
> phase.

A definite +1 from me. Discussing your idea before submitting the proposal 
also helps you to get an idea of what exactly is needed, what is important to 
keep in mind and to get a better proposal. So don't be afraid to post your 
idea and refine it together with us.


> If you really think you could do more than one, instead propose items
> that are "time permitting" and that build on what you have completed.
> Demos and documentation are always good in this regard.

+1 It is not sufficient to submit a straight forward algorithm implementation. 
Keep in mind that in order to remain maintainable you need to provide unit- 
and integration tests for your work. In addition you need to provide examples 
and demos so others can see how to use your work. Finally thorough 
documentation of the algorithm itself, the implementation, its advantages and 
limits is needed to evaluate it for commercial projects.

Isabel


-- 
The abuse of greatness is when it disjoins remorse from power.          -- 
William 
Shakespeare, "Julius Caesar"
  |\      _,,,---,,_       Web:   <http://www.isabel-drost.de>
  /,`.-'`'    -.  ;-;;,_
 |,4-  ) )-,_..;\ (  `'-'
'---''(_/--'  `-'\_) (fL)  IM:  <xmpp://[email protected]>

Attachment: signature.asc
Description: This is a digitally signed message part.

Reply via email to