I don't think there's a formal list published anywhere. There is an informal roadmap.
The way I see it, contributions fall mainly into 3 areas: (1) project support issues, for example fixing shell compatibility with Spark 1.3; (2) framework support problems, for example performance and integrating third-party hardware-accelerated linalg libraries; (3) methodology work.

We have some pending items for (1) and (2), I think, but for methodology items (3) we simply can't compile a list of everything that could possibly be done and contributed. We just don't have that much expertise, combined. No one has [1].

The way it usually works is that people come up with pieces they found missing on their own for some reason, and they need to propose a methodology, a parallelization strategy, maybe even a code sketch; all of that will be fine.

[1] http://matt.might.net/articles/phd-school-in-pictures/

On Sun, Jun 14, 2015 at 11:49 PM, Rohit Shinde <[email protected]> wrote:

> But is there a list of projects that new people could take up? Even I am a
> student interested in contributing to the machine learning and data mining
> parts of Apache Mahout.
>
> I am familiar with Scala and Java, Python and C++.
>
> What can I contribute to?
>
> On Mon, Jun 15, 2015 at 10:24 AM, Dmitriy Lyubimov <[email protected]>
> wrote:
>
> > Well we are predominantly Scala shop now. Being fluent in Scala seems like
> > one prerequisite.
> >
> > On Sat, Jun 13, 2015 at 1:17 AM, Sreenivas Raghavan <
> > [email protected]> wrote:
> >
> > > Hello everyone,
> > > I am interested in contributing to mahout project. I am
> > > interested in algorithms, machine learning and linear algebra. Please give
> > > me some idea as where to start and how to start. I know python and some
> > > parts of Java, so please tell me is this knowledge of languages enough for
> > > writing and optimizing codes
> > > --
> > >
> > > *With Regards,*
> > > *K.S.Sreenivasa Raghavan*
