On Sun, 29 Dec 2013 15:10:46 +0200
<[email protected]> wrote:

> Perfect please advise where to start contributing to avoid redundant
> work

I think if you concentrate on the clustering and classification docs
there should be little potential for conflict at the moment.

My general advise would be to check the web site for what is already
there - raise an issue for pages you find *not* helpful so they can be
deleted.

If you are just getting started it might make sense to first
concentrate on newbie, getting up and running tutorials, maybe using
some easy to access dataset.

For information on how to edit the web site see here:

http://mahout.apache.org/developers/how-to-update-the-website.html

As you get further into the code you might find pieces that are lacking
commentary, package level documentation or class level documentation.
Check out the code, add the docs and submit the results as patches.

For information on where to checkout the code from see here:
http://mahout.apache.org/developers/version-control.html

For information on how to create and submit changes as patches see here:

http://mahout.apache.org/developers/how-to-contribute.html


As you dig deeper as others have pointed out earlier you may find that
there are a couple of rules of thumb from a business or data science
perspective that to make life easier. I'd suggest to collect these on
the web site as well.

Also make sure to track your work and what you want to contribute to in
our issue tracker (standard JIRA). For more information on what makes a
great issue see here:

http://mahout.apache.org/developers/issue-tracker.html


Looking forward to your contributions,
Isabel

Reply via email to