I think scripts which help users understand the usage of the various
algorithms will be helpful. For the 0.5 release, if some of the
algorithms have necessary scripts associated with them, it will make
it easy for people interested in contributing to run the tests and
look at the code. While testing the Bayes classifier that was one of
the issues I faced.

On Fri, Oct 8, 2010 at 8:40 PM, Ted Dunning <[email protected]> wrote:
> I will build a few SGD based classifier scripts.
>
> On Fri, Oct 8, 2010 at 12:29 PM, Drew Farris <[email protected]> wrote:
>
>> Perhaps it would be easy for the individuals doing tests for 0.4 to at
>> least take a transcript of the commands they're using so that they can
>> eventually be changed into these sorts of scripts.
>>
>> On Fri, Oct 8, 2010 at 3:25 PM, Robin Anil <[email protected]> wrote:
>> > +1 for integration script
>> >
>> > On Sat, Oct 9, 2010 at 12:52 AM, Drew Farris <[email protected]> wrote:
>> >
>> >> It sure would be really nice if we had more integration tests /
>> >> example scripts for the various algorithms like build-reuters.sh
>> >> script. These capture problems with the system in the way real users
>> >> are likely to first encounter it, and provide an easy way for new
>> >> users to understand the steps of using mahout externally to the wiki.
>> >> If we were really smart, we'd run them automatically from hudson as a
>> >> separate sanity check and then use something like gist to publish them
>> >> to confluence automatically so our examples would always be up to
>> >> date. But I get ahead of myself.
>> >>
>> >> Would something like the script attached to
>> >> https://issues.apache.org/jira/browse/MAHOUT-520, which adds a script
>> >> to run the bayes 20newsgroups example, be appropriate to commit at
>> >> this point?
>> >>
>> >> Drew
>> >>
>> >
>>
>

Reply via email to