[ 
https://issues.apache.org/jira/browse/MAHOUT-520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12920560#action_12920560
 ] 

Jeff Eastman commented on MAHOUT-520:
-------------------------------------

The synthetic control examples use default parameter values when run without 
arguments, but users can also supply a full set of arguments if they wish. The 
following k-means arguments are supported, shown with their default values:

-i testdata 
-o output 
-dm org.apache.mahout.common.distance.EuclideanDistanceMeasure
-t1 80
-t2 55
-x 10 
-ow

Note that, for k-means, the number of clusters (-k) is computed by Canopy using 
the supplied -dm, -t1 and -t2 arguments, so -k is not a supported argument.
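
As a rough sketch, spelling out every default listed above explicitly would 
look something like the command this script prints. The launcher path 
(bin/mahout) and the driver class name are assumptions, since neither is 
stated in this comment, and the variable names are made up for illustration. 
The script only prints the command; it does not run Mahout.

```shell
#!/bin/sh
# Sketch only: print the k-means synthetic control command with the
# defaults from the list above written out explicitly.
# ASSUMPTION: launcher path and driver class are not given in this comment.
MAHOUT="${MAHOUT:-bin/mahout}"
DRIVER="${DRIVER:-org.apache.mahout.clustering.syntheticcontrol.kmeans.Job}"

# Hypothetical variable names; the values are the defaults listed above.
INPUT="${INPUT:-testdata}"
OUTPUT="${OUTPUT:-output}"
MEASURE="${MEASURE:-org.apache.mahout.common.distance.EuclideanDistanceMeasure}"
T1="${T1:-80}"
T2="${T2:-55}"
MAX_ITER="${MAX_ITER:-10}"

kmeans_command() {
  # No -k here: Canopy derives the cluster count from -dm, -t1 and -t2.
  printf '%s\n' "$MAHOUT $DRIVER -i $INPUT -o $OUTPUT -dm $MEASURE -t1 $T1 -t2 $T2 -x $MAX_ITER -ow"
}

kmeans_command
```

Overriding a single value, for example T1=90, changes only that argument and 
leaves the other defaults in place.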

> Add example scripts / integration tests for various algorithms.
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-520
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-520
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification
>    Affects Versions: 0.4
>            Reporter: Drew Farris
>            Assignee: Drew Farris
>            Priority: Minor
>         Attachments: MAHOUT-520-syntheticcontrol.patch, MAHOUT-520.patch, 
> MAHOUT-520.patch
>
>
> Scripts like build-reuters.sh are useful in that they both demonstrate 
> typical usage of Mahout from the command line and serve as integration 
> tests. We should add additional scripts that drive the algorithms so new 
> users can quickly run the examples. 
> Perhaps these can also be run from hudson as a part of the nightly builds and 
> can serve as integration tests.
> As a start towards this goal, provide a build-20news-bayes.sh example (in 
> the same vein as build-reuters.sh) that follows 
> https://cwiki.apache.org/MAHOUT/twenty-newsgroups.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
