ClusteringYourData (MAHOUT) edited by Grant Ingersoll
      Page: http://cwiki.apache.org/confluence/display/MAHOUT/ClusteringYourData
   Changes: http://cwiki.apache.org/confluence/pages/diffpagesbyversion.action?pageId=120583&originalVersion=1&revisedVersion=2

Content:
---------------------------------------------------------------------

+*Mahout_0.2*+

After you've done the [QuickStart] and are familiar with the basics of Mahout, 
it is time to cluster your own data. 

The following pieces *may* be useful in getting started:

h1. Input

For starters, you will need your data in an appropriate Vector format (the 
format has changed since Mahout 0.1).

h2. Text Preparation

* See [Creating Vectors from Text]
* http://www.lucidimagination.com/search/document/4a0e528982b2dac3/document_clustering
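
To make the idea of a vector representation concrete, here is a minimal sketch that turns raw text into dense term-frequency vectors. It is plain Python and does not use Mahout's Vector classes or its on-disk format; the function names {{build_vocabulary}} and {{to_vector}}, the whitespace tokenization, and the sample documents are all assumptions made for illustration.

```python
from collections import Counter


def build_vocabulary(docs):
    """Map each distinct lowercase token to a fixed vector index (hypothetical helper)."""
    vocab = {}
    for doc in docs:
        for token in doc.lower().split():
            # The first time a token is seen it gets the next free index.
            vocab.setdefault(token, len(vocab))
    return vocab


def to_vector(doc, vocab):
    """Return a dense term-frequency vector for one document (hypothetical helper)."""
    counts = Counter(doc.lower().split())
    vec = [0.0] * len(vocab)
    for token, n in counts.items():
        if token in vocab:  # tokens outside the vocabulary are ignored
            vec[vocab[token]] = float(n)
    return vec


# Made-up sample documents, for illustration only.
docs = ["apache mahout clustering", "clustering text with mahout mahout"]
vocab = build_vocabulary(docs)
vectors = [to_vector(d, vocab) for d in docs]
```

Each document becomes one vector whose length equals the vocabulary size. A real pipeline would typically add proper tokenization, stop-word removal and weighting such as TF-IDF, and would store the result sparsely.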

h1. Running the Process

+*TODO*+ FILL ME IN

h2. Canopy

h2. kMeans
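
Until this section is filled in, the following may help as a reminder of what the algorithm does. It is a minimal single-machine sketch of the standard k-means iteration (assign each point to its nearest center, then recompute each center as the mean of its points). It is not Mahout's implementation and says nothing about how Mahout's job is invoked; the function names, the {{iterations}} and {{seed}} parameters, and the sample points are assumptions.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def kmeans(points, k, iterations=20, seed=0):
    """Return (centers, clusters) after at most `iterations` rounds."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # pick k distinct input points as initial centers
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: squared_distance(p, centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster;
        # an empty cluster keeps its previous center.
        new_centers = [mean(c) if c else centers[i] for i, c in enumerate(clusters)]
        if new_centers == centers:  # no center moved, so the result is stable
            break
        centers = new_centers
    return centers, clusters


# Made-up sample data: two well-separated groups of three points each.
points = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
          [10.0, 10.0], [10.0, 11.0], [11.0, 10.0]]
centers, clusters = kmeans(points, k=2)
```

On this data the two groups end up in separate clusters. In general the result depends on the initial centers, which is one reason a seeding step such as Canopy is often run first.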

h2. Dirichlet

h2. Mean-shift

h1. Validating the Output


* See http://www.lucidimagination.com/search/document/dab8c1f3c3addcfe/validating_clustering_output

h1. References

* [Mahout archive references|http://www.lucidimagination.com/search/p:mahout?q=clustering]

---------------------------------------------------------------------