Thanks Pat,
But how exactly can I run clusterdump using the -evaluate (-e) parameter?
When i try to run it for example:
./mahout clusterdump -i /user/Data-output/clusters-1-final -o analyze.txt
--evaluate
I get a Java null pointer Exception
14/05/19 15:02:03 INFO common.AbstractJob: Command line arguments:
{--dictionaryType=[text],
--distanceMeasure=[org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure],
--endPhase=[2147483647], --evaluate=null,
--input=[/user/Data-output/clusters-1-final], --output=[analyze.txt],
--outputFormat=[TEXT], --startPhase=[0], --tempDir=[temp]}
Exception in thread "main" java.lang.NullPointerException
Do I have to put a parameter to evaluate? As input for clusterdump I am using
the output with the clusters after running mahout K-Means.
> Subject: Re: Mahout K-Means - Quality of the clusters
> From: [email protected]
> Date: Sat, 17 May 2014 09:43:59 -0700
> To: [email protected]
>
> mahout clusterdump —evaluate …
>
> provides some stats
>
> On May 15, 2014, at 10:23 PM, hiroshi leon <[email protected]> wrote:
>
> Hello everybody,
>
> Do you know how can I get the MSE of the clusters in mahout K-Means?
> I would like to check the quality of the clusters. Thanks!
>
>
>