Re: Query on clusterdumper output and clusteredPoints

Suneel Marthi Wed, 14 Dec 2011 11:06:52 -0800

Increase the JVM heap size when running the clusterdump program to avoid
the out-of-memory (OOM) error.
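
As a rough sketch of what that could look like, assuming your bin/mahout launcher script reads the MAHOUT_HEAPSIZE environment variable (a value in MB; please check the script shipped with your version) and reusing the paths quoted below in this thread. The value 2048 is only an illustrative guess, not a recommendation:

```shell
# Assumption: bin/mahout turns MAHOUT_HEAPSIZE (in MB) into the JVM -Xmx setting.
# 2048 is an arbitrary example value; size it to your data and machine.
export MAHOUT_HEAPSIZE=2048

# Same clusterdump call as in the thread, with the pointsDir option that hit OOM.
$MAHOUT_HOME/bin/mahout clusterdump \
  --seqFileDir output/clusters-6 \
  --pointsDir output/clusteredPoints \
  --output clusteranalyze.txt
```

If the launcher hands the job to the hadoop command instead of starting a local JVM, the Hadoop client heap setting may be the one that matters; that depends on your setup.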
________________________________
 From: ipshita chatterji <[email protected]>
To: [email protected] 
Sent: Wednesday, December 14, 2011 12:37 PM
Subject: Re: Query on clusterdumper output and clusteredPoints
 
For clusterdumper I used the following command line:

$MAHOUT_HOME/bin/mahout clusterdump --seqFileDir output/clusters-6 \
    --output clusteranalyze.txt

I have written a separate program to read the clusteredOutput directory,
because clusterdumper with "--pointsDir output/clusteredPoints" was giving
an OOM exception.

Thanks

On Wed, Dec 14, 2011 at 10:06 PM, Gary Snider <[email protected]> wrote:
> What was on your command line?  e.g. seqFileDir, pointsDir, etc
>
> On Wed, Dec 14, 2011 at 10:54 AM, ipshita chatterji 
> <[email protected]> wrote:
>
>> Hi,
>>
>> I am new to Mahout and have only elementary knowledge of
>> clustering. I managed to cluster my data using meanshift and then ran
>> clusterdumper, which gave the following output:
>>
>> MSV-21{n=1 c=[1:0...........]
>>
>> So I assume that the cluster above has converged and that n=1 indicates
>> that only one point is associated with it.
>>
>> Now I try to read the members of this cluster from the "clusteredPoints"
>> directory. The output shows that the number of points belonging to this
>> cluster is 173.
>>
>> Why is this mismatch happening? Am I missing something here?
>>
>> Thanks,
>> Ipshita
>>
