Increase the JVM heap size when running the clusterdump program to avoid the out-of-memory (OOM) error.
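A minimal sketch of one way to do that, assuming your bin/mahout script reads MAHOUT_HEAPSIZE (in MB) for local runs and that the hadoop launcher reads HADOOP_HEAPSIZE (in MB) when the job is started through Hadoop. Check both scripts for your versions. The 4096 value is only an illustration, not a recommendation; the command line is the one from the message below, with --pointsDir added back.

```shell
# Assumed: bin/mahout honours MAHOUT_HEAPSIZE (MB) when it starts the JVM itself.
export MAHOUT_HEAPSIZE=4096

# Assumed: when bin/mahout delegates to the hadoop launcher, the client JVM
# heap comes from HADOOP_HEAPSIZE (MB) instead.
export HADOOP_HEAPSIZE=4096

# Rerun the dump with the points directory, only if the launcher is present.
if [ -x "$MAHOUT_HOME/bin/mahout" ]; then
  "$MAHOUT_HOME/bin/mahout" clusterdump \
    --seqFileDir output/clusters-6 \
    --pointsDir output/clusteredPoints \
    --output clusteranalyze.txt
fi
```

If the error persists, raise the value further; clusterdump appears to load the clustered points into the client JVM, so the heap it needs grows with the number of points.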
________________________________
From: ipshita chatterji <[email protected]>
To: [email protected]
Sent: Wednesday, December 14, 2011 12:37 PM
Subject: Re: Query on clusterdumper output and clusteredPoints

For clusterdumper I had the following command line:

$MAHOUT_HOME/bin/mahout clusterdump --seqFileDir output/clusters-6 --output clusteranalyze.txt

I have written a separate program to read the clusteredOutput directory, as clusterdumper with "--pointsDir output/clusteredPoints" was giving an OOM exception.

Thanks

On Wed, Dec 14, 2011 at 10:06 PM, Gary Snider <[email protected]> wrote:

> What was on your command line? e.g. seqFileDir, pointsDir, etc
>
> On Wed, Dec 14, 2011 at 10:54 AM, ipshita chatterji <[email protected]> wrote:
>
>> Hi,
>>
>> I am a newbie in Mahout and also have elementary knowledge of
>> clustering. I managed to cluster my data using meanshift and then ran
>> clusterdumper. I get the following output:
>>
>> MSV-21{n=1 c=[1:0...........]
>>
>> So I assume that the cluster above has converged and n=1 indicates
>> that there is only one point associated with the cluster above.
>>
>> Now I try to read the members of this cluster from the "clusteredPoints"
>> directory. I see from the output that the number of points belonging to this
>> cluster is 173.
>>
>> Why is this mismatch happening? Am I missing something here?
>>
>> Thanks,
>> Ipshita
>>
