The input should be a sequence file. Maybe that's the error.
On 01-08-2012 22:30, Kate Ericson wrote:
Hi, From the error message, it's tripping over "1K-blocks" when it's expecting a long. Is that somewhere in your input file ("F:/docsite/CIIndex/index/out.txt")? Or perhaps part of your hadoop configuration files? From the stack trace, it doesn't look like it got into mahout or kmeans, but instead crashed when setting up the mapreduce job in hadoop. -Kate On Wed, Aug 1, 2012 at 9:05 AM, kiran kumar <[email protected]> wrote:Hello, I am trying to run kmeans algorithm with these parameters *$ $MAHOUT_HOME/bin/mahout -core kmeans -i "F:/docsite/CIIndex/index/out.txt" -o $MAHOUT_HOME/docsite-kmeans-cosine-random-clusters -dm org.apache.mahout.co* *mmon.distance.CosineDistanceMeasure -cd 0.0000001 --numClusters 40 -x 40 --clusters $MAHOUT_HOME/docs-random-cosine-clusters -cl* * * I got this error and i didn't find any results from google. can you please help me in fixing this. *java.lang.NumberFormatException: For input string: "1K-blocks"* at java.lang.NumberFormatException.forInputString(NumberFormatException.java:48) at java.lang.Long.parseLong(Long.java:419) at java.lang.Long.parseLong(Long.java:468) at org.apache.hadoop.fs.DF.parseExecResult(DF.java:122) at org.apache.hadoop.util.Shell.runCommand(Shell.java:179) at org.apache.hadoop.util.Shell.run(Shell.java:134) at org.apache.hadoop.fs.DF.getAvailable(DF.java:73) at org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:329) at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:124) at org.apache.hadoop.mapred.MapOutputFile.getSpillFileForWrite(MapOutputFile.java:107) at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.sortAndSpill(MapTask.java:1221) at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:1129) at org.apache.hadoop.mapred.MapTask$NewOutputCollector.close(MapTask.java:549) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:623) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177) 12/08/01 20:29:39 INFO mapred.JobClient: Job complete: job_local_0001 12/08/01 20:29:39 INFO mapred.JobClient: Counters: 0 Exception in thread "main" java.lang.InterruptedException: K-Means Iteration failed processing F:/mahout/docs-random-cosine-clusters/part-randomSeed at org.apache.mahout.clustering.kmeans.KMeansDriver.runIteration(KMeansDriver.java:371) at org.apache.mahout.clustering.kmeans.KMeansDriver.buildClustersMR(KMeansDriver.java:316) at org.apache.mahout.clustering.kmeans.KMeansDriver.buildClusters(KMeansDriver.java:239) at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:154) at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:112) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.mahout.clustering.kmeans.KMeansDriver.main(KMeansDriver.java:61) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68) at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139) at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:188) -- Thanks & Regards, Kiran Kumar
