Space: Apache Mahout (https://cwiki.apache.org/confluence/display/MAHOUT)
Page: k-means-commandline
(https://cwiki.apache.org/confluence/display/MAHOUT/k-means-commandline)
Comment:
https://cwiki.apache.org/confluence/display/MAHOUT/k-means-commandline?focusedCommentId=27841426#comment-27841426
Comment added by yexq:
---------------------------------------------------------------------
[hadoop@RH01 mahout-work]$ mahout kmeans -i
mahout-work/reuters-out-seqdir-sparse-kmeans/tfidf-vectors/ -c
mahout-work/reuters-kmeans-clusters -o mahout-work/reuters-kmeans -x 10 -k 20
-ow
Running on hadoop, using HADOOP_HOME=/mnt/userspace/hadoop-0.20.2
HADOOP_CONF_DIR=/mnt/userspace/hadoop-0.20.2/conf
12/03/19 15:56:04 INFO common.AbstractJob: Command line arguments:
{--clusters=mahout-work/reuters-kmeans-clusters, --convergenceDelta=0.5,
--distanceMeasure=org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure,
--endPhase=2147483647,
--input=mahout-work/reuters-out-seqdir-sparse-kmeans/tfidf-vectors/,
--maxIter=10, --method=mapreduce, --numClusters=20,
--output=mahout-work/reuters-kmeans, --overwrite=null, --startPhase=0,
--tempDir=temp}
12/03/19 15:56:04 INFO common.HadoopUtil: Deleting
mahout-work/reuters-kmeans-clusters
12/03/19 15:56:05 INFO util.NativeCodeLoader: Loaded the native-hadoop library
12/03/19 15:56:05 INFO zlib.ZlibFactory: Successfully loaded & initialized
native-zlib library
12/03/19 15:56:05 INFO compress.CodecPool: Got brand-new compressor
Exception in thread "main" java.lang.IndexOutOfBoundsException: Index: 0, Size: 0
at java.util.ArrayList.RangeCheck(ArrayList.java:547)
at java.util.ArrayList.get(ArrayList.java:322)
at
org.apache.mahout.clustering.kmeans.RandomSeedGenerator.buildRandom(RandomSeedGenerator.java:108)
at
org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:101)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at
org.apache.mahout.clustering.kmeans.KMeansDriver.main(KMeansDriver.java:58)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:187)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
[hadoop@RH01 mahout-work]$
Change your notification preferences:
https://cwiki.apache.org/confluence/users/viewnotifications.action