Can't reproduce on CentOS with Hadoop 2.2.0.
Running on hadoop, using
/home/andy/apache_builds/hadoop_bin/hadoop-2.2.0-src/hadoop-dist/target/hadoop-2.2.0/bin/hadoop
and HADOOP_CONF_DIR=
MAHOUT-JOB:
/home/andy/mahout_test/sandbox/mahout-trunk/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar
14/01/24 14:09:32 INFO common.AbstractJob: Command line arguments:
{--clustering=null,
--clusters=[/tmp/mahout-work-andy/reuters-kmeans-clusters],
--convergenceDelta=[0.5],
--distanceMeasure=[org.apache.mahout.common.distance.CosineDistanceMeasure],
--endPhase=[2147483647],
--input=[/tmp/mahout-work-andy/reuters-out-seqdir-sparse-kmeans/tfidf-vectors/],
--maxIter=[10], --method=[mapreduce], --numClusters=[20],
--output=[/tmp/mahout-work-andy/reuters-kmeans], --overwrite=null,
--startPhase=[0], --tempDir=[temp]}
I've attached the full log, in case that helps any.
> Date: Fri, 24 Jan 2014 10:55:14 -0800
> Subject: Re: cluster-reuters.sh broken in trunk
> From: [email protected]
> To: [email protected]
>
> Actually, getting the same error with a fresh svn checkout:
>
> 14/01/24 09:42:13 INFO driver.MahoutDriver: Program took 291353 ms
> (Minutes: 4.855883333333334)
> Running on hadoop, using /home/akm/hadoop-0.20.205.0/bin/hadoop and
> HADOOP_CONF_DIR=
> MAHOUT-JOB:
> /home/akm/mahout/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar
> 14/01/24 09:42:16 INFO common.AbstractJob: Command line arguments:
> {--clustering=null,
> --clusters=[/tmp/mahout-work-akm/reuters-kmeans-clusters],
> --convergenceDelta=[0.5],
> --distanceMeasure=[org.apache.mahout.common.distance.CosineDistanceMeasure],
> --endPhase=[2147483647],
> --input=[/tmp/mahout-work-akm/reuters-out-seqdir-sparse-kmeans/tfidf-vectors/],
> --maxIter=[10], --method=[mapreduce], --numClusters=[20],
> --output=[/tmp/mahout-work-akm/reuters-kmeans], --overwrite=null,
> --startPhase=[0], --tempDir=[temp]}
> 14/01/24 09:42:17 INFO common.HadoopUtil: Deleting
> /tmp/mahout-work-akm/reuters-kmeans-clusters
> 14/01/24 09:42:17 WARN util.NativeCodeLoader: Unable to load native-hadoop
> library for your platform... using builtin-java classes where applicable
> 14/01/24 09:42:17 INFO compress.CodecPool: Got brand-new compressor
> 14/01/24 09:42:17 INFO kmeans.RandomSeedGenerator: Wrote 20 Klusters to
> /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
> 14/01/24 09:42:17 INFO kmeans.KMeansDriver: Input:
> /tmp/mahout-work-akm/reuters-out-seqdir-sparse-kmeans/tfidf-vectors
> Clusters In: /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
> Out: /tmp/mahout-work-akm/reuters-kmeans Distance:
> org.apache.mahout.common.distance.CosineDistanceMeasure
> 14/01/24 09:42:17 INFO kmeans.KMeansDriver: convergence: 0.5 max
> Iterations: 10
> 14/01/24 09:42:17 INFO compress.CodecPool: Got brand-new decompressor
> Exception in thread "main" java.lang.IllegalStateException: No input clusters found in /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed. Check your -c argument.
>     at org.apache.mahout.clustering.kmeans.KMeansDriver.buildClusters(KMeansDriver.java:212)
>     at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:143)
>     at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:103)
>     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>     at org.apache.mahout.clustering.kmeans.KMeansDriver.main(KMeansDriver.java:47)
>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>     at java.lang.reflect.Method.invoke(Method.java:601)
>     at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
>     at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
>     at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>     at java.lang.reflect.Method.invoke(Method.java:601)
>     at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
>
>
>
> On Fri, Jan 24, 2014 at 10:07 AM, Andrew Musselman <
> [email protected]> wrote:
>
> > Yeah, disregard, my repo was out of whack.
> >
> >
> > On Fri, Jan 24, 2014 at 10:00 AM, ap.dev <[email protected]> wrote:
> >
> >> I'm not getting any exceptions there.
> >>
> >> -------- Original message --------
> >> From: Andrew Musselman <[email protected]>
> >> Date:01/24/2014 11:38 AM (GMT-05:00)
> >> To: [email protected]
> >> Subject: cluster-reuters.sh broken in trunk
> >>
> >> Last night I had this issue when testing out cluster-reuters.sh with no
> >> flags; anyone seen this recently?
> >>
> >> 14/01/23 22:03:54 INFO driver.MahoutDriver: Program took 286799 ms
> >> (Minutes: 4.779983333333333)
> >> Running on hadoop, using /home/akm/hadoop-0.20.205.0/bin/hadoop and
> >> HADOOP_CONF_DIR=
> >> MAHOUT-JOB:
> >> /home/akm/mahout/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar
> >> 14/01/23 22:03:57 INFO common.AbstractJob: Command line arguments:
> >> {--clustering=null,
> >> --clusters=[/tmp/mahout-work-akm/reuters-kmeans-clusters],
> >> --convergenceDelta=[0.5],
> >> --distanceMeasure=[org.apache.mahout.common.distance.CosineDistanceMeasure],
> >> --endPhase=[2147483647],
> >> --input=[/tmp/mahout-work-akm/reuters-out-seqdir-sparse-kmeans/tfidf-vectors/],
> >> --maxIter=[10], --method=[mapreduce], --numClusters=[20],
> >> --output=[/tmp/mahout-work-akm/reuters-kmeans], --overwrite=null,
> >> --startPhase=[0], --tempDir=[temp]}
> >> 14/01/23 22:03:57 WARN util.NativeCodeLoader: Unable to load native-hadoop
> >> library for your platform... using builtin-java classes where applicable
> >> 14/01/23 22:03:57 INFO compress.CodecPool: Got brand-new compressor
> >> 14/01/23 22:03:57 INFO kmeans.RandomSeedGenerator: Wrote 20 Klusters to
> >> /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
> >> 14/01/23 22:03:57 INFO kmeans.KMeansDriver: Input:
> >> /tmp/mahout-work-akm/reuters-out-seqdir-sparse-kmeans/tfidf-vectors
> >> Clusters In: /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
> >> Out: /tmp/mahout-work-akm/reuters-kmeans Distance:
> >> org.apache.mahout.common.distance.CosineDistanceMeasure
> >> 14/01/23 22:03:57 INFO kmeans.KMeansDriver: convergence: 0.5 max
> >> Iterations: 10
> >> 14/01/23 22:03:57 INFO compress.CodecPool: Got brand-new decompressor
> >> Exception in thread "main" java.lang.IllegalStateException: No input clusters found in /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed. Check your -c argument.
> >>     at org.apache.mahout.clustering.kmeans.KMeansDriver.buildClusters(KMeansDriver.java:212)
> >>     at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:143)
> >>     at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:103)
> >>     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
> >>     at org.apache.mahout.clustering.kmeans.KMeansDriver.main(KMeansDriver.java:47)
> >>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> >>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> >>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> >>     at java.lang.reflect.Method.invoke(Method.java:601)
> >>     at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
> >>     at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
> >>     at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
> >>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> >>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> >>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> >>     at java.lang.reflect.Method.invoke(Method.java:601)
> >>     at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
> >> $ hls /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
> >> Found 1 items
> >> -rw-r--r-- 1 akm supergroup 149 2014-01-23 22:03
> >> /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
> >>
> >>
> >> $ hcat /tmp/mahout-work-akm/reuters-kmeans-clusters/part-randomSeed
> >> SEQ
> >>
> >> org.apache.hadoop.io.Text5org.apache.mahout.clustering.iterator.ClusterWritable
> >> *org.apache.hadoop.io.compress.DefaultCodec�����M5�0ü��� $
> >>
> >
> >
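
A note on the hls listing quoted above: the 149-byte size of part-randomSeed can be checked against the SequenceFile header layout. The sketch below is an estimate under stated assumptions, not a confirmed diagnosis. It assumes the version 6 header layout (magic plus version byte, key class name, value class name, two compression flags, codec class name, metadata count, 16-byte sync marker), an empty metadata map, and one-byte length prefixes on the class names. The function names are made up for illustration; the class names are the ones visible in the hcat output.

```python
# Sketch: estimate the size of a SequenceFile header from its class names.
# Assumptions (not confirmed by the thread): version 6 layout, empty
# metadata map, class names shorter than 128 bytes.

def text_size(name: str) -> int:
    """Bytes used by a length-prefixed UTF-8 string with a 1-byte prefix."""
    encoded = name.encode("utf-8")
    if len(encoded) > 127:
        raise ValueError("multi-byte length prefixes are not handled here")
    return 1 + len(encoded)

def header_size(key_class: str, value_class: str, codec_class: str = "") -> int:
    size = 4                          # 'SEQ' magic + 1 version byte
    size += text_size(key_class)      # key class name
    size += text_size(value_class)    # value class name
    size += 2                         # compressed flag + block-compressed flag
    if codec_class:
        size += text_size(codec_class)  # codec name, present only if compressed
    size += 4                         # metadata entry count (assumed zero entries)
    size += 16                        # sync marker
    return size

if __name__ == "__main__":
    print(header_size(
        "org.apache.hadoop.io.Text",
        "org.apache.mahout.clustering.iterator.ClusterWritable",
        "org.apache.hadoop.io.compress.DefaultCodec",
    ))  # prints 149
```

Under those assumptions the header alone accounts for all 149 bytes, which would mean the file holds no cluster records even though RandomSeedGenerator logged 20 of them. That would be consistent with the IllegalStateException, but it does not explain why the records are missing.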