[ 
https://issues.apache.org/jira/browse/MAHOUT-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13672796#comment-13672796
 ] 

Summer Lee edited comment on MAHOUT-1231 at 6/3/13 4:00 AM:
------------------------------------------------------------

Hi, Angel.
Are you sure that seqdirectory needs a directory of text files for making 
clusters before kmeans job?
I thought kmeans job needs one "part-r-00000" file in "tfidf-vectors" directory 
created from one seqdirectory input file.
So I tried seqdirectory job with two input files, the result "chunk-0" file is 
broken.

and I found some clue that "No input clusters found in" error is occured from 
seq2sparse job. 
I'm debugging in way of printing "System.out.prinln("~~")" ,building 
mahout-example-0.7 or 0.8-job.jar on eclipse and testing the job. 
Is it right? It is hard to find the point of error.

Is everybody doing well kmeans job in hadoop with oozie system?
                
      was (Author: marrystar):
    Hi, Angel.
Are you sure that seqdirectory needs a directory of text files for making 
clusters before kmeans job?
I thought kmeans job needs one "part-r-00000" file in "tfidf-vectors" directory 
created from one seqdirectory input file.
So I tried seqdirectory job with two input files, the result "chunk-0" file is 
broken.

and I found some clue that "No input clusters found in" error is occured from 
seq2sparse job. 
I'm debugging in way of printing "System.out.prinln("~~")" ,building 
mahout-example-0.7 or 0.8-job.jar on eclipse and testing the job. 
Is it right? It is hard to find the point of error.

Is Everybody doing well kmeans job in hadoop with oozie system?
                  
> "No input clusters found in " error in kmeans
> ---------------------------------------------
>
>                 Key: MAHOUT-1231
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1231
>             Project: Mahout
>          Issue Type: Question
>          Components: Clustering
>            Reporter: Summer Lee
>             Fix For: Backlog
>
>
> 1.seqdirectory
> > mahout seqdirectory --input /user/hdfs/input/new1.csv --output
> > /user/hdfs/new1/seqdirectory --tempDir
> > /user/hdfs/new1/seqdirectory/tempDir
> 2.seq2sparse 
> > mahout seq2sparse --input /user/hdfs/new1/seqdirectory --output
> > /user/hdfs/new1/seq2sparse -wt tfidf
> 3.kmeans 
> > mahout kmeans --input /user/hdfs/new1/seq2sparse/tfidf-vectors
> > --output /user/hdfs/new1/kmeans -c /user/hdfs/new1/clusters/kmeans -x 3 -k 
> > 3 --tempDir /user/hdfs/new1/kmeans/tempDir
> and then error is occured
> Failing Oozie Launcher, Main class [org.apache.mahout.driver.MahoutDriver], 
> main() threw exception, No input clusters found in 
> /user/oozie/mahout/z3/kmeansCopy/clusters/part-randomSeed. Check your -c 
> argument.
> java.lang.IllegalStateException: No input clusters found in 
> /user/oozie/mahout/z3/kmeansCopy/clusters/part-randomSeed. Check your -c 
> argument.
>       at 
> org.apache.mahout.clustering.kmeans.KMeansDriver.buildClusters(KMeansDriver.java:217)
>       at 
> org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:148)
>       at 
> org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:107)
>       at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>       at 
> org.apache.mahout.clustering.kmeans.KMeansDriver.main(KMeansDriver.java:48)
>       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>       at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>       at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>       at java.lang.reflect.Method.invoke(Method.java:597)
>       at 
> org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
>       at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
>       at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
>       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>       at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>       at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>       at java.lang.reflect.Method.invoke(Method.java:597)
>       at 
> org.apache.oozie.action.hadoop.LauncherMapper.map(LauncherMapper.java:467)
>       at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
>       at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:436)
>       at org.apache.hadoop.mapred.MapTask.run(MapTask.java:372)
>       at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
>       at java.security.AccessController.doPrivileged(Native Method)
>       at javax.security.auth.Subject.doAs(Subject.java:396)
>       at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1149)
>       at org.apache.hadoop.mapred.Child.main(Child.java:249)
> Oozie Launcher failed, finishing Hadoop job gracefully
> Oozie Launcher ends
> =======================================================================================================
> Why kmeans driver can't make clusters in Hadoop with oozie system?
> In hadoop with not oozie system, it worked.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to