Error running RecommenderJob using mahout-core-0.5-cdh3u4-job.jar

tmefrt Wed, 05 Sep 2012 23:52:53 -0700

Hi All

I'm trying to test the item recommendation. using the command



hadoop jar /usr/lib/mahout/mahout-core-0.5-cdh3u4-job.jar
org.apache.mahout.cf.taste.hadoop.item.RecommenderJob
-Dmapred.input.dir=/user/etl_user/itemrecco/in_file.txt
-Dmapred.output.dir=/user/etl_user/itemreccooutput

Input file

 cat in_file.txt
1,101,5.0
1,102,3.0
1,103,2.5
2,101,2.0
2,102,2.5
2,103,5.0
2,104,2.0
3,101,2.5
3,104,4.0
3,105,4.5
3,107,5.0
4,101,5.0
4,103,3.0
4,104,4.5
4,106,4.0
5,101,4.0
5,102,3.0
5,103,2.0
5,104,4.0
5,105,3.5
5,106,4.0


I'm getting below error from the log


12/09/06 01:28:28 INFO mapred.JobClient:  
org.apache.mahout.cf.taste.hadoop.MaybePruneRowsMapper$Elements
12/09/06 01:28:28 INFO mapred.JobClient:     NEGLECTED=0
12/09/06 01:28:28 INFO mapred.JobClient:     USED=21
12/09/06 01:28:28 INFO mapred.JobClient:   Job Counters
12/09/06 01:28:28 INFO mapred.JobClient:     Launched reduce tasks=72
12/09/06 01:28:28 INFO mapred.JobClient:     SLOTS_MILLIS_MAPS=142128
12/09/06 01:28:28 INFO mapred.JobClient:     Total time spent by all reduces
waiting after reserving slots (ms)=0
12/09/06 01:28:28 INFO mapred.JobClient:     Total time spent by all maps
waiting after reserving slots (ms)=0
12/09/06 01:28:28 INFO mapred.JobClient:     Launched map tasks=72
12/09/06 01:28:28 INFO mapred.JobClient:     Data-local map tasks=72
12/09/06 01:28:28 INFO mapred.JobClient:     SLOTS_MILLIS_REDUCES=806019
12/09/06 01:28:28 INFO mapred.JobClient:   FileSystemCounters
12/09/06 01:28:28 INFO mapred.JobClient:     FILE_BYTES_READ=1755source
12/09/06 01:28:28 INFO mapred.JobClient:     HDFS_BYTES_READ=18905
12/09/06 01:28:28 INFO mapred.JobClient:     FILE_BYTES_WRITTEN=199593
12/09/06 01:28:28 INFO mapred.JobClient:     HDFS_BYTES_WRITTEN=7222
12/09/06 01:28:28 INFO mapred.JobClient:   Map-Reduce Framework
12/09/06 01:28:28 INFO mapred.JobClient:     Reduce input groups=7
12/09/06 01:28:28 INFO mapred.JobClient:     Combine output records=0
12/09/06 01:28:28 INFO mapred.JobClient:     Map input records=5
12/09/06 01:28:28 INFO mapred.JobClient:     Reduce shuffle bytes=71922
12/09/06 01:28:28 INFO mapred.JobClient:     Reduce output records=7
12/09/06 01:28:28 INFO mapred.JobClient:     Spilled Records=42
12/09/06 01:28:28 INFO mapred.JobClient:     Map output bytes=420
12/09/06 01:28:28 INFO mapred.JobClient:     Combine input records=0
12/09/06 01:28:28 INFO mapred.JobClient:     Map output records=21
12/09/06 01:28:28 INFO mapred.JobClient:     SPLIT_RAW_BYTES=11304
12/09/06 01:28:28 INFO mapred.JobClient:     Reduce input records=21
12/09/06 01:28:28 ERROR common.AbstractJob: Unexpected 101 while processing
Job-Specific Options:
usage: <command> [Generic Options] [Job-Specific Options]
Generic Options:
 -archives <paths>              comma separated archives to be unarchived
                                on the compute machines.
 -conf <configuration file>     specify an application configuration file
 -D <property=value>            use value for given property
 -files <paths>                 comma separated files to be copied to the
                                map reduce cluster
 -fs <local|namenode:port>      specify a namenode
 -jt <local|jobtracker:port>    specify a job tracker
 -libjars <paths>               comma separated jar files to include in
                                the classpath.
 -tokenCacheFile <tokensFile>   name of the file with the tokens
Unexpected 101 while processing Job-Specific Options:
Usage:
 [--input <input> --output <output> --numberOfColumns <numberOfColumns>
--similarityClassname <similarityClassname> --maxSimilaritiesPerRow
<maxSimilaritiesPerRow> --help --tempDir <tempDir> --startPhase <startPhase>
--endPhase <endPhase>]
Job-Specific Options:
  --input (-i) input                                    Path to job input
                                                        directory.
  --output (-o) output                                  The directory
pathname
                                                        for output.
  --numberOfColumns (-r) numberOfColumns                Number of columns in
                                                        the input matrix
  --similarityClassname (-s) similarityClassname        Name of distributed
                                                        similarity class to
                                                        instantiate,
                                                        alternatively use
one
                                                        of the predefined
                                                        similarities
                                                       
([SIMILARITY_COOCCURRENC
                                                        E,
                                                       
SIMILARITY_EUCLIDEAN_DIS
                                                        TANCE,
                                                       
SIMILARITY_LOGLIKELIHOOD
                                                        ,
                                                       
SIMILARITY_PEARSON_CORRE
                                                        LATION,
                                                       
SIMILARITY_TANIMOTO_COEF
                                                        FICIENT,
                                                       
SIMILARITY_UNCENTERED_CO
                                                        SINE,
                                                       
SIMILARITY_UNCENTERED_ZE
                                                        RO_ASSUMING_COSINE,
                                                       
SIMILARITY_CITY_BLOCK])
  --maxSimilaritiesPerRow (-m) maxSimilaritiesPerRow    Number of maximum
                                                        similarities per row
                                                        (default: 100)
  --help (-h)                                           Print out help
  --tempDir tempDir                                     Intermediate output
                                                        directory
  --startPhase startPhase                               First phase to run
  --endPhase endPhase                                   Last phase to run
12/09/06 01:28:28 INFO mapred.JobClient: Cleaning up the staging area
hdfs://hadoop-namenode-2.v39.ch3.caracal.com/tmp/hadoop-mapred/mapred/staging/etl_user/.staging/job_201205291818_31228
Exception in thread "main"
org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does
not exist: temp/similarityMatrix
        at
org.apache.hadoop.mapreduce.lib.input.FileInputFormat.listStatus(FileInputFormat.java:231)
        at
org.apache.hadoop.mapreduce.lib.input.SequenceFileInputFormat.listStatus(SequenceFileInputFormat.java:55)
        at
org.apache.hadoop.mapreduce.lib.input.FileInputFormat.getSplits(FileInputFormat.java:248)
        at
org.apache.hadoop.mapred.JobClient.writeNewSplits(JobClient.java:899)
        at
org.apache.hadoop.mapred.JobClient.writeSplits(JobClient.java:916)
        at org.apache.hadoop.mapred.JobClient.access$500(JobClient.java:170)
        at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:834)
        at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:793)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:396)
        at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1063)
        at
org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:793)
        at org.apache.hadoop.mapreduce.Job.submit(Job.java:465)
        at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:495)
        at
org.apache.mahout.cf.taste.hadoop.item.RecommenderJob.run(RecommenderJob.java:239)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at
org.apache.mahout.cf.taste.hadoop.item.RecommenderJob.main(RecommenderJob.java:333)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
        at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.util.RunJar.main(RunJar.java:186)


Greatly appreciate, you help in identifying..





--
View this message in context: 
http://lucene.472066.n3.nabble.com/Error-running-RecommenderJob-using-mahout-core-0-5-cdh3u4-job-jar-tp4005786.html
Sent from the Mahout User List mailing list archive at Nabble.com.

Error running RecommenderJob using mahout-core-0.5-cdh3u4-job.jar

Reply via email to