I am trying to run GroupLensRecommender using the pseudo-distributed RecomenderJob. Isn't that allowed?
hadoop jar recommender.jar org.apache.mahout.cf.taste.hadoop.pseudo.RecommenderJob -Dmapred.input.dir=testdata/ratings.csv -Dmapred.output.dir=output --recommenderClassName org.apache.mahout.cf.taste.example.grouplens.GroupLensRecommender --numRecommendations 10 On Sat, Nov 6, 2010 at 4:55 AM, Sean Owen <[email protected]> wrote: > I'm not sure what the context is -- GroupLensRecommender has nothing to do > with Hadoop. So I'm not surprised this doesn't work. > > On Fri, Nov 5, 2010 at 8:57 PM, Sanjib Kumar Das <[email protected] > >wrote: > > > Hi all, > > > > I am trying to run the GroupLensRecommender on hadoop. > > I am using the pseudo.RecommenderJob for that purpose. > > Before running the command below i copied > mahout-core-0.4-SNAPSHOT-job.jar > > and renamed it to recommender.jar. I updated recommender.jar by adding > > grouplens.jar to it. > > Still I am getting the ClassNotFoundException for GroupLensRecommender. > Am > > I > > missing something? > > Any help in this regard is highly appreciated. > > > > > > hadoop jar recommender.jar > > org.apache.mahout.cf.taste.hadoop.pseudo.RecommenderJob > > -Dmapred.input.dir=testdata/ratings.csv -Dmapred.output.dir=output > > --recommenderClassName > > org.apache.mahout.cf.taste.example.grouplens.GroupLensRecommender > > --numRecommendations 10 > > > > 10/11/05 15:44:34 INFO common.AbstractJob: Command line arguments: > > {--endPhase=2147483647, --numRecommendations=10, > > > > > --recommenderClassName=org.apache.mahout.cf.taste.example.grouplens.GroupLensRecommender, > > --startPhase=0, --tempDir=temp} > > 10/11/05 15:44:34 INFO jvm.JvmMetrics: Initializing JVM Metrics with > > processName=JobTracker, sessionId= > > 10/11/05 15:44:35 INFO input.FileInputFormat: Total input paths to > process > > : > > 1 > > 10/11/05 15:44:36 INFO mapred.JobClient: Running job: job_local_0001 > > 10/11/05 15:44:36 INFO input.FileInputFormat: Total input paths to > process > > : > > 1 > > 10/11/05 15:44:36 INFO mapred.MapTask: io.sort.mb = 100 > > 10/11/05 15:44:36 INFO mapred.MapTask: data buffer = 79691776/99614720 > > 10/11/05 15:44:36 INFO mapred.MapTask: record buffer = 262144/327680 > > 10/11/05 15:44:36 INFO mapred.MapTask: Spilling map output: record full = > > true > > 10/11/05 15:44:36 INFO mapred.MapTask: bufstart = 0; bufend = 514677; > > bufvoid = 99614720 > > 10/11/05 15:44:36 INFO mapred.MapTask: kvstart = 0; kvend = 262144; > length > > = > > 327680 > > 10/11/05 15:44:37 INFO mapred.JobClient: map 0% reduce 0% > > 10/11/05 15:44:38 INFO util.NativeCodeLoader: Loaded the native-hadoop > > library > > 10/11/05 15:44:38 INFO zlib.ZlibFactory: Successfully loaded & > initialized > > native-zlib library > > 10/11/05 15:44:38 INFO compress.CodecPool: Got brand-new compressor > > 10/11/05 15:44:39 INFO mapred.MapTask: Finished spill 0 > > 10/11/05 15:44:39 INFO mapred.MapTask: Spilling map output: record full = > > true > > 10/11/05 15:44:39 INFO mapred.MapTask: bufstart = 514677; bufend = > 1038963; > > bufvoid = 99614720 > > 10/11/05 15:44:39 INFO mapred.MapTask: kvstart = 262144; kvend = 196607; > > length = 327680 > > 10/11/05 15:44:42 INFO mapred.LocalJobRunner: > > 10/11/05 15:44:42 INFO mapred.MapTask: Finished spill 1 > > 10/11/05 15:44:42 INFO mapred.MapTask: Spilling map output: record full = > > true > > 10/11/05 15:44:42 INFO mapred.MapTask: bufstart = 1038963; bufend = > > 1563249; > > bufvoid = 99614720 > > 10/11/05 15:44:42 INFO mapred.MapTask: kvstart = 196607; kvend = 131070; > > length = 327680 > > 10/11/05 15:44:43 INFO mapred.JobClient: map 58% reduce 0% > > 10/11/05 15:44:44 INFO mapred.MapTask: Finished spill 2 > > 10/11/05 15:44:45 INFO mapred.LocalJobRunner: > > 10/11/05 15:44:45 INFO mapred.MapTask: Starting flush of map output > > 10/11/05 15:44:46 INFO mapred.JobClient: map 90% reduce 0% > > 10/11/05 15:44:47 INFO mapred.MapTask: Finished spill 3 > > 10/11/05 15:44:47 INFO mapred.Merger: Merging 4 sorted segments > > 10/11/05 15:44:47 INFO compress.CodecPool: Got brand-new decompressor > > 10/11/05 15:44:47 INFO compress.CodecPool: Got brand-new decompressor > > 10/11/05 15:44:47 INFO compress.CodecPool: Got brand-new decompressor > > 10/11/05 15:44:47 INFO compress.CodecPool: Got brand-new decompressor > > 10/11/05 15:44:47 INFO mapred.Merger: Down to the last merge-pass, with 4 > > segments left of total size: 17968 bytes > > 10/11/05 15:44:48 INFO mapred.LocalJobRunner: > > 10/11/05 15:44:49 INFO mapred.JobClient: map 100% reduce 0% > > 10/11/05 15:44:51 INFO mapred.LocalJobRunner: > > 10/11/05 15:44:52 INFO mapred.TaskRunner: > > Task:attempt_local_0001_m_000000_0 > > is done. And is in the process of commiting > > 10/11/05 15:44:52 INFO mapred.LocalJobRunner: > > 10/11/05 15:44:52 INFO mapred.TaskRunner: Task > > 'attempt_local_0001_m_000000_0' done. > > 10/11/05 15:44:52 INFO mapred.LocalJobRunner: > > 10/11/05 15:44:52 INFO mapred.Merger: Merging 1 sorted segments > > 10/11/05 15:44:52 INFO mapred.Merger: Down to the last merge-pass, with 1 > > segments left of total size: 17813 bytes > > 10/11/05 15:44:52 INFO mapred.LocalJobRunner: > > 10/11/05 15:44:53 INFO file.FileDataModel: Creating FileDataModel for > file > > /tmp/mahout-taste-hadoop3533164687715963034txt > > 10/11/05 15:44:53 INFO file.FileDataModel: Reading file info... > > 10/11/05 15:44:58 INFO mapred.LocalJobRunner: reduce > reduce > > 10/11/05 15:44:59 INFO mapred.JobClient: map 100% reduce 74% > > 10/11/05 15:45:08 INFO file.FileDataModel: Processed 1000000 lines > > 10/11/05 15:45:08 INFO file.FileDataModel: Read lines: 1000209 > > 10/11/05 15:45:11 INFO model.GenericDataModel: Processed 6040 users > > 10/11/05 15:45:11 WARN mapred.LocalJobRunner: job_local_0001 > > java.lang.IllegalStateException: java.lang.ClassNotFoundException: > > org.apache.mahout.cf.taste.example.grouplens.GroupLensRecommender > > >
