ok, this one? https://cwiki.apache.org/confluence/display/MAHOUT/Collections
would have been better if it's linked to the "Getting Started " steps On Fri, Oct 21, 2011 at 2:06 PM, Yang <[email protected]> wrote: > sorry I'm new to mahout, but could you point out where is the "netflix > dataset " ? the wiki does not show it, nor can I find it in the source > code: > > $ find .. -iname "*netflix*" |grep -v '\.svn' |grep -v \.class > ../examples/src/main/java/org/apache/mahout/cf/taste/example/netflix > ../examples/src/main/java/org/apache/mahout/cf/taste/example/netflix/NetflixRecommenderBuilder.java > ../examples/src/main/java/org/apache/mahout/cf/taste/example/netflix/NetflixDataModel.java > ../examples/src/main/java/org/apache/mahout/cf/taste/example/netflix/NetflixRecommender.java > ../examples/src/main/java/org/apache/mahout/cf/taste/example/netflix/NetflixRecommenderEvaluatorRunner.java > ../examples/src/main/java/org/apache/mahout/cf/taste/example/netflix/NetflixFileDataModel.java > > > > > On Fri, Oct 21, 2011 at 2:03 PM, Sean Owen <[email protected]> wrote: >> Again -- wrong input file. See Lance's message to you, which >> specifically noted that it's ratings.dat, not movies.dat, that you >> need to feed. I think that's also pretty self-evident that you feed >> ratings to the recommender. The error also clearly shows it's trying >> to parse movie-related strings as input, which is another clue. >> >> On Fri, Oct 21, 2011 at 10:01 PM, Yang <[email protected]> wrote: >>> just tried the grouplens one, got some new errors: >>> >>> >>> $ mvn -X -e -q exec:java >>> -Dexec.mainClass="org.apache.mahout.cf.taste.example.grouplens.GroupLensRecommenderEvaluatorRunner" >>> -Dexec.args="-i >>> /home/yyang/work/mahout/trunk/examples/data/ml-1m/movies.dat" >>> >>> >>> >>> [DEBUG] joining on thread >>> Thread[org.apache.mahout.cf.taste.example.grouplens.GroupLensRecommenderEvaluatorRunner.main(),5,org.apache.mahout.cf.taste.example.grouplens.GroupLensRecommenderEvaluatorRunner] >>> [DEBUG] Setting accessibility to true in order to invoke main(). >>> 11/10/21 14:00:20 INFO file.FileDataModel: Creating FileDataModel for >>> file /tmp/ratings.txt >>> 11/10/21 14:00:20 INFO file.FileDataModel: Reading file info... >>> [INFO] >>> ------------------------------------------------------------------------ >>> [ERROR] BUILD ERROR >>> [INFO] >>> ------------------------------------------------------------------------ >>> [INFO] An exception occured while executing the Java class. null >>> >>> For input string: "Toy Story (1995)" >>> [INFO] >>> ------------------------------------------------------------------------ >>> [DEBUG] Trace >>> org.apache.maven.lifecycle.LifecycleExecutionException: An exception >>> occured while executing the Java class. null >>> at >>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoals(DefaultLifecycleExecutor.java:719) >>> at >>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeStandaloneGoal(DefaultLifecycleExecutor.java:569) >>> at >>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoal(DefaultLifecycleExecutor.java:539) >>> at >>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoalAndHandleFailures(DefaultLifecycleExecutor.java:387) >>> at >>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeTaskSegments(DefaultLifecycleExecutor.java:348) >>> at >>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.execute(DefaultLifecycleExecutor.java:180) >>> at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:328) >>> at org.apache.maven.DefaultMaven.execute(DefaultMaven.java:138) >>> at org.apache.maven.cli.MavenCli.main(MavenCli.java:362) >>> at >>> org.apache.maven.cli.compat.CompatibleMain.main(CompatibleMain.java:60) >>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >>> at >>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) >>> at >>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) >>> at java.lang.reflect.Method.invoke(Method.java:616) >>> at >>> org.codehaus.classworlds.Launcher.launchEnhanced(Launcher.java:315) >>> at org.codehaus.classworlds.Launcher.launch(Launcher.java:255) >>> at >>> org.codehaus.classworlds.Launcher.mainWithExitCode(Launcher.java:430) >>> at org.codehaus.classworlds.Launcher.main(Launcher.java:375) >>> Caused by: org.apache.maven.plugin.MojoExecutionException: An >>> exception occured while executing the Java class. null >>> at org.codehaus.mojo.exec.ExecJavaMojo.execute(ExecJavaMojo.java:346) >>> at >>> org.apache.maven.plugin.DefaultPluginManager.executeMojo(DefaultPluginManager.java:490) >>> at >>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoals(DefaultLifecycleExecutor.java:694) >>> ... 17 more >>> Caused by: java.lang.reflect.InvocationTargetException >>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >>> at >>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) >>> at >>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) >>> at java.lang.reflect.Method.invoke(Method.java:616) >>> at org.codehaus.mojo.exec.ExecJavaMojo$1.run(ExecJavaMojo.java:291) >>> at java.lang.Thread.run(Thread.java:679) >>> Caused by: java.lang.NumberFormatException: For input string: "Toy Story >>> (1995)" >>> at >>> java.lang.NumberFormatException.forInputString(NumberFormatException.java:65) >>> at java.lang.Long.parseLong(Long.java:438) >>> at java.lang.Long.parseLong(Long.java:478) >>> at >>> org.apache.mahout.cf.taste.impl.model.file.FileDataModel.readItemIDFromString(FileDataModel.java:616) >>> at >>> org.apache.mahout.cf.taste.impl.model.file.FileDataModel.processLineWithoutID(FileDataModel.java:545) >>> at >>> org.apache.mahout.cf.taste.impl.model.file.FileDataModel.processFileWithoutID(FileDataModel.java:519) >>> at >>> org.apache.mahout.cf.taste.impl.model.file.FileDataModel.buildModel(FileDataModel.java:261) >>> at >>> org.apache.mahout.cf.taste.impl.model.file.FileDataModel.reload(FileDataModel.java:207) >>> at >>> org.apache.mahout.cf.taste.impl.model.file.FileDataModel.<init>(FileDataModel.java:193) >>> at >>> org.apache.mahout.cf.taste.impl.model.file.FileDataModel.<init>(FileDataModel.java:148) >>> at >>> org.apache.mahout.cf.taste.example.grouplens.GroupLensDataModel.<init>(GroupLensDataModel.java:51) >>> at >>> org.apache.mahout.cf.taste.example.grouplens.GroupLensRecommenderEvaluatorRunner.main(GroupLensRecommenderEvaluatorRunner.java:47) >>> ... 6 more >>> [INFO] >>> ------------------------------------------------------------------------ >>> [INFO] Total time: 2 seconds >>> [INFO] Finished at: Fri Oct 21 14:00:20 PDT 2011 >>> [INFO] Final Memory: 23M/127M >>> [INFO] >>> ------------------------------------------------------------------------ >>> >>> >>> On Fri, Oct 21, 2011 at 1:59 PM, Yang <[email protected]> wrote: >>>> could you please give a list (or document in wiki, or I could edit it >>>> ), which mvn exec job requires exactly which file? >>>> >>>> thanks >>>> Yang >>>> >>>> On Fri, Oct 21, 2011 at 1:56 PM, Sean Owen <[email protected]> wrote: >>>>> Um, you are running the Netflix example on the GroupLens data set. Of >>>>> course this doesn't work. >>>>> >>>>> On Fri, Oct 21, 2011 at 9:54 PM, Yang <[email protected]> wrote: >>>>>> I got the following errors when running: >>>>>> >>>>>> $ mvn -X -e -q exec:java >>>>>> -Dexec.mainClass="org.apache.mahout.cf.taste.example.netflix.NetflixRecommenderEvaluatorRunner" >>>>>> -Dexec.args="-i /home/yyang/work/mahout/trunk/examples/data/ml-1m/" >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> [DEBUG] joining on thread >>>>>> Thread[org.apache.mahout.cf.taste.example.netflix.NetflixRecommenderEvaluatorRunner.main(),5,org.apache.mahout.cf.taste.example.netflix.NetflixRecommenderEvaluatorRunner] >>>>>> [DEBUG] Setting accessibility to true in order to invoke main(). >>>>>> 11/10/21 13:52:01 INFO netflix.NetflixDataModel: Creating >>>>>> NetflixDataModel for directory: >>>>>> /home/yyang/work/mahout/trunk/examples/data/ml-1m >>>>>> 11/10/21 13:52:01 INFO netflix.NetflixDataModel: Reading preference >>>>>> data... >>>>>> [INFO] >>>>>> ------------------------------------------------------------------------ >>>>>> [ERROR] BUILD ERROR >>>>>> [INFO] >>>>>> ------------------------------------------------------------------------ >>>>>> [INFO] An exception occured while executing the Java class. null >>>>>> >>>>>> [INFO] >>>>>> ------------------------------------------------------------------------ >>>>>> [DEBUG] Trace >>>>>> org.apache.maven.lifecycle.LifecycleExecutionException: An exception >>>>>> occured while executing the Java class. null >>>>>> at >>>>>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoals(DefaultLifecycleExecutor.java:719) >>>>>> at >>>>>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeStandaloneGoal(DefaultLifecycleExecutor.java:569) >>>>>> at >>>>>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoal(DefaultLifecycleExecutor.java:539) >>>>>> at >>>>>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoalAndHandleFailures(DefaultLifecycleExecutor.java:387) >>>>>> at >>>>>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeTaskSegments(DefaultLifecycleExecutor.java:348) >>>>>> at >>>>>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.execute(DefaultLifecycleExecutor.java:180) >>>>>> at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:328) >>>>>> at org.apache.maven.DefaultMaven.execute(DefaultMaven.java:138) >>>>>> at org.apache.maven.cli.MavenCli.main(MavenCli.java:362) >>>>>> at >>>>>> org.apache.maven.cli.compat.CompatibleMain.main(CompatibleMain.java:60) >>>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >>>>>> at >>>>>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) >>>>>> at >>>>>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) >>>>>> at java.lang.reflect.Method.invoke(Method.java:616) >>>>>> at >>>>>> org.codehaus.classworlds.Launcher.launchEnhanced(Launcher.java:315) >>>>>> at org.codehaus.classworlds.Launcher.launch(Launcher.java:255) >>>>>> at >>>>>> org.codehaus.classworlds.Launcher.mainWithExitCode(Launcher.java:430) >>>>>> at org.codehaus.classworlds.Launcher.main(Launcher.java:375) >>>>>> Caused by: org.apache.maven.plugin.MojoExecutionException: An >>>>>> exception occured while executing the Java class. null >>>>>> at >>>>>> org.codehaus.mojo.exec.ExecJavaMojo.execute(ExecJavaMojo.java:346) >>>>>> at >>>>>> org.apache.maven.plugin.DefaultPluginManager.executeMojo(DefaultPluginManager.java:490) >>>>>> at >>>>>> org.apache.maven.lifecycle.DefaultLifecycleExecutor.executeGoals(DefaultLifecycleExecutor.java:694) >>>>>> ... 17 more >>>>>> Caused by: java.lang.reflect.InvocationTargetException >>>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >>>>>> at >>>>>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) >>>>>> at >>>>>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) >>>>>> at java.lang.reflect.Method.invoke(Method.java:616) >>>>>> at >>>>>> org.codehaus.mojo.exec.ExecJavaMojo$1.run(ExecJavaMojo.java:291) >>>>>> at java.lang.Thread.run(Thread.java:679) >>>>>> Caused by: java.lang.NullPointerException >>>>>> at >>>>>> org.apache.mahout.cf.taste.example.netflix.NetflixDataModel.readUsers(NetflixDataModel.java:79) >>>>>> at >>>>>> org.apache.mahout.cf.taste.example.netflix.NetflixDataModel.<init>(NetflixDataModel.java:68) >>>>>> at >>>>>> org.apache.mahout.cf.taste.example.netflix.NetflixRecommenderEvaluatorRunner.main(NetflixRecommenderEvaluatorRunner.java:44) >>>>>> ... 6 more >>>>>> [INFO] >>>>>> ------------------------------------------------------------------------ >>>>>> [INFO] Total time: 2 seconds >>>>>> [INFO] Finished at: Fri Oct 21 13:52:01 PDT 2011 >>>>>> [INFO] Final Memory: 23M/127M >>>>>> [INFO] >>>>>> ------------------------------------------------------------------------ >>>>>> >>>>>> >>>>>> On Fri, Oct 21, 2011 at 1:21 AM, Sean Owen <[email protected]> wrote: >>>>>>> It depends on which example you are running! For GroupLens, yes, it's >>>>>>> the movies.dat file. >>>>>>> >>>>>>> I will warn you that I think the various GroupLens data sets have >>>>>>> different input formats. Some are comma-separated, which is what you >>>>>>> need, and some are separated by "::". Open the file and look. If it is >>>>>>> the latter, you need to change it to comma-separated form. >>>>>>> >>>>>>> On Thu, Oct 20, 2011 at 11:41 PM, Yang <[email protected]> wrote: >>>>>>>> in the recommendation examples >>>>>>>> >>>>>>>> https://cwiki.apache.org/confluence/display/MAHOUT/RecommendationExamples >>>>>>>> >>>>>>>> it mentions the "-i input_file " parameter, >>>>>>>> >>>>>>>> but the downloaded movie-lens dataset is a zip, and contains multiple >>>>>>>> .dat files, which one should I use? >>>>>>>> >>>>>>>> Thanks >>>>>>>> Yang >>>>>>>> >>>>>>> >>>>>> >>>>>> >>>>>> how do I debug this? >>>>>> >>>>> >>>> >>> >> >
