Oh, it just occurred to me I should have said that /usr/local/melk/mahout
is a checkout of http://svn.apache.org/repos/asf/lucene/mahout/trunk/.
When I got to step 2, which says...

> 2. Unpack the archive and copy movies.dat and ratings.dat to
> trunk/taste-web/src/main/resources/org/apache/mahout/cf/taste/example/grouplens
> under the Mahout distribution directory.

...I assumed the instructions had left out the step of running
"svn checkout http://svn.apache.org/repos/asf/lucene/mahout/trunk/".
Was this assumption incorrect? I did have to
"mkdir -p trunk/taste-web/src/main/resources/org/apache/mahout/cf/taste/example/grouplens"
before I could copy the .dat files there as the
trunk/taste-web/src/main/resources directory of the checkout doesn't
contain anything in it. Did I go off on the wrong track?

On Wed, Apr 1, 2009 at 4:27 PM, Joshua Bronson <[email protected]> wrote:

> You mean you're supposed to do step 4 *before* step 8?!? ;p
> I did run mvn install, and though I got a bunch of warnings like the
> following:
>
>     [WARNING] Entry: mahout-0.2-SNAPSHOT/usr/local/melk/mahout/core/src/main/java/org/apache/mahout/cf/taste/impl/common/ longer than 100 characters.
>
> after a couple hours it said it completed successfully:
>
>     [INFO] ------------------------------------------------------------------------
>     [INFO] Reactor Summary:
>     [INFO] ------------------------------------------------------------------------
>     [INFO] Mahout core ........................................... SUCCESS [8:46.665s]
>     [INFO] Mahout Taste Webapp ................................... SUCCESS [55.496s]
>     [INFO] Mahout examples ....................................... SUCCESS [55.317s]
>     [INFO] Apache Lucene Mahout .................................. SUCCESS [2:02:03.392s]
>     [INFO] ------------------------------------------------------------------------
>     [INFO] ------------------------------------------------------------------------
>     [INFO] BUILD SUCCESSFUL
>     [INFO] ------------------------------------------------------------------------
>     [INFO] Total time: 132 minutes 41 seconds
>     [INFO] Finished at: Wed Apr 01 00:59:27 EDT 2009
>     [INFO] Final Memory: 61M/80M
>     [INFO] ------------------------------------------------------------------------
>
> So I proceeded through steps 5, 6, and 7, and then step 8's "mvn package"
> command failed with the output I linked to.
>
> Just for the heck of it I tried "mvn install" again (from the top-level
> directory) and after getting a bunch of the "longer-than-100-characters"
> warnings again, this time after 7 minutes it failed with:
>
>     [ERROR] BUILD ERROR
>     [INFO] ------------------------------------------------------------------------
>     [INFO] Failed to create assembly: Error creating assembly archive project: A tar file cannot include itself.
>
> I posted the full transcript of my console session at
> http://melkjug.org/_static/grouplens-install-log.txt. Seems like something
> funky's going on with tar, but I'm not sure what.
>
> On Wed, Apr 1, 2009 at 12:11 PM, Grant Ingersoll <[email protected]> wrote:
>
>> Do a "mvn install" from the top level directory first:
>> http://lucene.apache.org/mahout/taste.html#demo
>>
>> HTH,
>> Grant
>>
>> On Apr 1, 2009, at 11:35 AM, Joshua Bronson wrote:
>>
>>> Thanks all for the good info. Taste definitely sounds like a promising
>>> direction for us to go in for our recommendation service.
>>> I'm working through the installation of the GroupLens demo, but the mvn
>>> package step is failing with the output at
>>> http://paste.pocoo.org/show/110618/. Haven't looked into this yet, just
>>> thought I'd post to the list first with my progress. If anyone else uses
>>> IRC, I've created (and am currently the only one in) the #mahout channel
>>> on freenode. Hope to see some of you in there!
>>>
>>> Josh
>>>
>>> On Wed, Apr 1, 2009 at 5:48 AM, Sean Owen <[email protected]> wrote:
>>>
>>>> Couple clarifications -
>>>>
>>>> The CF components are oriented to on-line, real-time use, though of
>>>> course one can trivially build a batch job out of that. That is what I
>>>> did with the EC2 image that cranks out recommendations for all users.
>>>>
>>>> The CF component is also already parallelized as much as is practical.
>>>> There are already Hadoop jobs for parallel, batch operation.
>>>>
>>>> Finally if you have some external notion of item similarity, like text
>>>> similarity between articles, you can and should include this info by
>>>> creating an ItemSimilarity with this knowledge. In that case you want
>>>> to use an item-based recommender, since it is only in such a case that
>>>> item-based recommenders have a distinct advantage.
>>>>
>>>> On Apr 1, 2009 10:32 AM, "Otis Gospodnetic" <[email protected]> wrote:
>>>>
>>>> it's the former. Taste is still not parallelized, but other parts of
>>>> Mahout are, and they make use of Hadoop.
>>
>> --------------------------
>> Grant Ingersoll
>> http://www.lucidimagination.com/
>>
>> Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using
>> Solr/Lucene:
>> http://www.lucidimagination.com/search
