Re: mahout failing with -c as required option

2015-03-10 Thread Suneel Marthi
Try ./mahout kmeans -i http://master:50070/explorer.html#/user/netlog/upload/output4/tfidf-vectors/part-r-0 -o /usr/netlog/upload/output4/tfidf-vectors-kmeans-clusters -c some-folder -dm org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure -x 5 -ow -cl -k 25 I don't have a

Re: mahout failing with -c as required option

2015-03-10 Thread Suneel Marthi
Oops! I meant to say that -c is required for the random centroid initialization if -k is specified. It initializes k random centroids in the folder specified by -c. So yes, -c is required. On Tue, Mar 10, 2015 at 1:42 AM, Raghuveer alwaysra...@yahoo.com.invalid wrote: No, I have removed the -c
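
To illustrate what "initializes k random centroids" means in the reply above, here is a minimal sketch in plain Python. It is not Mahout's code: the function name `init_random_centroids` and the toy `points` are hypothetical, and it only assumes that random seeding amounts to picking k of the input vectors as starting centroids.

```python
import random


def init_random_centroids(points, k, seed=None):
    """Pick k distinct input points to act as the initial centroids.

    Hypothetical helper for illustration only; not part of Mahout.
    """
    if k > len(points):
        raise ValueError("k cannot exceed the number of points")
    rng = random.Random(seed)
    # sample() draws without replacement, so no point is chosen twice
    return [list(p) for p in rng.sample(points, k)]


# Toy input vectors (made up for this example)
points = [[0.0, 0.1], [0.2, 0.0], [5.0, 5.1], [5.2, 4.9], [9.0, 0.0]]
centroids = init_random_centroids(points, k=2, seed=42)
print(centroids)
```

In the thread's terms, k plays the role of -k, and the returned list stands in for what would be written to the folder given by -c before the k-means iterations start.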

Re: mahout failing with -c as required option

2015-03-10 Thread Raghuveer
OK, so if -c is required, how can I give it, or at least is there a way to remove -k itself? ./mahout kmeans -i http://master:50070/explorer.html#/user/netlog/upload/output4/tfidf-vectors/part-r-0 -o /usr/netlog/upload/output4/tfidf-vectors-kmeans-clusters -dm

Re: mahout failing with -c as required option

2015-03-10 Thread Raghuveer
I see the error below: Running on hadoop, using /usr/local/hadoop/bin/hadoop and HADOOP_CONF_DIR= MAHOUT-JOB: /home/raghuveer/trunk/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar 15/03/10 11:50:20 INFO common.AbstractJob: Command line arguments: {--clustering=null,

Re: mahout failing with -c as required option

2015-03-10 Thread Raghuveer
I see the error below: On Tuesday, March 10, 2015 11:45 AM, Suneel Marthi suneel.mar...@gmail.com wrote: Try ./mahout kmeans -i http://master:50070/explorer.html#/user/netlog/upload/output4/tfidf-vectors/part-r-0 -o /usr/netlog/upload/output4/tfidf-vectors-kmeans-clusters -c

Re: implementation of context-aware recommender in Mahout

2015-03-10 Thread Efi Koulouri
Things got clearer with your help! Thank you very much. On 9 March 2015 at 01:50, Ted Dunning ted.dunn...@gmail.com wrote: Efi, Only you can really tell which is best for your efforts. All the rest is our own partially informed opinions. Pre-filtering can often be accomplished in the

Re: implementation of context-aware recommender in Mahout

2015-03-10 Thread Ted Dunning
Glad to help. You can help us by reporting your results when you get them. We look forward to that! On Tue, Mar 10, 2015 at 4:22 AM, Efi Koulouri ekoulou...@gmail.com wrote: Things got clearer with your help! Thank you very much. On 9 March 2015 at 01:50, Ted Dunning

Re: mahout spark-itemsimilarity from command line

2015-03-10 Thread Jeff Isenhart
OK, so the solution to the issue was to add the following to my core-site.xml: <!-- Added to try and solve mahout issue claiming 'No FileSystem for schema: hdfs' --> <property> <name>fs.file.impl</name> <value>org.apache.hadoop.fs.LocalFileSystem</value> <description>The FileSystem for file:

spark-item-similarity incremental update

2015-03-10 Thread Kevin Zhang
Hi, Does anybody have any idea how to do an incremental update for the item similarity? I mean, how can I apply the latest user action data, for example today's data? Do I have to run it again for the entire dataset? Thanks, Kevin

Re: spark-item-similarity incremental update

2015-03-10 Thread Pat Ferrel
The latest user actions work just fine as the query against the last time you ran spark-itemsimilarity. Go to the Demo site https://guide.finderbots.com and run through the “trainer”; those things you pick are instantly used to make recs. spark-itemsimilarity was not re-run. The only time you
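
A rough sketch of the idea in the reply above: the similarity output from the last batch run stays fixed, and a user's most recent actions are used as the query against it. This is plain Python with made-up data, not the spark-itemsimilarity API; `recommend`, `similar_items`, and the item names are all hypothetical.

```python
from collections import defaultdict


def recommend(similar_items, history, top_n=3):
    """Score unseen items by summing their similarity to items in history.

    similar_items: {item: {other_item: strength}} from the last batch run.
    history: the user's most recent actions, used as the query.
    """
    seen = set(history)
    scores = defaultdict(float)
    for item in history:
        for other, strength in similar_items.get(item, {}).items():
            if other not in seen:
                scores[other] += strength
    # Highest score first; ties broken by item name for a stable order
    return sorted(scores, key=lambda i: (-scores[i], i))[:top_n]


# Precomputed similarities (hypothetical values); never rebuilt below
similar_items = {
    "iphone": {"ipad": 0.9, "case": 0.6},
    "ipad": {"iphone": 0.9, "stylus": 0.5},
    "galaxy": {"case": 0.4},
}

# Today's actions for one user act as the query
recs = recommend(similar_items, ["iphone", "galaxy"])
print(recs)
```

Only the `history` argument changes between requests, which is why new user actions do not by themselves require recomputing the similarities.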

Re: spark-item-similarity incremental update

2015-03-10 Thread Pat Ferrel
Just to be clear #1 was about new items, not users. New users will work as long as you have history for them. On Mar 10, 2015, at 3:34 PM, Kevin Zhang zhangyongji...@yahoo.com.INVALID wrote: I see. Thank you, Pat. On Tuesday, March 10, 2015 3:17 PM, Pat Ferrel p...@occamsmachete.com

Re: spark-item-similarity incremental update

2015-03-10 Thread Kevin Zhang
I see. Thank you, Pat. On Tuesday, March 10, 2015 3:17 PM, Pat Ferrel p...@occamsmachete.com wrote: The latest user actions work just fine as the query against the last time you ran spark-itemsimilarity. Go to the Demo site https://guide.finderbots.com and run through the “trainer”