Try
./mahout kmeans -i
http://master:50070/explorer.html#/user/netlog/upload/output4/tfidf-vectors/part-r-0
-o /usr/netlog/upload/output4/tfidf-vectors-kmeans-clusters -c
some-folder -dm
org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure -x 5 -ow
-cl -k 25
I don't have a
Oops! I meant to say that -c is required for the random centroid
initialization if -k is specified.
It initializes k random centroids in the folder specified by -c. so yes -c
is required.
On Tue, Mar 10, 2015 at 1:42 AM, Raghuveer alwaysra...@yahoo.com.invalid
wrote:
No i have removed the -c
ok so if -c is required then how can i give it or atleast is there a way to
remove -k itself?
./mahout kmeans -i
http://master:50070/explorer.html#/user/netlog/upload/output4/tfidf-vectors/part-r-0
-o /usr/netlog/upload/output4/tfidf-vectors-kmeans-clusters -dm
I see the error below:
Running on hadoop, using /usr/local/hadoop/bin/hadoop and HADOOP_CONF_DIR=
MAHOUT-JOB:
/home/raghuveer/trunk/examples/target/mahout-examples-1.0-SNAPSHOT-job.jar
15/03/10 11:50:20 INFO common.AbstractJob: Command line arguments:
{--clustering=null,
I see the error below:
On Tuesday, March 10, 2015 11:45 AM, Suneel Marthi
suneel.mar...@gmail.com wrote:
Try
./mahout kmeans -i
http://master:50070/explorer.html#/user/netlog/upload/output4/tfidf-vectors/part-r-0
-o /usr/netlog/upload/output4/tfidf-vectors-kmeans-clusters -c
Things got clearier with your help!
Thank you very much
On 9 March 2015 at 01:50, Ted Dunning ted.dunn...@gmail.com wrote:
Efi,
Only you can really tell which is best for your efforts. All the rest is
our own partially informed opinions.
Pre-filtering can often be accomplished in the
Glad to help.
You can help us by reporting your results when you get them.
We look forward to that!
On Tue, Mar 10, 2015 at 4:22 AM, Efi Koulouri ekoulou...@gmail.com wrote:
Things got clearier with your help!
Thank you very much
On 9 March 2015 at 01:50, Ted Dunning
OK, so the solution to the issue was to add the following to my core-site.xml
!-- Added to try and solve mahout issue claiming 'No FileSystem for schema:
hdfs' --property namefs.file.impl/name
valueorg.apache.hadoop.fs.LocalFileSystem/value descriptionThe
FileSystem for file:
Hi,
Does anybody have any idea about how to do incremental update for the item
similarity? I mean how I can apply latest user action data for example today's
data? Do I have to run it again for the entire dataset?
Thanks,
Kevin
The latest user actions work just fine as the query against the last time you
ran spark-itemsimilairty. Go to the Demo site https://guide.finderbots.com and
run through the “trainer” those things you pick are instantly used to make
recs. spark-itemsimilarity was not re-run. The only time you
Just to be clear #1 was about new items, not users. New users will work as long
as you have history for them.
On Mar 10, 2015, at 3:34 PM, Kevin Zhang zhangyongji...@yahoo.com.INVALID
wrote:
I see. Thank you, Pat.
On Tuesday, March 10, 2015 3:17 PM, Pat Ferrel p...@occamsmachete.com
I see. Thank you, Pat.
On Tuesday, March 10, 2015 3:17 PM, Pat Ferrel p...@occamsmachete.com wrote:
The latest user actions work just fine as the query against the last time you
ran spark-itemsimilairty. Go to the Demo site https://guide.finderbots.com and
run through the “trainer”
12 matches
Mail list logo