[RESULT] [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-04-16 Thread Andrew Musselman
> > > > > > > > > > > > > > > > > > > ____ > > From: Andrew Musselman <andrew.mussel...@gmail.com> > > Sent: Saturday, April 15, 2017 2:48:17 AM > > To: user@mahout.apache.org; d...@mahout

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-04-15 Thread Andrew Musselman
Hashes and sigs confirmed, bin and src (viennacl and viennacl-omp) artifacts run the spark shell and the sparse drm test fine, and kick off the GPU. +1 (binding) On Fri, Apr 14, 2017 at 10:25 PM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > This is the vote for relea

[VOTE] Apache Mahout 0.13.0 Release Candidate

2017-04-14 Thread Andrew Musselman
This is the vote for release 0.13.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Monday, April 17th, 2017 or once there are at least 3 PMC +1 binding votes (whichever occurs earlier). Please download, test and vote with [ ] +1, accept RC as the official

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-04-14 Thread Andrew Musselman
Cancelling this vote, one more coming out shortly. On Fri, Apr 14, 2017 at 7:38 PM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > Yeah I ripped it out today. > > Can put it back, or we can have this patch in the release notes and get a > .1 release out in a couple we

[VOTE] Apache Mahout 0.13.0 Release Candidate

2017-04-14 Thread Andrew Musselman
This is the vote for release 0.13.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Monday, April 17th, 2017 or once there are at least 3 PMC +1 binding votes (whichever occurs earlier). Please download, test and vote with [ ] +1, accept RC as the official

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-04-12 Thread Andrew Musselman
store_db LICENSE.txt NOTICE.txt > mahout-examples-0.13.0.jar README.md mahout-examples-0.13.0-job.jar > viennacl mahout-hdfs-0.13.0.jar viennacl-omp > > > From: Andrew Musselman <andrew.mussel...@gmail.com>

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-04-11 Thread Andrew Musselman
source, and saw all cores on the CPU get exercised when running the viennacl-omp profile from source. So far I'm +1 (binding). On Tue, Apr 11, 2017 at 8:55 AM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > This is the vote for release 0.13.0 of Apache Mahout. > > The vot

Strangeloop CFP

2017-04-10 Thread Andrew Musselman
https://thestrangeloop.com/cfp.html I'll submit something like what we're doing at MLConf.

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-27 Thread Andrew Musselman
02022/rawkintrevo > http://trevorgrant.org > > *"Fortunate is he, who is able to know the causes of things." -Virgil* > > > On Mon, Mar 27, 2017 at 2:42 PM, Andrew Musselman < > andrew.mussel...@gmail.com> wrote: > > > Hashes and sigs look good to me;

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-27 Thread Andrew Musselman
Hashes and sigs look good to me; please someone else confirm. On Mon, Mar 27, 2017 at 10:40 AM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > This is the vote for release 0.13.0 of Apache Mahout. > > The vote will be going for at least 72 hours and will be closed on &g

[VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-27 Thread Andrew Musselman
This is the vote for release 0.13.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Thursday, March 30rd, 2017 or once there are at least 3 PMC +1 binding votes (whichever occurs earlier). Please download, test and vote with [ ] +1, accept RC as the

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-21 Thread Andrew Musselman
Sigs and hashes check out for me. On Tue, Mar 21, 2017 at 9:17 AM, Andrew Musselman <a...@apache.org> wrote: > This is the vote for release 0.13.0 of Apache Mahout. > > The vote will be going for at least 72 hours and will be closed on Friday, > March 26th, 2017 or once there

[VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-21 Thread Andrew Musselman
This is the vote for release 0.13.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Friday, March 26th, 2017 or once there are at least 3 PMC +1 binding votes (whichever occurs earlier). Please download, test and vote with [ ] +1, accept RC as the official

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-16 Thread Andrew Musselman
Cancelling vote due to https://issues.apache.org/jira/browse/MAHOUT-1955 On Wed, Mar 15, 2017 at 8:55 AM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > Correction, vote is until Friday, March 17th. > > On Tue, Mar 14, 2017 at 8:45 PM, Andrew Musselman < > andre

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-15 Thread Andrew Musselman
Correction, vote is until Friday, March 17th. On Tue, Mar 14, 2017 at 8:45 PM, Andrew Musselman < andrew.mussel...@gmail.com> wrote: > This is the vote for release 0.13.0 of Apache Mahout. > > The vote will be going for at least 72 hours and will be closed on Friday, > March

[VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-14 Thread Andrew Musselman
This is the vote for release 0.13.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Friday, March 3rd, 2017 or once there are at least 3 PMC +1 binding votes (whichever occurs earlier). Please download, test and vote with [ ] +1, accept RC as the official

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-14 Thread Andrew Musselman
Yes, if we're ready I can cut another one today. On Tue, Mar 14, 2017 at 3:13 PM, Pat Ferrel <p...@occamsmachete.com> wrote: > The release was not made due to broken drivers, now fixed. I assume a new > RC will come shortly? > > > On Mar 11, 2017, at 9:54 PM, Andrew Musse

[VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-11 Thread Andrew Musselman
This is the vote for release 0.13.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Friday, March 3rd, 2017 or once there are at least 3 PMC +1 binding votes (whichever occurs earlier). Please download, test and vote with [ ] +1, accept RC as the official

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-02 Thread Andrew Musselman
Confirmed hashes and sigs, tested operations in the shell in src and bin artifacts. Would like someone else to check sigs too. +1 (binding) On Wed, Mar 1, 2017 at 9:39 PM, Andrew Musselman <andrew.mussel...@gmail.com > wrote: > New RC for 0.13.0 release out; please try out the new

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-01 Thread Andrew Musselman
at 11:45 AM, Andrew Palumbo <ap@outlook.com> wrote: > I will verify keys tonight. > > > > Sent from my Verizon Wireless 4G LTE smartphone > > > Original message ---- > From: Andrew Musselman <andrew.mussel...@gmail.com> > Date:

Fwd: Mahout Compatibility With Hortonworks Sandbox

2017-03-01 Thread Andrew Musselman
Hi Shengfa, thanks for reaching out; I'm forwarding to the user and dev lists so more people can take a look. We're in the middle of a release this week so responses might be a bit delayed, but we'll help however we can. Thanks -- Forwarded message -- From: Shengfa Lin

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-01 Thread Andrew Musselman
Nevermind, that was before building the src distro. Shell works fine with src and binary distros. On Wed, Mar 1, 2017 at 9:39 AM, Andrew Musselman <andrew.mussel...@gmail.com > wrote: > I'm getting this when starting the spark-shell on a Mac: > > Loading /Users/andrew.musselman/D

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-01 Thread Andrew Musselman
) ^ :21: error: not found: value sc2sdc implicit val sdc: org.apache.mahout.sparkbindings.SparkDistributedContext = sc2sdc(sc) On Wed, Mar 1, 2017 at 9:21 AM, Andrew Musselman <a...@apache.org> wrote: > I've confirmed hashes and sigs; if someone other than me could co

Re: [VOTE] Apache Mahout 0.13.0 Release Candidate

2017-03-01 Thread Andrew Musselman
after running some tests. On Tue, Feb 28, 2017 at 10:58 PM, Andrew Musselman <a...@apache.org> wrote: > This is the vote for release 0.13.0 of Apache Mahout. > > The vote will be going for at least 72 hours and will be closed on Friday, > March 3rd, 2017 or once there are

[VOTE] Apache Mahout 0.13.0 Release Candidate

2017-02-28 Thread Andrew Musselman
This is the vote for release 0.13.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Friday, March 3rd, 2017 or once there are at least 3 PMC +1 binding votes (whichever occurs earlier). Please download, test and vote with [ ] +1, accept RC as the official

Re: Starter Issues

2017-02-01 Thread Andrew Musselman
Adding user. On Wed, Feb 1, 2017 at 4:14 PM Andrew Palumbo wrote: > bump after JIRA Issue planning email-blast. > > > From: Trevor Grant > Sent: Wednesday, February 1, 2017 5:01:21 PM > To: d...@mahout.apache.org >

Re: Mahout ML vs Spark Mlib vs Mahout-Spark integreation

2016-09-17 Thread Andrew Musselman
Mahout has changed a lot in the past couple years, becoming more focused on serving the needs of data workers and scientists who need to experiment with large matrix math problems. To that end we've broadened the execution engines that perform the distribution of computation to include Spark and

Re: "lucene.vector" function implementation

2016-09-17 Thread Andrew Musselman
-with-shell.html for some tips on setting environment variables, e.g.: export MAHOUT_HOME=[directory into which you checked out Mahout] export SPARK_HOME=[directory where you unpacked Spark] export MASTER=[url of the Spark master] On Fri, Sep 16, 2016 at 11:06 PM, Andrew Musselman < andrew.mussel...@gmail.

Re: "lucene.vector" function implementation

2016-09-17 Thread Andrew Musselman
RM <reth.ik...@gmail.com> wrote: > The functionality itself works on latest mahout_master branch(0.12.3), so > its likely moved to other class-package structure. Any pointers to which > class is it moved? > > I can file a jira to update docs. > > On Fri, Sep 16, 2016 a

Re: "lucene.vector" function implementation

2016-09-16 Thread Andrew Musselman
We are up to version 0.12 so 0.9 is well out of date. That may have moved into another package or been deleted, not sure. It may indicate we need to update that page; thank you for letting us know and if you'd like to file a bug in JIRA please do. On Friday, September 16, 2016, Reth RM

Re: [VOTE] Mahout 0.12.2 Release Candidate 2

2016-06-10 Thread Andrew Musselman
Signatures and hashes are correct; +1 (binding). On Fri, Jun 10, 2016 at 6:05 PM, Suneel Marthi wrote: > Verified {bin} * {zip,tar} - ran tests, tests pass > > Verified {src} * {zip,tar} - rant tests, tests pass > > Here's my +1 (binding) > > On Fri, Jun 10, 2016 at 8:59 PM,

Re: [VOTE] Apache Mahout 0.12.2 Release Candidate

2016-06-10 Thread Andrew Musselman
Signatures and hashes look good; built from source tarball and all tests pass. +1 (binding) On Fri, Jun 10, 2016 at 2:25 PM, Suneel Marthi wrote: > Verified {bin} * {zip,tar} - ran tests, tests pass > > Verified {src} * {zip,tar} - rant tests, tests pass > > Here's my +1

Re: Stickers

2016-06-03 Thread Andrew Musselman
ache projects r already working with > stickermule. > > > > > > > > On Thu, Jun 2, 2016 at 10:12 PM, Andrew Musselman < > > andrew.mussel...@gmail.com <javascript:;>> wrote: > > > >> The link to buy: > >> https://www.stick

Re: Stickers

2016-06-02 Thread Andrew Musselman
The link to buy: https://www.stickermule.com/en/marketplace/13179-apache-mahout On Thu, Jun 2, 2016 at 7:01 PM, Andrew Musselman <andrew.mussel...@gmail.com > wrote: > > https://www.stickermule.com/artworks/755889?token=cded79155151fd30df04ad7f2a37cbc3 > > On Thu, Jun 2, 2016

Re: Welcome Trevor Grant as a new Mahout Committer

2016-05-24 Thread Andrew Musselman
Welcome Trevor! On Mon, May 23, 2016 at 5:39 PM, Andrew Palumbo wrote: > In recognition of Trevor Grant's contributions to the Mahout project > notably his Zeppelin Integration work, the PMC has invited and is pleased > to announce that he has accepted our invitation to

Re: [VOTE] Apache Mahout 0.12.1 Release

2016-05-18 Thread Andrew Musselman
Sigs and hashes are good; +1 (binding). On Wed, May 18, 2016 at 3:53 PM, Andrew Palumbo wrote: > +1 (binding) tested a clean source build. > > > From: Suneel Marthi > Sent: Wednesday, May 18, 2016 6:23:57 PM > To:

Apachecon

2016-05-05 Thread Andrew Musselman
Anyone else going to Vancouver next week, let us know; Suneel and I will both be there for starters.

Re: Congratulations to our new Chair

2016-04-20 Thread Andrew Musselman
Suneel, thanks your great work as Chair and thank you Andy for stepping in! On Wed, Apr 20, 2016 at 5:00 PM, Dmitriy Lyubimov wrote: > congrats! > > On Wed, Apr 20, 2016 at 4:55 PM, Suneel Marthi wrote: > > > Please join me in congratulating Andrew

Re: [VOTE] Apache Mahout 0.12.0 Release Candidate

2016-04-11 Thread Andrew Musselman
ark-document-classifier.mscala script. > +1 > > ____ > From: Andrew Musselman <andrew.mussel...@gmail.com> > Sent: Monday, April 11, 2016 12:43 PM > To: d...@mahout.apache.org > Cc: user@mahout.apache.org > Subject: Re: [VOTE] Apach

Re: [VOTE] Apache Mahout 0.12.0 Release Candidate

2016-04-11 Thread Andrew Musselman
Sigs and hashes are correct, running a build and examples next. On Mon, Apr 11, 2016 at 8:38 AM, Suneel Marthi wrote: > Ran a complete build on {src} * {zip, tar} and verified that all tests > pass. > > Tested Spark Shell > > All Flink tests pass > > +1 (binding) > > On

Re: [VOTE] Apache Mahout 0.12.0 Release Candidate

2016-04-10 Thread Andrew Musselman
-1 Problem found during testing. On Sun, Apr 10, 2016 at 7:29 PM, Suneel Marthi wrote: > This is the vote for release 0.12.0 of Apache Mahout that adds Apache Flink > as a execution engine to the Samsara Linear Algebra framework. > > The vote will run for 24 hours and will

Re: Removing MAHOUT_LOCAL option

2016-03-21 Thread Andrew Musselman
I haven't but if you'd like to try it out and report back I'd love to hear about it. The mr jobs are staying in for now, no active move to remove them. On Mon, Mar 21, 2016 at 12:20 AM, David Starina wrote: > Has anyone tried to run the deprecated MapReduce code on

Re: Removing MAHOUT_LOCAL option

2016-03-20 Thread Andrew Musselman
n Mar 20, 2016, at 11:04 AM, Andrew Musselman <andrew.mussel...@gmail.com > <javascript:;>> wrote: > > To clarify, the MAHOUT_LOCAL option only works for legacy Hadoop > MapReduce-based jobs which officially became deprecated in 0.10.0. > > On Sun, Mar 20, 2016 at 10:

Re: Removing MAHOUT_LOCAL option

2016-03-20 Thread Andrew Musselman
st deployment and it’s really helpful for small > local processing > > > > Have a great weekend! > > Mihai > > > >> On 20 Mar 2016, at 06:13, Suneel Marthi <suneel.mar...@gmail.com > <javascript:;>> wrote: > >> > >> +1

Removing MAHOUT_LOCAL option

2016-03-19 Thread Andrew Musselman
We're discussing removing the MAHOUT_LOCAL option in order to trim artifact sizes. If you think keeping the option to use MAHOUT_LOCAL for testing with the single-node mode of Hadoop is important please let us know. It can be handy for trying things out but it would be nice to ditch the effort

Re: [VOTE] Apache Mahout 0.11.2 Release Candidate

2016-03-11 Thread Andrew Musselman
Checked sigs and hashes, ran some examples, all build tests pass. +1 binding On Fri, Mar 11, 2016 at 3:03 PM, Suneel Marthi wrote: > Checked {src} * {zip, tar}, ran a clean build and all tests pass. > > +1 > > On Fri, Mar 11, 2016 at 5:17 PM, Suneel Marthi

Re: New Mahout "Samsara" Book

2016-02-25 Thread Andrew Musselman
Yes congrats guys! I got my Kindle copy and it looks great. On Thu, Feb 25, 2016 at 9:24 AM, Pat Ferrel wrote: > This is awesome news! Can’t wait to get a copy. Congratulations Dmitriy > and Andrew. > > Also thanks for the invitation Scott. > > I feel like Mahout has

Re: Mahout error : seq2sparse

2016-02-04 Thread Andrew Musselman
rlier versions. > > Thanks, > Alok Tanna > > On Thu, Feb 4, 2016 at 2:18 AM, Alok Tanna <tannaa...@gmail.com> wrote: > >> Will try to update it to night to the latest version and then give it a >> try . >> >> Thanks, >> Alok Tanna >> >&

Re: Code execution path of mahout

2016-02-03 Thread Andrew Musselman
what I am looking for? > > Regards, > Mahmood > > > On Wednesday, February 3, 2016 10:59 PM, Andrew Musselman < > andrew.mussel...@gmail.com <javascript:;>> wrote: > > > > Here are a bunch > https://github.com/apache/mahout/tree/master/math/src/main/java/org/a

Re: Mahout error : seq2sparse

2016-02-03 Thread Andrew Musselman
Is it possible you have any empty lines or extra whitespace at the end or in the middle of any of your input files? I don't know for sure but that's where I'd start looking. Are you on the most recent release? On Wed, Feb 3, 2016 at 7:33 PM, Alok Tanna wrote: > Mahout in

Re: Mahout error : seq2sparse

2016-02-03 Thread Andrew Musselman
Would recommend updating to the latest version if you can; you're probably working with two-releases-old code. On Wednesday, February 3, 2016, Alok Tanna wrote: > Thank you Andrew . I was able to remove empty lines with your help and > also run re run the process but then

Re: Mahout error : seq2sparse

2016-02-03 Thread Andrew Musselman
> In the earlier attach file you can see it says 16/02/03 22:59:04 INFO > mapred.MapTask: Record too large for in-memory buffer: 99614722 bytes > > How can I increase in-memory buffer for Mahout local mode. > > I hope this has nothing to do with this error. > > Thanks, >

Re: Mahout error : seq2sparse

2016-02-03 Thread Andrew Musselman
ich you're free to take a stab at. On Wed, Feb 3, 2016 at 9:21 PM, Andrew Musselman <andrew.mussel...@gmail.com > wrote: > $ for i in `ls input-directory`; do sed -i '/^$/d' input-directory/$i; done > > On Wed, Feb 3, 2016 at 9:08 PM, Alok Tanna <tannaa...@gmail.com> wrote: &g

Re: Mahout error : seq2sparse

2016-02-03 Thread Andrew Musselman
uld save lot of > time. > I would re run this once I have removed empty lines. > > It would be great if I can get this working in local mode or else I will > have to send few days to get it working on hadoop\spark cluster. > > Thanks, > Alok Tanna > > On Wed,

Re: Code execution path of mahout

2016-02-03 Thread Andrew Musselman
st important data > structures are defined? I mean where does it create the chunks (or read the > chunks)? What are the sizes of the matrices? are they typically small > (10x10) or large (1000x1000)? > > > Regards, > Mahmood > > > On Wednesday, February 3, 2016 10:4

Re: Code execution path of mahout

2016-02-03 Thread Andrew Musselman
? Do the mahout rely on nested loops? How about > branch distribution in the code. > > So, you may answer the questions for the new version, then I will try to > map that to the old version by comparing the functions. > > Regards, > Mahmood > > > On Wednesday, February 3, 20

Re: Code execution path of mahout

2016-02-03 Thread Andrew Musselman
Hi Mahmood, would be possible to trace the path out in an IDE like IntelliJ but there's no automated method to print that out, if that's what you're asking. Definitely recommend upgrading as that's five major releases old if at all possible. Best Andrew On Wed, Feb 3, 2016 at 10:35 AM, Mahmood

User interview

2016-01-27 Thread Andrew Musselman
To the List, if anyone would be open to being interviewed as a user of Mahout for an article please let me know. I can let you know details and put you in touch with the writer. Thanks!

Re: Mahout 0.11.1

2015-11-09 Thread Andrew Musselman
Done On Mon, Nov 9, 2015 at 10:19 AM, Pat Ferrel wrote: > Can someone forward the announcement directly to my email? I didn’t get > the announcement of release.

Re: [VOTE] Apache Mahout 0.11.1 Release Candidate

2015-11-06 Thread Andrew Musselman
se has passed and the Voting is officially closed, will send an > > announcement out when the release has been finalized. > > > > Thanks again. > > > > On Fri, Nov 6, 2015 at 5:57 PM, Andrew Musselman < > > andrew.mussel...@gmail.com > > > wrote: >

Re: [VOTE] Apache Mahout 0.11.1 Release Candidate

2015-11-06 Thread Andrew Musselman
Checked sigs, built and ran some calculations in spark-shell from tar and zip. +1 binding On Fri, Nov 6, 2015 at 2:41 PM, Andrew Palumbo wrote: > 1. Downloaded and built {src} {tar}- all tests passed. > 2. Started shell from {src} {bin} *{tar} distro and ran some

Re: [VOTE] Apache Mahout 0.11.1 Release Candidate

2015-11-06 Thread Andrew Musselman
Src tar and zips build and tests pass but I may have some issues: $ echo $MASTER spark://Bob:7077 $ ./bin/mahout spark-shell MAHOUT_LOCAL is set, so we don't add HADOOP_CONF_DIR to classpath. Cannot find Spark classpath. Is 'SPARK_HOME' set? $ echo $SPARK_HOME /home/akm/spark-1.5.1-bin-hadoop2.4

Re: matrix inversion in plan ?

2015-10-08 Thread Andrew Musselman
e > thing): > > val (drmU, drmV, s) = dssvd(drmA, k = 100) > val drmInvA = drmV %*% diagv(1 /=: s) %*% drmU.t > > Still, technically, it is a right inverse as in reality m is rarely the > same as n. Also, k must be k<= drmA.nrow min drmA.ncol > > > On Thu

Re: matrix inversion in plan ?

2015-10-08 Thread Andrew Musselman
Yeah, nice trick Ted; here's a how-to for the list: http://www.cse.unr.edu/~bebis/CS791E/Notes/SVD.pdf On Thu, Oct 8, 2015 at 2:31 PM, Ted Dunning wrote: > Yes. You can get the inverse from an SVD or emulate its effect. > > Can you share the actual mathematical

Re: matrix inversion in plan ?

2015-10-08 Thread Andrew Musselman
ght inverse as in reality m is rarely the > > same as n. Also, k must be k<= drmA.nrow min drmA.ncol > > > > > > On Thu, Oct 8, 2015 at 2:52 PM, Andrew Musselman < > > andrew.mussel...@gmail.com> wrote: > > > >> Yeah, nice trick Ted; here's a

Re: matrix inversion in plan ?

2015-10-03 Thread Andrew Musselman
If there's a need for inversion that's good info; would love to know the purpose to get a sense of how people want to use the product. On Saturday, October 3, 2015, Allen McIntosh wrote: > Can you explain why you feel you must invert a very large matrix. This > can be

Re: Time Series Stuff

2015-08-14 Thread Andrew Musselman
Agreed; let us know if you want some help getting started. On Friday, August 14, 2015, Dmitriy Lyubimov dlie...@gmail.com wrote: Not that I know of. would be nice to have. On Fri, Aug 14, 2015 at 4:42 PM, Nick Kolegraff nickkolegr...@gmail.com javascript:; wrote: Hey Mahouts, Looking

Re: [VOTE] Apache Mahout 0.11.0 Release Candidate

2015-08-05 Thread Andrew Musselman
+1, already tested On Wed, Aug 5, 2015 at 9:44 PM, Suneel Marthi smar...@apache.org wrote: This is the vote for release 0.11.0 of Apache Mahout. The vote will be going for at least 72 hours and will be closed on Thursday, August 6th, 2015. Please download, test and vote with [ ] +1,

Re: [VOTE] Apache Mahout 0.10.2 Release

2015-08-05 Thread Andrew Musselman
Hashes for src tar and zip are correct and all tests pass; no code changes so I'm comfortable. +1 binding On Wed, Aug 5, 2015 at 5:21 PM, Suneel Marthi smar...@apache.org wrote: Tested the examples from {src, bin} in pseudo-cluster mode and all tests pass. Here's my +1 (binding) On Wed,

Re: [VOTE] Apache Mahout 0.10.2 Release Candidate

2015-08-04 Thread Andrew Musselman
This was user error, revoking my binding -1 vote. On Sun, Aug 2, 2015 at 5:22 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: -1 unless this is operator error on my part. $ gpg --verify Downloads/apache-mahout-distribution-0.10.2-src.zip.asc gpg: no signed data gpg: can't hash

Re: [VOTE] Apache Mahout 0.10.2 Release Candidate

2015-08-02 Thread Andrew Musselman
Is there any reason not to release 11 too? On Sunday, August 2, 2015, Pat Ferrel p...@occamsmachete.com wrote: +1 (binding) — do we have to say binding? Why do we continue on Spark 1.2 when all distros have updated to Spark 1.3.1 long ago, and Spark has released 1.4 with 1.5 in the works.

Re: [VOTE] Apache Mahout 0.10.2 Release Candidate

2015-08-02 Thread Andrew Musselman
definitely don't have the time) . If someone else wants to push thru 0.11.0, please do so. On Sun, Aug 2, 2015 at 1:27 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Is there any reason not to release 11 too? On Sunday, August 2, 2015, Pat Ferrel p...@occamsmachete.com wrote

Re: Kmeans clusterdump Interpretation

2015-07-20 Thread Andrew Musselman
kmeans gave me a point vector as a centroid, not a calculated point central to a cluster. I guess in this case I would be looking for the most central point vector (from the index ) that I can use as a representative of the cluster. On Tue, Jul 21, 2015 at 6:41 AM, Andrew Musselman

Re: Kmeans clusterdump Interpretation

2015-07-20 Thread Andrew Musselman
I'm not sure centroid id is even a defined thing, especially since the centroid, in my understanding, is just a point in space, not necessarily a point in your data. Are you trying to find the most-central point in a given cluster? On Mon, Jul 20, 2015 at 5:18 PM, Ankit Goel

Re: Building Mahout Source

2015-06-11 Thread Andrew Musselman
Also it's worth noting there's a point release, 0.10.1, available now. On Thursday, June 11, 2015, Dmitriy Lyubimov dlie...@gmail.com wrote: I am not sure how maven repo is managed for released apache projects. Binary artifacts are available for downloads. Also if you are building from

Updated AMI for EMR

2015-06-01 Thread Andrew Musselman
AWS will be releasing a new AMI in July that will include our 0.10.1 release.

Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-31 Thread Andrew Musselman
Marthi suneel.mar...@gmail.com wrote: Please hold ur votes, will be refreshing staging with another build in the next hour On Sat, May 30, 2015 at 8:31 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Likewise source zip and tarballs build and pass tests. On Sat, May 30, 2015

Re: [VOTE] Mahout 0.10.1 Release Candidate

2015-05-30 Thread Andrew Musselman
Likewise source zip and tarballs build and pass tests. On Sat, May 30, 2015 at 3:23 PM, Suneel Marthi smar...@apache.org wrote: Verified {source} * {zip, tar} and all tests pass. +1 (binding) On Sat, May 30, 2015 at 5:28 PM, Suneel Marthi smar...@apache.org wrote: This is a call for VOTE

Re: I have a weird exception in Online logistic regression

2015-05-17 Thread Andrew Musselman
Could you post your code and some sample data? On Sunday, May 17, 2015, Aykut Çayır aykutcayi...@gmail.com wrote: Hi mahouters, I have tried to implement an application to classify Breast caner using wisconsin dataset (UCI). However, I have An exception Array out bounds exception dene

Re: Speed up LDA in Mahit 0.9

2015-05-08 Thread Andrew Musselman
I'd also recommend getting the newest version of Mahout, 0.10. On Fri, May 8, 2015 at 7:15 AM, Yutaka Mandai 20525entrad...@gmail.com wrote: If it's small enough to fit in memory, setting MAHOUT_LOCAL=TRUE should drive you crazy! I've suffered a lot from running LDA(CVB0) on even on EMR. If

Re: [VOTE] Apache Mahout 0.10.0 Release

2015-04-11 Thread Andrew Musselman
: Ran well but we have a packaging problem with the binary distro. Will require either a pom or code change I think, hold the vote. On Apr 9, 2015, at 4:31 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Running on EMR now. On Thu, Apr 9, 2015 at 3:52 PM

Re: [VOTE] Apache Mahout 0.10.0 Release

2015-04-11 Thread Andrew Musselman
checking the {source} * {tar,zip} and running a few tests locally, I am fine with this release. +1 (binding) On Sat, Apr 11, 2015 at 11:43 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: After checking the binary tarball and zip, and running through all

Re: How to change /tmp directory for mahout usage of map-reduce?

2015-03-31 Thread Andrew Musselman
Can you let us know which code/scripts you're using? On Tuesday, March 31, 2015, Vikas Kumar kumar...@umn.edu wrote: Hello, I am using Mahout Spectral clustering example which internally calls a map reduce job. Right now, it is using */tmp/hadoop-username/mapred/..* directory by default for

Re: mahout output of seq2sparse is empty

2015-03-03 Thread Andrew Musselman
I don't have a terminal in front of me but are you sure tfidf-vectors is a file, not a directory? On Tuesday, March 3, 2015, Raghuveer alwaysra...@yahoo.com.invalid wrote: I have data file of the formatsrc_ip,dest_ip,packet, bytes_transferred, src_port,dest_port, start_timestamp

Re: FPGrowth and Recommendations

2015-03-02 Thread Andrew Musselman
Hi Jeff, as I recall the map-reduce-based fp-growth solution was problematic, and it's been either deprecated or removed. There are better solutions under the recommendations tab at http://mahout.apache.org And I would encourage your updating your version of Mahout to 0.9 or to the master branch

Re: Custome metrics in Mahout Recommendation

2015-03-02 Thread Andrew Musselman
If you check out the latest master branch from https://github.com/apache/mahout you'll find classes like this in the map-reduce legacy package: /home/akm/mahout/mrlegacy/src/main/java/org/apache/mahout/common/distance/CosineDistanceMeasure.java I'm not sure if/where new ones are being written..

Re: Tanimoto Coefficient

2014-12-17 Thread Andrew Musselman
I've never used it in production but there's no reason not to try it out, and there's nothing stopping you from applying it to users as well as items. On Wed, Dec 17, 2014 at 1:09 AM, ARROYO MANCEBO David david.arr...@altran.com wrote: Hi mahouters, Is useful and acceptable the tanimoto

Re: Advise needed for Mahout heap size allocation (seq2sparse failure)

2014-12-17 Thread Andrew Musselman
But also please update to Mahout version 0.9 since you're two versions behind. On Wed, Dec 17, 2014 at 10:55 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: It's worth trying to increase the heap size for child JVMs per this doc, depending on what version you're running: http

Re: computing the distance between 2 values from fuzzyKmeans clustering clusteredPoints

2014-12-09 Thread Andrew Musselman
Could you please upgrade to Mahout 0.9 or work off of trunk? Some of the code related to k-means results changed since 0.8. Thanks! On Tue, Dec 9, 2014 at 1:57 PM, Anne Sauve anne.sa...@hotmail.com wrote: Hello there, I have been trying for a while to compute the pairwise distance between

Re: Topological data analysis

2014-12-05 Thread Andrew Musselman
use case, I'd +1 the idea! On Dec 4, 2014, at 3:11 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Any interest in a topological data analysis package in Mahout? https://www.google.com/search?q=topological+data+analysis http://danifold.net/mapper/introduction.html

Topological data analysis

2014-12-04 Thread Andrew Musselman
Any interest in a topological data analysis package in Mahout? https://www.google.com/search?q=topological+data+analysis http://danifold.net/mapper/introduction.html http://danifold.net/mapper Would be nice to be able to run jobs and and export to JSON for consumption in D3 or other

Re: Mahout 0.7 ALS Recommender: java.lang.Exception: java.lang.RuntimeException: java.lang.ClassCastException: org.apache.hadoop.io.Text cannot be cast to org.apache.hadoop.io.IntWritable

2014-11-23 Thread Andrew Musselman
) at org.apache.hadoop.util.ShutdownHookManager$1.run(ShutdownHookManager.java:54) On 23 November 2014 at 09:22, Andrew Musselman andrew.mussel...@gmail.com wrote: Please upgrade to Mahout version 0.9, as many things have been fixed since. On Nov 22, 2014, at 7:00 PM, Ashok Harnal ashokhar...@gmail.com

Re: Mahout 0.7 ALS Recommender: java.lang.Exception: java.lang.RuntimeException: java.lang.ClassCastException: org.apache.hadoop.io.Text cannot be cast to org.apache.hadoop.io.IntWritable

2014-11-22 Thread Andrew Musselman
Please upgrade to Mahout version 0.9, as many things have been fixed since. On Nov 22, 2014, at 7:00 PM, Ashok Harnal ashokhar...@gmail.com wrote: I use mahout 0.7 installed in Cloudera. After creating user-feature and item-feature matrix in hdfs, I run the following command: mahout

Re: recommenditembased returns 0 records from last map-reduce job

2014-07-20 Thread Andrew Musselman
I'm confused about how you're constructing the user file, and why there are negated item ids here. Can you post some more details please, including Mahout version and some sample data sets? On Jul 20, 2014, at 11:57 AM, Serega Sheypak serega.shey...@gmail.com wrote: Hi, I'm trying to

New post on the AWS blog

2014-07-17 Thread Andrew Musselman
I wrote a post about building a recommender using Mahout on Amazon EMR and it finally went live: http://blogs.aws.amazon.com/bigdata/post/Tx1TDK3HHBD4EZL/Building-a-Recommender-with-Apache-Mahout-on-Amazon-Elastic-MapReduce-EMR Please feel free to share the link to get the word out. Best

Re: mysql connection

2014-06-25 Thread Andrew Musselman
Mahout can't read directly out of MySQL. First bring data into HDFS/Hive using something like Sqoop as per: http://sqoop.apache.org On Jun 24, 2014, at 11:04 PM, vinayakb malagatti vinayakbmalaga...@gmail.com wrote: i am using apache mahout and i want read the table contents form the db

Re: mysql connection

2014-06-24 Thread Andrew Musselman
This sounds like a good question for the sqoop user list. See http://sqoop.apache.org On Jun 24, 2014, at 10:47 PM, vinayakb malagatti vinayakbmalaga...@gmail.com wrote: hi, how to connect to MySql DB and read the table contents. Thanks and Regards, Vinayak B

Re: Interpretation of cluster output

2014-06-18 Thread Andrew Musselman
1000 Also I have observed that the *part* file created inside *clusteredPoints* is empty. Please help me how to get data points from each cluster. On Fri, Jun 13, 2014 at 9:24 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: That's going to be easier if you can work off of trunk

Re: Interpretation of cluster output

2014-06-18 Thread Andrew Musselman
cluster. On Fri, Jun 13, 2014 at 9:24 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: That's going to be easier if you can work off of trunk, since the output of clustering has been cleaned up to write a better format, per https://issues.apache.org/jira/browse/MAHOUT-1505 E.g

Re: Interpretation of cluster output

2014-06-13 Thread Andrew Musselman
That's going to be easier if you can work off of trunk, since the output of clustering has been cleaned up to write a better format, per https://issues.apache.org/jira/browse/MAHOUT-1505 E.g., { top_terms: [ {all:3.0149030685424805}, {english:3.0149030685424805},

<    1   2   3   >