Re: IntelliJ license keys

2013-04-19 Thread Andrew Musselman
Absolutely agree; IntelliJ is a great tool. On Fri, Apr 19, 2013 at 11:34 AM, Ted Dunning ted.dunn...@gmail.com wrote: Yes. This is for committers only. But the community version of IntelliJ is really awesome anymore. And frankly, if I were not a committer I would buy a copy of the

Re: Cannot resolve symbol 'OpenIntObjectHashMap'

2013-04-30 Thread Andrew Musselman
) and then open the pom file using a clean start of IntelliJ? This will avoid question of cached evil. Also, which version of IntelliJ are you using? On Tue, Apr 16, 2013 at 11:10 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: On Tue, Apr 16, 2013 at 3:14 PM, Ted Dunning ted.dunn

Re: Cannot resolve symbol 'OpenIntObjectHashMap'

2013-04-30 Thread Andrew Musselman
PS IntelliJ Community Edition 11.1.5 for Linux On Tue, Apr 30, 2013 at 2:36 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Get the svn repo from here? https://cwiki.apache.org/MAHOUT/buildingmahout.html svn co http://svn.apache.org/repos/asf/mahout/trunk On Tue, Apr 16, 2013

Re: Cannot resolve symbol 'OpenIntObjectHashMap'

2013-04-30 Thread Andrew Musselman
Upgrading to IntelliJ 12 has fixed this prob! Thanks On Tue, Apr 30, 2013 at 2:39 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: PS IntelliJ Community Edition 11.1.5 for Linux On Tue, Apr 30, 2013 at 2:36 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Get the svn repo

Re: Call to action – Mahout needs your help

2013-05-02 Thread Andrew Musselman
would love to have you! I will let others answer about things to do since I have to fly. On Fri, Apr 5, 2013 at 1:56 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: In case this thread is still a good place to reply with an offer to help, I'd love to pitch in. I have built a few

Re: Call to action – Mahout needs your help

2013-05-02 Thread Andrew Musselman
/licenses/icla.txt but i think apache does not require it for each single contribution... if you want to become permanent committer and supporter of your sizeable contribution then yes Apache would want you to sign it. On Thu, May 2, 2013 at 4:05 PM, Andrew Musselman andrew.mussel...@gmail.com

Re: This is the high level scenario ---- Re: Can Mahout do pattern recognition?

2013-05-31 Thread Andrew Musselman
This sounds like something Storm was purpose-built for: http://storm-project.net/ It lets you do computation on streams coming in. Hope this helps. On Fri, May 31, 2013 at 2:27 PM, Mimi Tam mimi@ieee.org wrote: I have a gazillion streams of wireless call control data coming in from a

Re: (Bi-)Weekly/Monthly Dev Sessions

2013-06-12 Thread Andrew Musselman
Bi-weekly is good for me; I'm in Seattle and just filled out the poll. Great idea! On Wed, Jun 12, 2013 at 10:22 AM, Saikat Kanjilal sxk1...@hotmail.comwrote: +1, am in Seattle as well and would love to attend and be involved. Sent from my iPhone On Jun 12, 2013, at 10:18 AM, Ravi

Re: (Bi-)Weekly/Monthly Dev Sessions

2013-06-12 Thread Andrew Musselman
Let's do it in Ballard :) On Wed, Jun 12, 2013 at 10:56 AM, Jake Mannix jake.man...@gmail.com wrote: Wow, a lot of Seattleites, I should organize a Mahout MeetUp / Hackathon when I get back from europe at the end of the summer! On Wed, Jun 12, 2013 at 10:44 AM, Andrew Musselman

Re: (Bi-)Weekly/Monthly Dev Sessions

2013-07-07 Thread Andrew Musselman
Works for me, thanks for setting it up. On Jul 7, 2013, at 3:24 PM, Grant Ingersoll gsing...@apache.org wrote: How about tomorrow (Monday) night at 8:30 pm EDT? Anyone who wants to join, can browse to https://plus.google.com/hangouts/_/1aa32da8d1f9b1669cf6b5ec8bce123d12aec409?hl=en If

Re: (Bi-)Weekly/Monthly Dev Sessions

2013-07-08 Thread Andrew Musselman
I'm getting an error when I build after doing svn up: $ mvn package [INFO] Scanning for projects... [ERROR] The build could not read 1 project - [Help 1] [ERROR] [ERROR] The project (/home/akm/mahout/pom.xml) has 1 error [ERROR] Non-readable POM /home/akm/mahout/pom.xml: no more data

AWS test bed

2013-07-09 Thread Andrew Musselman
One of the things we chatted about last night in the hangout was how to automate this regression process. I reached out to our friends at Amazon Web Services, who are looking at how they could donate compute time so we could use a cluster as well regressing on our own hosts. We could either spin

Re: 0.8 progress

2013-07-10 Thread Andrew Musselman
9, 2013 at 1:50 PM, Ted Dunning ted.dunn...@gmail.com wrote: In meetings back to back 9-11 hours a day this wek. On Tue, Jul 9, 2013 at 12:37 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: No time yet today to run stuff on a cluster but I will do it tonight. On Tue, Jul 9

Re: Mahout release process

2013-07-10 Thread Andrew Musselman
That's how the maven release plugin does it in my experience, and yes that's what I get now too. On Wed, Jul 10, 2013 at 10:54 AM, Jake Mannix jake.man...@gmail.com wrote: So quick question: is an intentional side-effect of the current release process that when we build on trunk now, we build

Re: Mahout release process

2013-07-10 Thread Andrew Musselman
I have to admit I spent as little time as possible learning how to do maven releases so I don't know the answer to this. On Wed, Jul 10, 2013 at 11:06 AM, Jake Mannix jake.man...@gmail.com wrote: On Wed, Jul 10, 2013 at 10:00 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: That's

Re: survey of good time for rec demo status meeting

2013-07-19 Thread Andrew Musselman
It's requiring me to fill out an answer for every day but I can't do Monday. I can make time any time after noon Pacific the other days. Thanks Andrew On Jul 19, 2013, at 6:10 PM, Ted Dunning ted.dunn...@gmail.com wrote: I am hearing lots of support for the demo project off-line. Let's move

Re: Hangout on Monday

2013-08-03 Thread Andrew Musselman
Sounds good On Sat, Aug 3, 2013 at 12:04 AM, Ted Dunning ted.dunn...@gmail.com wrote: Yes. 1600 PDT I got that right in the linked doc, just not on the more important email. On Fri, Aug 2, 2013 at 3:30 PM, Andrew Psaltis andrew.psal...@webtrends.com wrote: On 8/2/13 4:42 PM,

Re: Hangout on Monday

2013-08-05 Thread Andrew Musselman
Can't make it alas On Mon, Aug 5, 2013 at 3:12 PM, Michael Kun Yang kuny...@stanford.eduwrote: what's the addr of the hangout? On Sun, Aug 4, 2013 at 10:37 AM, Peng Cheng pc...@uowmail.edu.au wrote: Nice, I'll be there. On 13-08-03 02:51 PM, Andrew Musselman wrote: Sounds good

Re: You are invited to Apache Mahout meet-up

2013-08-22 Thread Andrew Musselman
Likewise; we talked about getting some other local Mahout meetups going. I'm in Seattle and I know there are other people up here. Let's get one started too. On Aug 22, 2013, at 8:18 PM, B Lyon bradfl...@gmail.com wrote: Wish I could be there as well. On Thursday, August 22, 2013, Gokhan

Re: You are invited to Apache Mahout meet-up

2013-08-23 Thread Andrew Musselman
haven't attended. On Aug 22, 2013, at 8:38 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Likewise; we talked about getting some other local Mahout meetups going. I'm in Seattle and I know there are other people up here. Let's get one started too.

Re: You are invited to Apache Mahout meet-up

2013-08-23 Thread Andrew Musselman
think Jake Mannix is in Seattle and already has a more general meetup here. http://www.meetup.com/Seattle-DAML/ It seems very non-Mahout specific, I haven't attended. On Aug 22, 2013, at 8:38 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Likewise; we talked about

Re: You are invited to Apache Mahout meet-up

2013-08-24 Thread Andrew Musselman
That's a fine meetup; see you then. On Aug 23, 2013, at 10:57 PM, Ted Dunning ted.dunn...@gmail.com wrote: See this URL. http://www.meetup.com/Seattle-Hadoop-HBase-NoSQL-Meetup/events/120290942/ Sent from my iPhone On Aug 23, 2013, at 9:07, Andrew Musselman andrew.mussel

Re: You are invited to Apache Mahout meet-up

2013-08-29 Thread Andrew Musselman
That was a great talk, Ted; any chance you could share your slides? On Sat, Aug 24, 2013 at 4:19 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: That's a fine meetup; see you then. On Aug 23, 2013, at 10:57 PM, Ted Dunning ted.dunn...@gmail.com wrote: See this URL. http

Re: All links are broken on the Mahout quickstart page

2013-09-04 Thread Andrew Musselman
Or rather put in a 301(permanantly moved) redirect so any old links end up in the right place? On Wed, Sep 4, 2013 at 10:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Can we take down the other one? Been a few people caught up by this problem recently. On Wed, Sep 4, 2013

Re: All links are broken on the Mahout quickstart page

2013-09-04 Thread Andrew Musselman
Can we take down the other one? Been a few people caught up by this problem recently. On Wed, Sep 4, 2013 at 10:20 AM, Stevo Slavić ssla...@gmail.com wrote: https://cwiki.apache.org/confluence/display/MAHOUT/Quickstart On Wed, Sep 4, 2013 at 7:16 PM, Ravi Mummulla ravi.mummu...@gmail.com

Eigencuts version of spectral clustering

2013-09-04 Thread Andrew Musselman
Looks like this is finished as of May of this year, but is there still the bottleneck performance issue with it? I.e., is it useful in production? Thanks Andrew

Re: interesting matrix decomposition idea

2013-09-08 Thread Andrew Musselman
The divide and conquer approach sounds promising; along those lines I have heard things about non-negative matrix factorization being worth doing since there are methods to break the matrix into parts and then combine the result after processing.

Re: Mahout's future

2013-10-05 Thread Andrew Musselman
I would like to help with ALS including adding blending. On Oct 5, 2013, at 6:13 PM, Ted Dunning ted.dunn...@gmail.com wrote: On Sat, Oct 5, 2013 at 5:08 PM, Saikat Kanjilal sxk1...@hotmail.com wrote: Does it make sense to have a quick meeting of interested developers over google

Re: Mahout's future

2013-10-05 Thread Andrew Musselman
Amazon hosts some public data sets at http://aws.amazon.com/publicdatasets/ and http://aws.amazon.com/datasets On Oct 5, 2013, at 1:11 PM, Ted Dunning ted.dunn...@gmail.com wrote: I was asked to answer an anonymous question about the future of Mahout on Quora and thought I should share the

Re: Mahout's future

2013-10-05 Thread Andrew Musselman
, please speak up. On Sat, Oct 5, 2013 at 6:46 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Amazon hosts some public data sets at http://aws.amazon.com/publicdatasets/ and http://aws.amazon.com/datasets On Oct 5, 2013, at 1:11 PM, Ted Dunning ted.dunn...@gmail.com wrote: I

Re: [jira] [Commented] (MAHOUT-1030) Regression: Clustered Points Should be WeightedPropertyVectorWritable not WeightedVectorWritable

2013-10-31 Thread Andrew Musselman
Assignee: Andrew Musselman Fix For: 1.0, 0.9 Attachments: MAHOUT-1030.patch, MAHOUT-1030.patch, MAHOUT-1030.patch Looks like this won't make it into this build. Pretty widespread impact on code and tests and I don't know which properties were implemented in the old

Re: Cloudera announces Oryx

2013-11-12 Thread Andrew Musselman
I'd like to congratulate Sean and Cloudera on shipping a system that does a few things well and then lets you put them into production easily. This feels like the direction Mahout ought to go as well, and the group's been going toward a simpler system recently. My reason for using Mahout is that

Re: Mahout 0.9 release

2013-11-28 Thread Andrew Musselman
on Open JIRAs for 0.9: Mahout-1245, Mahout-1304, Mahout-1305, Mahout-1307, Mahout-1326 - all related to Wiki updates, please see Isabel's updates. M-1286 - Peng and Sebastian, we had talked about this during the last hangout. Can this be included in 0.9? M-1030- Andrew Musselman, its

Re: Mahout 0.9 release

2013-11-29 Thread Andrew Musselman
/update the wiki M-1286 - Peng and Sebastian, no updates on this. Can this be included in 0.9? I will look into this over the weekend! M-1030 - Andrew Musselman I'll have a patch for review this weekend, made progress yesterday. M-1319, M-1328 - Suneel M-1347 - Suneel, patch has

Re: Mahout 0.9 release

2013-12-01 Thread Andrew Musselman
- Peng and Sebastian, no updates on this. Can this be included in 0.9? I will look into this over the weekend! M-1030 - Andrew Musselman M-1319, M-1328 - Suneel M-1347 - Suneel, patch has been committed to trunk. M-1265 - I have been working with Yexi on this. Ted, would u

Re: Mahout 0.9 release

2013-12-01 Thread Andrew Musselman
No, just reviewboard in general; never put any patches up before. On Dec 1, 2013, at 8:52 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: For M-1349?? There's no patch for this, no one's worked on it yet. On Sunday, December 1, 2013 11:50 AM, Andrew Musselman andrew.mussel

Re: Mahout 0.9 release

2013-12-01 Thread Andrew Musselman
Thanks; me too On Dec 1, 2013, at 10:53 AM, Suneel Marthi suneel_mar...@yahoo.com wrote: Sorry I am out on the streets but see M-1265 comments for a link to review board Sent from my iPhone On Dec 1, 2013, at 1:50 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: No, just

Re: Mahout 0.9 release

2013-12-01 Thread Andrew Musselman
the link to Reviewboard https://reviews.apache.org On Sunday, December 1, 2013 1:51 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: No, just reviewboard in general; never put any patches up before. On Dec 1, 2013, at 8:52 AM, Suneel Marthi suneel_mar...@yahoo.com wrote

Re: Welcome to Frank Scholten as new Mahout committer

2013-12-03 Thread Andrew Musselman
Welcome Frank; congratulations! On Tue, Dec 3, 2013 at 8:31 AM, Stevo Slavić ssla...@gmail.com wrote: Congrats and welcome Frank! On Tue, Dec 3, 2013 at 2:34 PM, Gokhan Capan gkhn...@gmail.com wrote: Congratulations, Frank! Gokhan On Tue, Dec 3, 2013 at 3:27 PM, Isabel

Re: Mahout 0.9 release

2013-12-03 Thread Andrew Musselman
suneel_mar...@yahoo.com Its been a while since I last did it, I think the Base Directory needs to be mahout/trunk. On Monday, December 2, 2013 1:17 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Any tips on submitting to reviewboard for mahout? I tried selecting repo

Re: Mahout 0.9 release

2013-12-03 Thread Andrew Musselman
Yeah On Dec 3, 2013, at 1:36 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Andrew, Let me try this. Is it the patch for M-1030 that u r trying to get onto reviewboard? On Tuesday, December 3, 2013 3:14 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Must

Re: Mahout 0.9 release

2013-12-03 Thread Andrew Musselman
be because your patch was generated from http://svn.apache.org/repos/asf/mahout/trunk. On Tuesday, December 3, 2013 4:39 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Yeah On Dec 3, 2013, at 1:36 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Andrew, Let me try

Re: Mahout 0.9 release

2013-12-07 Thread Andrew Musselman
Isabel and Sebastian, let me know how I can help on the wiki bugs; I made an account for Apache Confluence. Wiki - Isabel, Sebastian and other volunteers - M-1245, M-1304, M-1305, M-1307, M-1326 On Tue, Dec 3, 2013 at 5:12 PM, Suneel Marthi

Re: Mahout 0.9 release

2013-12-19 Thread Andrew Musselman
+1 On Thu, Dec 19, 2013 at 9:20 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: +1 Sent from my iPhone On Dec 19, 2013, at 12:17 PM, Frank Scholten fr...@frankscholten.nl wrote: I am looking at M-1329 (Support for Hadoop 2.x) as we speak. This change requires quite some testing and

Re: Wiki cleanup

2013-12-19 Thread Andrew Musselman
Likewise I'd like to help, if someone can grant me permission to edit please do. On Dec 19, 2013, at 11:11 PM, Isabel Drost-Fromm isa...@apache.org wrote: Hi, seems like after the Confluence account lock-down I'm lacking permissions to delete pages in our wiki (currently using the account

Re: Wiki cleanup

2013-12-19 Thread Andrew Musselman
Perfect name :) On Dec 19, 2013, at 11:29 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Grant has all the edit Grants!! On Friday, December 20, 2013 2:28 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Likewise I'd like to help, if someone can grant me permission

Re: Mahout 0.9 Release Notes - First Draft

2013-12-23 Thread Andrew Musselman
Suneel ran into some issues this weekend; I'm going to try it out and see if I can repro. On Dec 23, 2013, at 1:02 AM, Isabel Drost-Fromm isa...@apache.org wrote: Hi, one thing I forgot: you once mentioned running into issues with the new kmeans - are those fixed or tracked in jira? In

Re: Happy Holidays!

2013-12-24 Thread Andrew Musselman
Merry Christmas and a Happy New Year! On Dec 24, 2013, at 3:36 PM, Stevo Slavić ssla...@gmail.com wrote: Happy Holidays Everyone! On Tue, Dec 24, 2013 at 12:28 PM, Frank Scholten fr...@frankscholten.nlwrote: Best wishes! On Tue, Dec 24, 2013 at 11:11 AM, Sebastian Schelter

Re: MAHOUT 0.9 Release - New URL

2014-01-20 Thread Andrew Musselman
Trying out the build today On Mon, Jan 20, 2014 at 6:00 AM, Suneel Marthi suneel_mar...@yahoo.comwrote: This is an issue (trivial one though) that needs to be fixed for 0.9 Release, will be rerolling the release today (in the next few hrs) and putting out a new release candidate in staging.

Re: MAHOUT 0.9 Release - New URL

2014-01-20 Thread Andrew Musselman
(ThreadPoolExecutor.java:1146) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:701) On Mon, Jan 20, 2014 at 9:37 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Trying out the build today On Mon, Jan 20

Re: MAHOUT 0.9 Release - New URL

2014-01-21 Thread Andrew Musselman
has failed for you as well. I'll be rolling back the release today to fix these issues. On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests

Re: MAHOUT 0.9 Release - New URL

2014-01-21 Thread Andrew Musselman
the release today to fix these issues. On Tuesday, January 21, 2014 1:22 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Builds on Ubuntu 12.04 from tarball and zip, and on AWS's default 64-bit Linux AMI from tarball. All tests pass. *Output

Re: MAHOUT 0.9 Release - New URL

2014-01-22 Thread Andrew Musselman
Likewise, a) through d) work on an Amazon AMI and Ubuntu 12.04. +1 On Wed, Jan 22, 2014 at 6:38 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Same here. I did a), b), c) and d) too and all tests pass. Here's my +1, if my vote counts. On Wednesday, January 22, 2014 7:11 PM, Sebastian

cluster-reuters.sh broken in trunk

2014-01-24 Thread Andrew Musselman
Last night I had this issue when testing out cluster-reuters.sh with no flags; anyone seen this recently? 14/01/23 22:03:54 INFO driver.MahoutDriver: Program took 286799 ms (Minutes: 4.7799833) Running on hadoop, using /home/akm/hadoop-0.20.205.0/bin/hadoop and HADOOP_CONF_DIR=

Re: cluster-reuters.sh broken in trunk

2014-01-24 Thread Andrew Musselman
Yeah, disregard, my repo was out of whack. On Fri, Jan 24, 2014 at 10:00 AM, ap.dev ap@outlook.com wrote: I'm not getting any exceptions there. Original message From: Andrew Musselman andrew.mussel...@gmail.com Date:01/24/2014 11:38 AM (GMT-05:00) To: dev

Re: cluster-reuters.sh broken in trunk

2014-01-24 Thread Andrew Musselman
) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:601) at org.apache.hadoop.util.RunJar.main(RunJar.java:156) On Fri, Jan 24, 2014 at 10:07 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Yeah, disregard, my repo was out of whack. On Fri, Jan 24

Re: [jira] [Updated] (MAHOUT-1410) clusteredPoints do not contain a vector id

2014-01-25 Thread Andrew Musselman
Trying it out, found one test failure: Failed tests: ClusterClassificationDriverTest.testVectorClassificationWithOutlierRemovalMR:102-assertVectorsWithOutlierRemoval:188-checkClustersWithOutlierRemoval:238-Assert.assertTrue:41-Assert.fail:88 not expecting cluster:0:{0:1.0,1:1.0} Here's the

Re: [jira] [Updated] (MAHOUT-1410) clusteredPoints do not contain a vector id

2014-01-25 Thread Andrew Musselman
Still trying to understand what these tests are doing, but that is only blowing up when that is called during testVectorClassificationWithOutlierRemoval*MR*. Runs fine during testVectorClassificationWithOutlierRemoval. On Sat, Jan 25, 2014 at 9:14 AM, Andrew Musselman andrew.mussel

Re: [jira] [Updated] (MAHOUT-1410) clusteredPoints do not contain a vector id

2014-01-25 Thread Andrew Musselman
The vectors are having keys added during the MR version, which the reference in the test doesn't expect. See attached screenshots of variables during debugging. On Sat, Jan 25, 2014 at 9:29 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Still trying to understand what these tests

Re: [jira] [Updated] (MAHOUT-1410) clusteredPoints do not contain a vector id

2014-01-25 Thread Andrew Musselman
Further, the MR version has NamedVectors but the non-MR version has RandomAccessSparseVectors. On Sat, Jan 25, 2014 at 9:43 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: The vectors are having keys added during the MR version, which the reference in the test doesn't expect. See

Re: [jira] [Updated] (MAHOUT-1410) clusteredPoints do not contain a vector id

2014-01-25 Thread Andrew Musselman
Not happy with it but tests are passing now. On Sat, Jan 25, 2014 at 10:08 AM, Andrew Musselman (JIRA) j...@apache.orgwrote: [ https://issues.apache.org/jira/browse/MAHOUT-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel] Andrew Musselman updated MAHOUT-1410

Test failure in TDigestTest

2014-01-28 Thread Andrew Musselman
Got this error running tests; anyone know what causes this? Tests run: 9, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 40.063 sec FAILURE! - in org.apache.mahout.math.stats.TDigestTest testSequentialPoints(org.apache.mahout.math.stats.TDigestTest) Time elapsed: 4.674 sec FAILURE!

Re: Mahout 0.9 Release

2014-01-28 Thread Andrew Musselman
Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From:

Re: releasing without 3 PMC votes

2014-01-31 Thread Andrew Musselman
It took a while but we got three votes, or am I misunderstanding? On Jan 31, 2014, at 6:42 PM, Ted Dunning ted.dunn...@gmail.com wrote: I think that We should not be releasing without following the Apache release guidelines. In particular, 3 votes from the PMC are absolutely required to

Re: Mahout 0.9 Release

2014-02-02 Thread Andrew Musselman
On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue

Re: releasing without 3 PMC votes

2014-02-07 Thread Andrew Musselman
As I understand, there were enough votes though it did take a while to find them. Could be the extended voting period and the roll-backs obscured that fact.. On Feb 7, 2014, at 9:48 PM, Ted Dunning ted.dunn...@gmail.com wrote: On Fri, Feb 7, 2014 at 6:36 AM, Isabel Drost-Fromm

Re: Solr-recommender for Mahout 0.9

2014-02-22 Thread Andrew Musselman
*Pat*, I opened a ticket(M-1420) for putting a new script in examples/ that uses the solr-recommender. Seems there's another, related ticket from Suneel in M-1288. Did the work described in the thread below make it into 0.9, and/or how much more is needed on it? *Ted*, if you have any code you

Re: CosineDistanceMeasure for 2 zero vectors?

2013-04-04 Thread Andrew Musselman
I think it should return an undefined symbol. There is no angle between two zero vectors. In a practical sense, taking two zero vectors to be equivalent in the context of user-item vectors, say, is dodgy in my opinion. That is akin to saying If we both hate everything on this restaurant's menu

Re: CosineDistanceMeasure for 2 zero vectors?

2013-04-04 Thread Andrew Musselman
4, 2013 at 11:40 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: I think it should return an undefined symbol. There is no angle between two zero vectors. In a practical sense, taking two zero vectors to be equivalent in the context of user-item vectors, say, is dodgy in my

Re: CosineDistanceMeasure for 2 zero vectors?

2013-04-04 Thread Andrew Musselman
think that in our recommender code, 0 should mean no rating or no interaction observed. I think modeling dislike with 0 creates lot of unnecessary problems. On 04.04.2013 22:56, Andrew Musselman wrote: I see the arguments for having it defined, just raising the point that it's a very strange

Re: CosineDistanceMeasure for 2 zero vectors?

2013-04-04 Thread Andrew Musselman
as dislike. Think of all movies you never watched, do you really dislike them all? :) On 04.04.2013 23:03, Andrew Musselman wrote: I agree; I mis-spoke before if I said dislike. Zero to me means literally nothing. No interaction. Which could be either don't like, don't

Re: Call to action – Mahout needs your help

2013-04-04 Thread Andrew Musselman
In case this thread is still a good place to reply with an offer to help, I'd love to pitch in. I have built a few production recommenders, most recently using Mahout at a large retailer along with my partner where we used ALS, with a pipeline of transforming transactions in XML into vectors

Re: Call to action – Mahout needs your help

2013-04-05 Thread Andrew Musselman
jira issues or propose some issue to work on yourself. If you need some input, it would be awesome to enhance our ALS recommenders with cross-validation and tooling for finding a good regularization parameter. On 05.04.2013 01:56, Andrew Musselman wrote: In case this thread is still a good

Re: Call to action – Mahout needs your help

2013-04-05 Thread Andrew Musselman
Is there any interest in an IRC bug bash to groom and prioritize? Trying to get a hand-hold on where to start. On Fri, Apr 5, 2013 at 9:49 AM, Andrew Musselman andrew.mussel...@gmail.com wrote: Thanks guys! I'll poke around your jira and start that way. Looking forward to it. Best

Re: Code reviews and reviewers

2013-04-09 Thread Andrew Musselman
+1 for code reviews +1 for Review Board +1 for unit tests and integration tests On Tue, Apr 9, 2013 at 5:03 AM, Dan Filimon dangeorge.fili...@gmail.comwrote: Thanks Sebastian! But let's talk about policy. Wouldn't everyone agree that more code reviews are a good thing? Ideally, everyone

Cannot resolve symbol 'OpenIntObjectHashMap'

2013-04-16 Thread Andrew Musselman
Guys, using release 0.7 I have this complaint in my IDE(IntelliJ) for org.apache.mahout.cf.taste.hadoop.als.FactorizationEvaluator and org.apache.mahout.cf.taste.hadoop.als.ParallelALSFactorizationJob: Here's the offending line for both classes: import

Re: Cannot resolve symbol 'OpenIntObjectHashMap'

2013-04-16 Thread Andrew Musselman
Looks like OpenIntObjectHashMap is not in the release. Is this patched somewhere? On Tue, Apr 16, 2013 at 12:26 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Guys, using release 0.7 I have this complaint in my IDE(IntelliJ

Re: Cannot resolve symbol 'OpenIntObjectHashMap'

2013-04-16 Thread Andrew Musselman
Ah it is in generated-sources. So any class in there will be unavailable to people's IDEs. Any interest in fixing this? On Tue, Apr 16, 2013 at 12:47 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Looks like OpenIntObjectHashMap is not in the release. Is this patched somewhere

Re: Cannot resolve symbol 'OpenIntObjectHashMap'

2013-04-16 Thread Andrew Musselman
Nevermind I see this is in the docs On Tue, Apr 16, 2013 at 1:00 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Ah it is in generated-sources. So any class in there will be unavailable to people's IDEs. Any interest in fixing this? On Tue, Apr 16, 2013 at 12:47 PM, Andrew

Re: Cannot resolve symbol 'OpenIntObjectHashMap'

2013-04-16 Thread Andrew Musselman
recent I have seen is 12.1.1. On Tue, Apr 16, 2013 at 8:00 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Ah it is in generated-sources. So any class in there will be unavailable to people's IDEs. Any interest in fixing this? On Tue, Apr 16, 2013 at 12:47 PM, Andrew

Re: Cannot resolve symbol 'OpenIntObjectHashMap'

2013-04-16 Thread Andrew Musselman
On Tue, Apr 16, 2013 at 3:14 PM, Ted Dunning ted.dunn...@gmail.com wrote: Cool, thanks; I fixed it by adding the generated-sources/org directory to the source path.. That isn't a good fix. For one thing, you may have a similar problem with tests. And if we add protobuf sorts of

Re: Mahout 1.0 goals

2014-02-27 Thread Andrew Musselman
Thanks for starting the conversation, Ted. I'm relatively new to the project though I've been using Mahout for a couple years in production, and am happy to see things move forward in whatever way makes sense. I think Mahout needs to ship a production-ready version if it's going to be called

Re: Mahout 1.0 goals

2014-02-27 Thread Andrew Musselman
I agree with b) and c); haven't used seq2sparse enough to grok a). On Thu, Feb 27, 2014 at 6:30 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: With the announcement of http://deeplearning4j.org yesterday which is various Neural Networks implementations on Hadoop 2/JBlas that had been

Re: Mahout 1.0 goals

2014-02-28 Thread Andrew Musselman
To be constructive, here are four items that seem more important for something like 1.0.0 and are even a lot less work: - Use Hadoop .mapreduce API consistently - Standardize input output formats of all jobs - Remove use of deprecated code - Clear even a third of the open JIRA

Re: Mahout 1.0 goals

2014-02-28 Thread Andrew Musselman
, Andrew Musselman andrew.mussel...@gmail.com wrote: To be constructive, here are four items that seem more important for something like 1.0.0 and are even a lot less work: - Use Hadoop .mapreduce API consistently - Standardize input output formats of all jobs - Remove use

Re: Mahout 1.0 goals

2014-03-01 Thread Andrew Musselman
Great step, thanks Frank On Mar 1, 2014, at 10:29 AM, Frank Scholten fr...@frankscholten.nl wrote: I got inspired by the discussion so I took a first step in reducing Hadoop dependencies in the naive bayes code. See my Github branch:

Re: Mahout 1.0 goals

2014-03-03 Thread Andrew Musselman
How about reviving/advertising an IRC channel so people could hop on whenever they're free, see if that gains any momentum. On Mon, Mar 3, 2014 at 10:38 AM, Ted Dunning ted.dunn...@gmail.com wrote: We can have more than one hangout to cover multiple time zones/work requirements. Each meeting

Re: Mahout 1.0 goals

2014-03-03 Thread Andrew Musselman
iPhone On Mar 3, 2014, at 1:46 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: How about reviving/advertising an IRC channel so people could hop on whenever they're free, see if that gains any momentum. On Mon, Mar 3, 2014 at 10:38 AM, Ted Dunning ted.dunn...@gmail.com wrote

Re: Mahout 1.0 goals

2014-03-04 Thread Andrew Musselman
task than anything we need to do (like the recent setting up of svnpubsub for future releases). I can create an INFRA jira and wait for INFRA to take respond. On Tuesday, March 4, 2014 6:03 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: One of my big wishlist items is to move

Re: Welcome Andrew Musselman as new comitter

2014-03-07 Thread Andrew Musselman
, this is to announce that the Project Management Committee (PMC) for Apache Mahout has asked Andrew Musselman to become committer and we are pleased to announce that he has accepted. Being a committer enables easier contribution to the project since in addition to posting patches on JIRA it also gives

Re: Mahout 1.0 goals

2014-03-07 Thread Andrew Musselman
of an INFRA task than anything we need to do (like the recent setting up of svnpubsub for future releases). I can create an INFRA jira and wait for INFRA to take respond. On Tuesday, March 4, 2014 6:03 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: One of my big wishlist items

Re: Light weight process for Examples contributions

2014-03-08 Thread Andrew Musselman
...@yahoo.com, Andrew Musselman andrew.mussel...@gmail.com Subject: Fwd: Light weight process for Examples contributions I’ve re-aplied for dev@, confirmed my application, and supposedly been accepted. But as of this morning seem still unable to send to the list. For the time being can one of you

Re: Mahout 1.0 goals

2014-03-08 Thread Andrew Musselman
Me too. To answer the question: Ask yourself this: Is Mahout a sandbox for experimentation on cutting edge algorithms or is Mahout a scalable, performant ML library that is targeted for production environments? I think it is important to clean up a lot of wiring and user experience issues

Re: website

2014-03-11 Thread Andrew Musselman
Yeah, looks good. On Tue, Mar 11, 2014 at 12:38 AM, Ted Dunning ted.dunn...@gmail.com wrote: Looks like math to me. Nice. On Mon, Mar 10, 2014 at 3:54 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: Well -- i kinda succeeded following this recipe [1] and got [2] look almost as

Re: 0xdata interested in contributing

2014-03-12 Thread Andrew Musselman
Sounds like a large positive step; looking forward to hearing more! On Mar 12, 2014, at 5:44 PM, Ted Dunning ted.dunn...@gmail.com wrote: I have been working with a company named 0xdata to help them contribute some new software to Mahout. This software will give Mahout the ability to do

Lucene issue in recommenditembased example

2014-03-13 Thread Andrew Musselman
I'm getting this error repeated for several attempts in the last phase of the recommenditembased example on EMR with the default AMI and Hadoop version and a fresh Mahout 0.9 non-source tarball: 14/03/13 23:22:58 INFO mapred.JobClient: Task Id : attempt_201403132009_0009_m_01_2, Status :

Re: Lucene issue in recommenditembased example

2014-03-13 Thread Andrew Musselman
ms (Minutes: 18.7459) On Thu, Mar 13, 2014 at 5:22 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Could u print the complete stacktrace? On Thursday, March 13, 2014 7:31 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: I'm getting this error repeated for several attempts

Re: Lucene issue in recommenditembased example

2014-03-13 Thread Andrew Musselman
1.0.3 distro and is not compatible with lucene 4.x. On Thursday, March 13, 2014 8:28 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: It's with Hadoop 1.0.3 btw $ ./mahout-distribution-0.9/bin/mahout recommenditembased --input ratings.csv --output recommendations

Re: 0xdata interested in contributing

2014-03-13 Thread Andrew Musselman
Thanks Sri; nice to meet you and thanks for the conversation. When you say hello world I presume you're emphasizing that Mahout is a popular entry point for people seeking to join the field, rather than its being simple or easy to pick up. We've been talking about ways to make Mahout easier to

Re: 0xdata interested in contributing

2014-03-13 Thread Andrew Musselman
PM, Andrew Musselman andrew.mussel...@gmail.com wrote: Thanks Sri; nice to meet you and thanks for the conversation. When you say hello world I presume you're emphasizing that Mahout is a popular entry point for people seeking to join the field, rather than its being simple or easy

  1   2   3   4   5   6   7   8   9   10   >