Re: Quickstart for kMeans

2010-05-03 Thread Grant Ingersoll
Sisir, Thanks for the script. I think it would be great to open a JIRA issue for this and we can check in the shell script under the examples. I think LDA also has similar tools to download Reuters, we should try to reuse if possible. On May 2, 2010, at 3:42 PM, Sisir Koppaka wrote: For

Re: Intermittant Test Failure: testTranspose(org.apache.mahout.math.hadoop.TestDistributedRowMatrix)

2010-04-29 Thread Grant Ingersoll
On Apr 29, 2010, at 6:36 PM, Jeff Eastman wrote: right at the end of the 15 min core tests which makes it especially annoying. Lucene just put in parallel JUnit tests and they've gotten a lot faster.

[jira] Created: (MAHOUT-388) Upgrade Lucene

2010-04-28 Thread Grant Ingersoll (JIRA)
Upgrade Lucene -- Key: MAHOUT-388 URL: https://issues.apache.org/jira/browse/MAHOUT-388 Project: Mahout Issue Type: Improvement Reporter: Grant Ingersoll Priority: Minor Upgrade Lucene version used

[GSOC] Congrats to all students

2010-04-26 Thread Grant Ingersoll
Looks like student GSOC announcements are up (http://socghop.appspot.com/gsoc/program/list_projects/google/gsoc2010). Mahout got quite a few projects (5) accepted this year, which is a true credit to the ASF, Mahout, the mentors, and most of all the students! We had a good number of very

Re: Mahout TLP to-do list

2010-04-22 Thread Grant Ingersoll
On Apr 21, 2010, at 9:05 PM, Drew Farris wrote: Probably worth starting another thread for this. Mahout TLP to-do's: Website design: Robin SVN: Grant will take care can take care of the move when we are ready. Should be able to do this anytime. Here's what I'll do: Move SVN Leave a

Re: Mahout TLP to-do list

2010-04-22 Thread Grant Ingersoll
can likely add more admins as needed. Drew On Thu, Apr 22, 2010 at 10:22 AM, Grant Ingersoll gsing...@apache.org wrote: Drew, Please let the list know what INFRA issues are open so that we can track them. Thanks, Grant

Re: Status of Mahout TLP

2010-04-21 Thread Grant Ingersoll
go in at the same level. On Mon, Apr 12, 2010 at 7:34 PM, Grant Ingersoll gsing...@apache.org wrote: Yep. Meeting is on the 21st. I will be attending and letting y'all know what happens (I can't imagine it fails). From the sounds of it, a good chunk of subprojects will be splitting from

Re: Status of Mahout TLP

2010-04-21 Thread Grant Ingersoll
...@gmail.com wrote: I can help out in the redesign. Is there a CMS approved by apache security, something which will get patched automatically? Robin On Wed, Apr 21, 2010 at 3:34 PM, Grant Ingersoll gsing...@apache.org wrote: On Apr 21, 2010, at 5:28 AM, Robin Anil wrote: Today

TLP Status

2010-04-21 Thread Grant Ingersoll
The Board has approved Mahout, Tika, and Nutch moving to be top level status. Congrats! Now begins the fun part of changing mailing lists, domains, etc. -Grant

Re: maven/build.xml

2010-04-19 Thread Grant Ingersoll
On Apr 19, 2010, at 5:25 PM, Benson Margulies wrote: Grant, it looks to me like the job construction scripting in maven/build.xml would look neater as an application of the maven-shade-plugin. Do you (or anyone else) have an opinion about this, before I try to cook up a patch? I have no

Re: mahout/solr integration

2010-04-16 Thread Grant Ingersoll
On Apr 16, 2010, at 2:21 PM, Jake Mannix wrote: So here's my take: once we're a TLP (next month sometime?), it is a good time to start allowing subprojects or submodules which are Submodules, yes, subprojects, not so much, unless the committers are the same. We can definitely release

Re: mahout/solr integration

2010-04-16 Thread Grant Ingersoll
On Apr 16, 2010, at 3:11 PM, Jake Mannix wrote: On Fri, Apr 16, 2010 at 11:56 AM, Sean Owen sro...@gmail.com wrote: On Fri, Apr 16, 2010 at 7:39 PM, Jake Mannix jake.man...@gmail.com wrote: I will start playing around with Anthony's github-based stuff, and see where a patch can be made.

[jira] Commented: (MAHOUT-379) SequentialAccessSparseVector.equals does not agree with AbstractVector.equivalent

2010-04-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12856835#action_12856835 ] Grant Ingersoll commented on MAHOUT-379: I think we probably should have

Re: VOTE: take 2: mahout-collections-1.0

2010-04-13 Thread Grant Ingersoll
+1 On Apr 11, 2010, at 9:49 PM, Benson Margulies wrote: https://repository.apache.org/content/repositories/orgapachemahout-015/ contains (this time for sure) all the artifacts for release 1.0 of the mahout-collections component. This is the first independent release of collections from the

Re: VOTE: take 2: mahout-collections-1.0

2010-04-12 Thread Grant Ingersoll
On Apr 11, 2010, at 11:50 PM, Ted Dunning wrote: +1 (on trust, really) Not to pick on Ted here, but this is not something you should do. If you don't have time to download and verify the artifacts, please don't vote, as the vote is meaningless at best. No vote is better than an unverified

Re: Transition to mahout-collections as an independent component

2010-04-12 Thread Grant Ingersoll
On Apr 11, 2010, at 9:53 PM, Benson Margulies wrote: You can get dependency by running 'mvn' in the collections directory, which will build a snapshot and put it in your local repository, or by adding the Apache snapshot repository to your environment (if it's not there already). Does mvn

Re: Digest for google-summer-of-code-mentors-l...@googlegroups.com - 25 Messages in 2 Topics

2010-04-12 Thread Grant Ingersoll
On Apr 10, 2010, at 5:26 PM, Sean Owen wrote: +mahout-dev I think at this point I could be misremembering (there's that word again Grant) but are we not supposed to sign on to mentor more than 1 person without having talked it over on code-awards? Seems like a lot of grumbling about

Re: Digest for google-summer-of-code-mentors-l...@googlegroups.com - 25 Messages in 2 Topics

2010-04-12 Thread Grant Ingersoll
On Apr 12, 2010, at 10:01 AM, Benson Margulies wrote: Robin asked me to pay attention to this mentoring question. I have assumed that, to be an effective mentor here, I'd have to be capable of at least keeping up with the mentee on the math behind whatever algorithm is in play. Thus, absent

Re: VOTE: take 2: mahout-collections-1.0

2010-04-12 Thread Grant Ingersoll
On Apr 12, 2010, at 1:01 PM, Ted Dunning wrote: On Mon, Apr 12, 2010 at 5:23 AM, Grant Ingersoll gsing...@apache.orgwrote: On Apr 11, 2010, at 11:50 PM, Ted Dunning wrote: +1 (on trust, really) Not to pick on Ted here, but this is not something you should do. If you don't have time

Re: Status of Mahout TLP

2010-04-12 Thread Grant Ingersoll
Yep. Meeting is on the 21st. I will be attending and letting y'all know what happens (I can't imagine it fails). From the sounds of it, a good chunk of subprojects will be splitting from Lucene. Also, we should potentially start thinking about a Press Release to go with two things: 1.

Re: Dangling collections in front of commons

2010-04-03 Thread Grant Ingersoll
On Apr 3, 2010, at 5:17 AM, Sean Owen wrote: I'm neutral... maybe let it marinate longer in Mahout, prove it's used and worthwhile and such? Yeah, I'd tend to agree here. Let's see if we get some contributions on it and how it plays out for us. I think the question will be, well,

[GSOC] 2010 Timelines

2010-04-03 Thread Grant Ingersoll
http://socghop.appspot.com/document/show/gsoc_program/google/gsoc2010/faqs#timeline

Re: Proposal: make collections releases independent of the rest of Mahout

2010-04-03 Thread Grant Ingersoll
On Apr 3, 2010, at 2:22 PM, Benson Margulies wrote: On Sat, Apr 3, 2010 at 2:07 PM, Sean Owen sro...@gmail.com wrote: Actually it seems like this a valid subproject of a Mahout TLP in its own right, if that would be a useful middle-ground status. I'm not trying to suggest anything

Mahout TLP and Website

2010-04-02 Thread Grant Ingersoll
Might we consider a fresher look for the Mahout website now that we have a new logo and are going TLP? I like the http://ofbiz.apache.org/ site, but anything that matched up with the LF of our logo would be great. -Grant

Re: [GSOC] Wiki Page Added

2010-03-31 Thread Grant Ingersoll
D'oh! My bad: http://cwiki.apache.org/MAHOUT/gsoc.html. It's linked from the front wiki page under community. -Grant On Mar 31, 2010, at 9:11 AM, zhao zhendong wrote: Hi Grant, Could you please give us the link of this page? Cheers, Zhendong On Wed, Mar 31, 2010 at 8:53 PM, Grant

Re: Application for GSOC 2010

2010-03-31 Thread Grant Ingersoll
On Mar 31, 2010, at 1:52 PM, Ted Dunning wrote: File a JIRA issue with a detailed proposal of your project. The community will help work out details for your proposal and it will eventually be rated and possibly selected. Note, you also need to put your issue into the GSOC application. I

Re: Javadocs?

2010-03-30 Thread Grant Ingersoll
available. On Tue, Mar 30, 2010 at 11:25 AM, Grant Ingersoll gsing...@apache.org wrote: I think I just dumped them in that directory. If we want, we can keep move aside the old ones and update the website to refer to each version.

Re: Javadocs?

2010-03-30 Thread Grant Ingersoll
On Mar 30, 2010, at 10:14 AM, Jake Mannix wrote: Awesome, thanks guys. Doesn't Maven do this kind of thing for us, if we tell it to? (ie can't we also have daily updates of the 0.4-SNAPSHOT javadocs automagically posted up there too?) Yeah, we could. We do have to be careful about

Re: Interesting at GSoC: Mahout-343

2010-03-29 Thread Grant Ingersoll
(which are fine to have, just be up front about them) and that you are not taking on too much. Wait for your reply humbly. Best wishes. On Sun, Mar 28, 2010 at 8:00 PM, Grant Ingersoll gsing...@apache.orgwrote: On Mar 28, 2010, at 12:52 AM, 杨杰 wrote: Dear Mahout Developers

Re: not a lot of mentors for GSoC

2010-03-29 Thread Grant Ingersoll
Mentoring sign up is on the GSOC site. You need to be a committer to be a mentor, at least for the ASF anyway. On Mar 29, 2010, at 3:11 AM, Pradeep Pujari wrote: Where to sign up for mentoring? Pradeep. On Sun, Mar 28, 2010 at 12:31 AM, Sean Owen sro...@gmail.com wrote: I've not been

Re: [jira] Created: (MAHOUT-345) [GSOC] integrate Mahout with Drupal/PHP

2010-03-29 Thread Grant Ingersoll
On Mar 29, 2010, at 7:13 AM, David Stuart wrote: Re this proposal if I am going to help out with mentoring on the Drupal side of things should I sign up to be a mentor with Drupal or Mahout? Anybody know how cross project stuff works? Not sure, we should probably ask on the GSOC list.

Re: Interesting at GSoC: Mahout-343

2010-03-28 Thread Grant Ingersoll
EXT. 608 MSN: xtyangjie2...@yahoo.com.cn once i didn't know software is not free; then i knew it days later; now i find it indeed free. -- Grant Ingersoll http://www.lucidimagination.com/ Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com

Fwd: Student apps start on March 29 19:00 UTC

2010-03-27 Thread Grant Ingersoll
FYI Begin forwarded message: From: Ross Gardler rgard...@apache.org Date: March 27, 2010 6:04:32 AM EDT To: d...@community.apache.org Subject: Student apps start on March 29 19:00 UTC Reply-To: d...@community.apache.org Admins: Please note that the student applications start at March

Re: [VOTE] Mahout as TLP

2010-03-22 Thread Grant Ingersoll
This vote has passed: +1s: 11 (all but one are binding) +0: 1 I'll submit to the PMC. On Mar 19, 2010, at 10:50 AM, Grant Ingersoll wrote: Per the earlier discussions, I'm calling a vote to submit the following resolution [1] to the Lucene PMC for consideration to then promote Mahout

Re: [jira] Created: (MAHOUT-345) [GSOC] integrate Mahout with Drupal/PHP

2010-03-22 Thread Grant Ingersoll
Taste already exposes a web service layer, so I'm not sure how much more there is on the Mahout end for recommenders. Still, would be great to see and I'm sure it would help iron out API issues, etc. These are definitely the kinds of things I'd love to see. On Mar 22, 2010, at 5:56 PM, Ted

Re: The new improved command-line: MahoutDriver (get it?)

2010-03-19 Thread Grant Ingersoll
. -jake -- Grant Ingersoll http://www.lucidimagination.com/ Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com/search

[jira] Created: (MAHOUT-342) [GSOC] Implement Map/Reduce Enabled Neural Networks

2010-03-19 Thread Grant Ingersoll (JIRA)
: Grant Ingersoll Per the Ng. et. al. paper, implement a neural network with back propagation. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.

[jira] Created: (MAHOUT-343) [GSOC] Implement Integration of Mahout Clustering or Classification with Apache Solr

2010-03-19 Thread Grant Ingersoll (JIRA)
: Mahout Issue Type: New Feature Reporter: Grant Ingersoll There are many places where Mahout and Lucene/Solr can be hooked together. For instance, automatically classifying documents to be indexed and adding fields contain said classification. Another one is to write

[VOTE] Mahout as TLP

2010-03-19 Thread Grant Ingersoll
Farris (d...@...) • Grant Ingersoll (gsing...@...) • Benson Margulies (bimargul...@...) • Sean Owen (sro...@...) • Robin Anil (robina...@...) • Jake Mannix (jman...@...) RESOLVED, that the Apache Mahout Project be and hereby is tasked with the migration

Re: [VOTE] Mahout as TLP

2010-03-19 Thread Grant Ingersoll
Here's my vote. On Mar 19, 2010, at 10:50 AM, Grant Ingersoll wrote: Per the earlier discussions, I'm calling a vote to submit the following resolution [1] to the Lucene PMC for consideration to then promote Mahout to be a TLP. [x] +1 I'm for Mahout being a TLP and the resolution below.

Re: [VOTE] Mahout as TLP

2010-03-19 Thread Grant Ingersoll
Note, this is really a committer vote to ask the Lucene PMC to discharge Mahout, so I'd consider all committer votes to be binding. -Grant On Mar 19, 2010, at 11:05 AM, Benson Margulies wrote: My nonbinding vote is +1. On Fri, Mar 19, 2010 at 10:50 AM, Grant Ingersoll gsing

Re: [VOTE] Mahout as TLP

2010-03-19 Thread Grant Ingersoll
On Mar 19, 2010, at 10:50 AM, Grant Ingersoll wrote: Per the earlier discussions, I'm calling a vote to submit the following resolution [1] to the Lucene PMC for consideration to then promote Mahout to be a TLP. [] +1 I'm for Mahout being a TLP and the resolution below. Jeff Eastman

Re: [VOTE] Mahout as TLP

2010-03-19 Thread Grant Ingersoll
and the resolution below. On Fri, Mar 19, 2010 at 10:50 AM, Grant Ingersoll gsing...@apache.org wrote: [1] X. Establish the Apache Mahout Project WHEREAS, the Board of Directors deems it to be in the best interests of the Foundation and consistent with the Foundation's purpose to establish

Re: [DISCUSS] Mahout TLP Board Resolution

2010-03-18 Thread Grant Ingersoll
On Mar 18, 2010, at 1:34 PM, deneche abdelhakim wrote: should be Abdelhakim Deneche ... cause my first name is 'Abdelhakim Sorry, was going by your email from tag. Will correct. On Thu, Mar 18, 2010 at 1:07 PM, Grant Ingersoll gsing...@apache.org wrote: So here's the update: X

Re: [DISCUSS] Mahout TLP Board Resolution

2010-03-18 Thread Grant Ingersoll
(d...@...) • Grant Ingersoll (gsing...@...) • Benson Margulies (bimargul...@...) • Sean Owen (sro...@...) • Robin Anil (robina...@...) • Jake Mannix (jman...@...) RESOLVED, that the Apache Mahout Project be and hereby is tasked with the migration

Re: [DISCUSS] Mahout TLP Board Resolution

2010-03-17 Thread Grant Ingersoll
are appointed to serve as the initial members of the Apache Mahout Project: • Isabel Drost (isa...@...) • Ted Dunning (tdunn...@...) • Jeff Eastman (jeast...@...) • Drew Farris (d...@...) • Grant Ingersoll (gsing...@...) • Sean Owen (sro

Re: Can someone please mark 0.3 release in JIRA?

2010-03-17 Thread Grant Ingersoll
which are down). I will go and commit the site changes. Anyone willing to write the Release announcement for 0.3? Robin On Wed, Mar 17, 2010 at 3:43 AM, Grant Ingersoll gsing...@apache.orgwrote: It usually takes 24 hours. Just follow the release dirs and we'll be good. Tomorrow

Re: Collocations docs

2010-03-17 Thread Grant Ingersoll
BTW, is it colocation, collocation, or co-location? On Mar 16, 2010, at 8:48 AM, Drew Farris wrote: I added rough beginnings of some documentation last night, will continue to flesh it out tonight. Feedback welcome. On Mon, Mar 15, 2010 at 12:15 PM, Grant Ingersoll gsing...@apache.org

Re: Collocations docs

2010-03-17 Thread Grant Ingersoll
Answering my own question: http://en.wikipedia.org/wiki/Collocation On Mar 17, 2010, at 11:26 AM, Grant Ingersoll wrote: BTW, is it colocation, collocation, or co-location? On Mar 16, 2010, at 8:48 AM, Drew Farris wrote: I added rough beginnings of some documentation last night

Re: [DISCUSS] Mahout TLP Board Resolution

2010-03-17 Thread Grant Ingersoll
On Mar 17, 2010, at 9:43 AM, Grant Ingersoll wrote: Formalizing a bit more and updating the resolution based on the opt-in emails on this thread and the apparent obvious choice of Sean as Chair. Still haven't heard from Karl, Otis, Deneche and Isabel (although I did IM here and she

JIRA Admins

2010-03-16 Thread Grant Ingersoll
Hi Committers, If you wish to be a JIRA admin for Mahout (and who doesn't) please post your JIRA id (many people seem to have multiple ids). Here is the current list of Admins: Benson Margulies (bmargulies) Grant Ingersoll (gsingers) Isabel Drost (isabel) Jake Mannix (jake.mannix

Re: Can someone please mark 0.3 release in JIRA?

2010-03-16 Thread Grant Ingersoll
You should be an admin now. On Mar 16, 2010, at 11:30 AM, Benson Margulies wrote: I assume that I don't have admin karma on the JIRA project.

Re: Can someone please mark 0.3 release in JIRA?

2010-03-16 Thread Grant Ingersoll
On Mar 16, 2010, at 11:43 AM, Benson Margulies wrote: Hmm. Not a bad idea. I think of maven as done, but that's not right. I'll deal with this in the evening. There are quite a few steps after Maven is done on the Release page on the wiki. On Tue, Mar 16, 2010 at 11:33 AM, Robin Anil

Re: Can someone please mark 0.3 release in JIRA?

2010-03-16 Thread Grant Ingersoll
That's all on people.a.o under/www/lucene.apache.org/mahout Just follow the docs on http://cwiki.apache.org/MAHOUT/how-to-release.html and everything will work. On Mar 16, 2010, at 12:50 PM, Robin Anil wrote: http://www.apache.org/dev/mirror-step-by-step.html?Step-By-Step it says you

Re: Can someone please mark 0.3 release in JIRA?

2010-03-16 Thread Grant Ingersoll
think its because I am not in the lucene group. Robin On Tue, Mar 16, 2010 at 10:34 PM, Grant Ingersoll gsing...@apache.orgwrote: That's all on people.a.o under/www/lucene.apache.org/mahout Just follow the docs on http://cwiki.apache.org/MAHOUT/how-to-release.html and everything will work

Re: Can someone please mark 0.3 release in JIRA?

2010-03-16 Thread Grant Ingersoll
It usually takes 24 hours. Just follow the release dirs and we'll be good. Tomorrow is a great day for a Mahout announcement! Maybe we can change the logo to be green for tomorrow. On Mar 16, 2010, at 5:59 PM, Robin Anil wrote: http://www.apache.org/mirrors/ I see 0.3 folder in most of

Re: [VOTE]: release Mahout 0.3 (resend, I forgot gene...@lucene.apache.org)

2010-03-15 Thread Grant Ingersoll
On Mar 15, 2010, at 10:17 AM, Sean Owen wrote: Yes feel free to modify the list. Yes you're right actually, the co-occurrence recommender wasn't in 0.2, I misremembered. :-) W. is back. Do we have writeup of this somewhere on the Wiki? It should be on the list. We might be careful about

[DISCUSS] Mahout TLP Board Resolution

2010-03-15 Thread Grant Ingersoll
...@...) • Jeff Eastman (jeast...@...) • Drew Farris (d...@...) • Otis Gospodnetic (o...@...) • Grant Ingersoll (gsing...@...) • Sean Owen (sro...@...) • Karl Wettin (ka...@...) • AbdelHakim Deneche (adene...@...) • David Hall (d

Re: A mahout logo Revamp

2010-03-15 Thread Grant Ingersoll
Can you make sure to add scalable vector versions up on the Wiki? Adobe Illustrator and/or SVG and some other versions would be great. I think Lukas did this last time. On Mar 14, 2010, at 10:27 PM, Robin Anil wrote: Here is the new mahout logo. Slightly higher contrast and sharper than the

Re: [VOTE]: release Mahout 0.3 (resend, I forgot gene...@lucene.apache.org)

2010-03-15 Thread Grant Ingersoll
We can push it before then, too. All my script does is: /usr/local/bin/svn export --force http://svn.apache.org/repos/asf/lucene/mahout/site/publish /www/lucene.apache.org/mahout/ This is generally what I do when we do a release. Note, you have to wait 24 hours after pushing the button to

Re: [DISCUSS] Mahout TLP Board Resolution

2010-03-15 Thread Grant Ingersoll
On Mar 15, 2010, at 1:27 PM, deneche abdelhakim wrote: just to get it right: not being in the PMC doesn't mean I'm no more a committer, right ? Correct. You would still have the same commit privileges you do today and nothing would really change for you other than a new URL here or there.

Re: [VOTE]: release Mahout 0.3 (resend, I forgot gene...@lucene.apache.org)

2010-03-15 Thread Grant Ingersoll
Have you voted, Sean? On Mar 15, 2010, at 12:02 PM, Sean Owen wrote: I am on the Lucene PMC. On Mon, Mar 15, 2010 at 3:59 PM, Benson Margulies bimargul...@gmail.com wrote: Whoops, let's check this count. Are Drew and Sean Lucene PMC members? If not, we're a few bricks short of a load.

Re: [DISCUSS] Mahout TLP Board Resolution

2010-03-15 Thread Grant Ingersoll
On Mar 15, 2010, at 10:31 AM, Grant Ingersoll wrote: 2. Who is the Chair of the PMC? The Chair is a VP of the ASF and is responsible for reporting to the Board amongst other things (commit karma, handling software grants, monitoring incubation, etc. See http://apache.org/dev/pmc.html

Re: [VOTE RESULT] Mahout 0.3

2010-03-15 Thread Grant Ingersoll
A big thanks to all involved in this. It is a sure sign of a healthy and growing community when new people step in and step up and the load is distributed across many helping hands. Kudos to all, Grant On Mar 15, 2010, at 3:31 PM, Benson Margulies wrote: Mahout release 0.3 passed. Ted

[jira] Commented: (MAHOUT-335) Mahout Logo tweak

2010-03-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12845079#action_12845079 ] Grant Ingersoll commented on MAHOUT-335: Can we see #2 w/o the hair on the person

Re: Google Summer of Code: Bring out your projects

2010-03-12 Thread Grant Ingersoll
On Mar 12, 2010, at 1:22 AM, Robin Anil wrote: Shall I go and put some of the ideas up. I will do it as a whole for the project. Later we can re-assign things maybe ? How does that sound? Unlike other projects we cant really go an put a proposal like Implement back-propagation and expect a

Re: Release process status report

2010-03-12 Thread Grant Ingersoll
to the foundation in the event of some complaint. 2) Prepare the release. 3) Vote the release. Not all projects seem to make a habit of item #1, but unless someone is sure that I'm confused, it would be nice. -- Grant Ingersoll http

Re: [VOTE]: release Mahout 0.3 (resend, I forgot gene...@lucene.apache.org)

2010-03-12 Thread Grant Ingersoll
I'm almost identical: 10.6.2 java version 1.6.0_17 Java(TM) SE Runtime Environment (build 1.6.0_17-b04-248-10M3025) Java HotSpot(TM) 64-Bit Server VM (build 14.3-b01-101, mixed mode) mvn -version Apache Maven 2.2.1 (r801777; 2009-08-06 15:16:01-0400) Java version: 1.6.0_17 Java home:

Re: 0.3 release issues

2010-03-11 Thread Grant Ingersoll
the release process again. On Thu, Mar 11, 2010 at 3:20 AM, Drew Farris drew.far...@gmail.com wrote: Looks like the hadoop 0.20.2 issue is finally put to bed -- any other issues that we need to resolve before we proceed with a release? -- Grant Ingersoll http

Re: 0.3 release issues

2010-03-11 Thread Grant Ingersoll
...@gmail.com wrote: Ah just the mvn -Prelease,mahout_release release:prepare business On Thu, Mar 11, 2010 at 12:02 PM, Grant Ingersoll gsing...@apache.org wrote: What were you running when you hit this? -- Grant Ingersoll http://www.lucidimagination.com/ Search

Re: 0.3 release issues

2010-03-11 Thread Grant Ingersoll
Shall we assume by your vote call that this was fixed? On Mar 11, 2010, at 5:28 PM, Benson Margulies wrote: OK, I see something. I'm trying to use -Dmaven.test.skip=true to speed things up, but that stops the creation of the test jar. Unfortunately, since I get random test failures, this is a

Re: Hadoop

2010-03-09 Thread Grant Ingersoll
On Mar 9, 2010, at 7:13 AM, Benson Margulies wrote: Options: 1. Email to their private@ list. 2. Email to members@ seeking assistance. 3. Sleuthing their archive to see who was the release manager for this and try to persuade them. 4. Publish it ourselves. If their community doesn't

Re: Have Mahout applied GSOC 2010?

2010-03-09 Thread Grant Ingersoll
On Mar 9, 2010, at 12:27 PM, zhao zhendong wrote: Hi Robin Ted and Grant, Thank you very much. To Grant: One more thing, could you please tell us the link of archives you mentioned before? There's a bunch of 'em, but my personal fav. is http://search.lucidimagination.com ;-) Just

Re: Hadoop

2010-03-09 Thread Grant Ingersoll
Very cool. Given they use the repository.apache.org it shouldn't actually be that hard to promote it. -Grant On Mar 9, 2010, at 2:25 PM, Benson Margulies wrote: I just got email from Doug Cutting. He is going to take care of it for us. On Tue, Mar 9, 2010 at 9:17 AM, Grant Ingersoll gsing

Re: 0.3 release issues

2010-03-05 Thread Grant Ingersoll
Has anyone filed a JIRA with them to do so? The Extremely Esteemed PMC Chair (aka Paper Pusher Extraordinaire), Grant On Mar 5, 2010, at 7:28 AM, Benson Margulies wrote: Could I be stupid for a moment? Our fellow Apache project, Hadoop, makes releases but doesn't bother to stick them into

Re: Who owns mahout bucket on s3?

2010-02-27 Thread Grant Ingersoll
That probably is me. I'll give it to you for $20 credits. ;-) Just kidding. Something a little more meaningful is probably needed anyway, like wikipedia-mahout. On Feb 27, 2010, at 11:38 AM, Robin Anil wrote: Just curious. Was trying to put my wikipedia seqfiles for public consumption.

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-26 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12838834#action_12838834 ] Grant Ingersoll commented on MAHOUT-301: Just capturing something longer term here

Re: anybody want to set a record with Mahout?

2010-02-25 Thread Grant Ingersoll
a square matrix with 5 million rows and columns. Jake, your social graph should be much larger than that. -- Ted Dunning, CTO DeepDyve -- Grant Ingersoll http://www.lucidimagination.com/ Search the Lucene ecosystem using Solr/Lucene: http

Re: anybody want to set a record with Mahout?

2010-02-25 Thread Grant Ingersoll
On Feb 25, 2010, at 3:41 PM, Jake Mannix wrote: On Thu, Feb 25, 2010 at 12:38 PM, Robin Anil robin.a...@gmail.com wrote: Whats the largest dataset available? BixoLabs ? Wikipedia(5 Mil articles)... I dont know anything public that is that big 5 million articles, if you take all the

Re: 0.3 release issues

2010-02-23 Thread Grant Ingersoll
We can publish 0.20.2 on our site. It's pretty easy to do. On Feb 23, 2010, at 6:39 AM, Sean Owen wrote: OK first roadblock -- we can't depend on the Hadoop 0.20.2 snapshot. It might not be such a sin to depend on 0.20.1. I believe it will break the CF job in some instances, but, this is not

Re: 0.3 release issues

2010-02-23 Thread Grant Ingersoll
On Feb 23, 2010, at 8:47 AM, Drew Farris wrote: The other option would be to wait until a 0.20.2 release is available, which could be imminent. Last I saw on the list they were on rc4? This doesn't seem horribly bad. We should download and try the RC and provide feedback.

Re: 0.3 release issues

2010-02-23 Thread Grant Ingersoll
On Feb 23, 2010, at 9:18 AM, Sean Owen wrote: It does look imminent. As much as I don't like holding out longer, and indefinitely, for this release, somehow I'd also really like to link to the latest/greatest and official Hadoop release. Let's try to be good about sticking to the code

SVD for dummies

2010-02-23 Thread Grant Ingersoll
Hey Jake, Was just going to ask for more insight into SVD when lo and behold, I checked my commits mail and saw http://cwiki.apache.org/confluence/display/MAHOUT/DimensionalReduction. Very nice! Thank you! -Grant

Re: [off-topic] Maven and SCP deploy.

2010-02-23 Thread Grant Ingersoll
Hmm, what version are you on? I've done it successfully, but it usually requires some setup in your ~/.m2/settings.xml file to incorporate your public key, etc. I think Mahout has it configured. Check the How To Release page on the Wiki. On Feb 23, 2010, at 3:16 AM, Dawid Weiss wrote:

Re: 0.3 release issues

2010-02-23 Thread Grant Ingersoll
I assume he means mark 301 as fixed (when appropriate) and then open a new ticket for follow on work marked for 0.4. -Grant On Feb 23, 2010, at 1:03 PM, Jake Mannix wrote: What does this mean? You mean make a 301-continuation ticket for 0.4, and reschedule the original 301 for 0.3? I could

Re: Algorithm implementations in Pig

2010-02-22 Thread Grant Ingersoll
I'm all for Pig, especially once we are a TLP. I haven't had the proper time to review the PLSI implementation, but it looks useful. I agree on the other points, though, in that I think we it would be nice to have consistent formats based on Vector so that things can be more portable. On

Re: test output in mahout-utils?

2010-02-21 Thread Grant Ingersoll
They could go under target as well, as then they will get cleaned up. On Feb 20, 2010, at 1:39 PM, Robin Anil wrote: Many of the clustering and classification algorithms use these dirs for tests. Sean had suggest earlier we move away from them and use temp directories. Its not changed yet.

Re: command line interfaces

2010-02-21 Thread Grant Ingersoll
bin/mahout should take care of a lot of these. I can say I'm not a big fan of nameless parameters that rely on someone to know the exact order of the args. I much prefer --input, --output, etc. Still, just saying train or predict is pretty cool and would be easy enough to incorporate.

Re: New to Mahout - question about the failed test cases

2010-02-19 Thread Grant Ingersoll
Does Hadoop even run on Windows? Even in Cygwin? On Feb 18, 2010, at 8:20 PM, Anish Shah wrote: I have created https://issues.apache.org/jira/browse/MAHOUT-298 to track this. On Thu, Feb 18, 2010 at 6:59 PM, Ted Dunning ted.dunn...@gmail.com wrote: Darn. That uses up all of my ideas.

Welcome Drew Farris

2010-02-18 Thread Grant Ingersoll
On behalf of the Lucene PMC, I'm happy to announce Drew Farris as the newest member of the Mahout committer family. Drew has been contributing some really nice work to Mahout in recent months and I look forward to his continuing involvement with Mahout. Congrats, Drew! -Grant

Re: Profiling SequentialAccessSparseVector

2010-02-18 Thread Grant Ingersoll
, cluster *centers* should indeed be RandomAccess. JIRA / patch so we can see exactly what the change is? -jake -- Grant Ingersoll http://www.lucidimagination.com/ Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com/search

Re: Welcome Drew Farris

2010-02-18 Thread Grant Ingersoll
here! Robin On Fri, Feb 19, 2010 at 2:33 AM, Grant Ingersoll gsing...@apache.orgwrote: On behalf of the Lucene PMC, I'm happy to announce Drew Farris as the newest member of the Mahout committer family. Drew has been contributing some really nice work to Mahout in recent months and I look

Re: Welcome Drew Farris

2010-02-18 Thread Grant Ingersoll
On Feb 18, 2010, at 8:32 PM, Drew Farris wrote: There's lots more stuff I'd like to get in there, now I only need to figure how to squeeze 48 hours of consciousness into a day. I believe there is a compression algorithm for that.

Re: Mass Code Cleanup

2010-02-14 Thread Grant Ingersoll
On Feb 14, 2010, at 4:25 PM, Jake Mannix wrote: On Sun, Feb 14, 2010 at 12:16 PM, Benson Margulies bimargul...@gmail.comwrote: It seems to me that Robin's campaign is predicated on a prior decision as to what checkstyle rules we want to enforce. We can't have it both ways. if we want to

Taste demo not working

2010-02-14 Thread Grant Ingersoll
Anyone else getting the following when running the Taste demo per the website instructions: 2010-02-14 16:54:15.414:WARN::EXCEPTION java.lang.ClassNotFoundException: org.apache.axis.transport.http.AxisServlet at java.net.URLClassLoader$1.run(URLClassLoader.java:200) at

Re: Taste demo not working

2010-02-14 Thread Grant Ingersoll
This was broken by the removal of unused dependencies. Robin, can you please revert the removal of the unused dependencies patch. Either that or please run and verify every last thing that is in Mahout to make sure it still works. On Feb 14, 2010, at 4:57 PM, Grant Ingersoll wrote

Re: Mass Code Cleanup

2010-02-14 Thread Grant Ingersoll
If ever there were a case of http://en.wikipedia.org/wiki/Parkinson's_Law_of_Triviality, this is it. Committer time is a scarce resource. Unless it's automated, the code will always drift out of format. I'd rather be able to cleanly apply a patch than worry about a particular style being

Re: Taste demo not working

2010-02-14 Thread Grant Ingersoll
I committed it. In general, I think we should probably revert the dependency removal changes. I highly doubt Maven static analysis is going to be correct here. Dependencies are usually added because otherwise the thing won't run. For instance, some of the Hadoop dependencies that I think

Re: Mass Code Cleanup

2010-02-14 Thread Grant Ingersoll
of this message. I usually reformat the files affected when I commit. IntelliJ even has this built into it's commit capabilities and I'm all for it. On Sun, Feb 14, 2010 at 10:07 PM, Grant Ingersoll gsing...@apache.org wrote: If ever there were a case of http://en.wikipedia.org/wiki

Re: Taste demo not working

2010-02-14 Thread Grant Ingersoll
On Feb 14, 2010, at 5:31 PM, Sean Owen wrote: I think there are only a couple instances where we'd not know of a problem at compile time. I'd hate to never be able to purge defunct dependencies due to a lurking suspicion that 'something' won't work -- and there were clearly plenty. Probably

  1   2   3   4   5   6   7   8   9   10   >