Re: [GSOC] 2010 Timelines

2010-04-09 Thread Isabel Drost
Timeline including Apache internal deadlines: http://cwiki.apache.org/confluence/display/COMDEVxSITE/GSoC Mentors, please also click on the ranking link to the ranking explanation [1] for more information on how to rank student proposals. Isabel [1] http://cwiki.apache.org/confluence/display

Re: Javadocs?

2010-03-30 Thread Isabel Drost
On Tue Grant Ingersoll wrote: > We're probably to the point now that we could start doing a nightly > on Hudson if we aren't already. http://hudson.zones.apache.org/hudson/job/Mahout%20nightly/ ;) (At least this one tracks whether the project still builds and all unit tests pass.) The one for b

Re: not a lot of mentors for GSoC

2010-03-30 Thread Isabel Drost
On Mon Grant Ingersoll wrote: > Mentoring sign up is on the GSOC site. You need to be a committer to > be a mentor, at least for the ASF anyway. Please also identify yourself with your GsocLinkId at https://svn.apache.org/repos/private/committers/GsocLinkId.txt so Noirins knows who you are.

Re: Javadocs?

2010-03-30 Thread Isabel Drost
On Tue Jake Mannix wrote: > (ie can't we also have daily updates of the 0.4-SNAPSHOT javadocs > automagically posted up there too?) Yes - maven can do such a thing. I have configured a job on hudson to generate code reports for Mahout with maven - Javadocs are one part of these reports. Linked

Re: Javadocs?

2010-03-30 Thread Isabel Drost
On Tue Grant Ingersoll wrote: > If we want, we can keep move aside the old ones and update the > website to refer to each version. I think that would be great - now that we are slowly getting to a point where apis seem to stabilise at least a bit it would be great for users that don't upgrade to

Re: [jira] Created: (MAHOUT-345) [GSOC] integrate Mahout with Drupal/PHP

2010-03-23 Thread Isabel Drost
On Mon Ted Dunning wrote: > Still valuable, but it seems to me that the best mentors will be > Drupal developers rather than Mahout developers. I would guess for the student's project to be successful he will need support both, from Drupal people (for the integration side of the project) and from

Re: riffle ... small scale workflow manager

2010-03-23 Thread Isabel Drost
On Mon Ted Dunning wrote: > What do people think about this? Is it as useful as I think it is? > Did I not give enough information to even tell? The information you gave was enough to get at least myself interested in the topic. I think a decent workflow systems is what is still missing for Maho

Re: git or svn

2010-03-23 Thread Isabel Drost
On Mon Jake Mannix wrote: > The official apache repository (where the committers write to) is > the subversion repo. Git is just a clone/read-only mirror. But since > you're not writing to either of them, use whichever you are more > comfortable working with. :) For more information on how th

Fw: Mentors for GSoC

2010-03-22 Thread Isabel Drost
Potential GSoC mentors - please tell Noirin who you are, if you want to mentor a student for Mahout. More details below. If you have not done so already, please also subscribe to code-awa...@apache.org for more information on GSoC at Apache. Begin forwarded message: Date: Mon, 22 Mar 2010 15:48

Re: [VOTE] Mahout as TLP

2010-03-20 Thread Isabel Drost
> [X] +1 I'm for Mahout being a TLP and the resolution below. signature.asc Description: This is a digitally signed message part.

Re: [NOMINATION] Sean Owen as Mahout PMC Chair

2010-03-17 Thread Isabel Drost
+1 from here as well. On 15.03.2010 deneche abdelhakim wrote: > +1 too > > On Mon, Mar 15, 2010 at 7:53 PM, Jake Mannix wrote: > > +1 from over here. > > > > On Mon, Mar 15, 2010 at 11:36 AM, Drew Farris wrote: > >> +1 as well. > >> > >> On Mon, Mar 15, 2010 at 2:34 PM, Ted Dunning > >> > >> w

Re: [DISCUSS] Mahout TLP Board Resolution

2010-03-17 Thread Isabel Drost
On 15.03.2010 Grant Ingersoll wrote: > GSI: I personally think the Opt-In model is best as it helps cement that > someone is truly interested in helping out. Consider this my "I'm in" > declaration if we go that route. I'm in as well. Mahout has made fantastic progress since it was founded. I

Re: 0.3 release issues

2010-02-23 Thread Isabel Drost
On Tue Grant Ingersoll wrote: > On Feb 23, 2010, at 9:18 AM, Sean Owen wrote: > > > It does look imminent. As much as I don't like holding out longer, > > and indefinitely, for this release, somehow I'd also really like to > > link to the latest/greatest and official Hadoop release. > > > > Let'

Re: 0.3 release issues

2010-02-23 Thread Isabel Drost
On Tue Sean Owen wrote: > Er, how do we do that? Is it something you can describe, I can > document and do? It already has been described - and documented in our wiki: http://cwiki.apache.org/MAHOUT/thirdpartydependencies.html Hope that helps, Isabel

Re: Look! No more ISSUES

2010-02-23 Thread Isabel Drost
On Tue Sean Owen wrote: > I'm happy to play release engineer. Great - Thanks, Sean. Isabel

Re: Welcome Drew Farris

2010-02-20 Thread Isabel Drost
On 18.02.2010 Drew Farris wrote: > I'm looking forward to working with you all, Welcome to the Mahout community, Drew. Looking forward to working with you. Isabel signature.asc Description: This is a digitally signed message part.

Re: Mass Code Cleanup

2010-02-19 Thread Isabel Drost
On 15.02.2010 Robin Anil wrote: > SGD kmeans++ pegasus seems fine. Isabel can you check with the latest trunk > if the perceptron is alright? Any code I had is already checked in. Any examples I am working on should be easy to adopt. Isabel signature.asc Description: This is a digitally signed

Re: Mass Code Cleanup

2010-02-19 Thread Isabel Drost
On 14.02.2010 Grant Ingersoll wrote: > I don't object to good style. I object to sweeping changes that break a > lot of patches. Maybe not the case here, but it will be in the future and > unless the whole thing is automated as part of committing (as Hadoop > does), the code will always have f

Re: Mahout as TLP

2010-02-15 Thread Isabel Drost
On Sat Grant Ingersoll wrote: > > I don't see any harm in getting 0.3 out first if that makes folks > > more comfortable. > > Yeah, this feels better to me the more I think about it. +1 from me as well: I really like the idea of Mahout becoming a TLP - even before a 1.0 release is available. Ho

[jira] Updated: (MAHOUT-281) scm urls are wrong in the poms

2010-02-10 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost updated MAHOUT-281: Status: Patch Available (was: Open) > scm urls are wrong in the p

[jira] Updated: (MAHOUT-281) scm urls are wrong in the poms

2010-02-10 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost updated MAHOUT-281: Attachment: MAHOUT-281.diff Changed scm connection strings. (Needed a comparably simple example to

Re: Mahout 0.3 Plan and other changes

2010-02-10 Thread Isabel Drost
On Wed Sean Owen wrote: > I'd say we recommend 0.20, since that's what we develop against and > it's the current stable release, and everything we have works on it. > > We can also say it should work on 0.19 and 0.18, but we don't > guarantee or support that. (Slightly different than my last sug

Re: Some more dependencies

2010-02-10 Thread Isabel Drost
On Wed Jake Mannix wrote: > > May I kick them out? > > > > +1 +1 from me as well. Isabel

Re: Mahout 0.3 Plan and other changes

2010-02-10 Thread Isabel Drost
On Wed, 10 Feb 2010 11:10:41 + Sean wrote: > For simplicity, I'd document that Mahout works on 0.19 and 0.20, and > may work on 0.18 +1 Assuming that the majority of the algorithms may work on e.g. 0.19, we could tell users something along the lines of "works with Hadoop 0.19, except $algor

Re: Mahout 0.3 Plan and other changes

2010-02-10 Thread Isabel Drost
On Thu deneche abdelhakim wrote: > although I maintain two versions of Decision Forests, one with the old > api and with the new one, the differences between the two APIs are so > important that I can't just keep working on the two versions. Thus all > the new stuff is being committed using the ne

Re: GSOC 2010 is here

2010-02-02 Thread Isabel Drost
On Mon Robin Anil wrote: > 2. UIMA Integration with Mahout? (Maybe a good project if UIMA folks > are taking in GSOC students) I guess one could easily split this one in two: a) Using UIMA (whole pipeline or just the analysers if that is possible) for data pre-processing before Mahout algorithms

Re: GSOC 2010 is here

2010-02-01 Thread Isabel Drost
On Wed Robin Anil wrote: > Greetings! Fellow GSOC alums, administrators and dear mentors, the > next edition is right here. Details are given in the link below. > > https://groups.google.com/group/google-summer-of-code-discuss/browse_thread/thread/d839c0b02ac15b3f Some additional notes to commit

Re: Release thinking

2010-02-01 Thread Isabel Drost
On Mon Jake Mannix wrote: > On Mon, Jan 25, 2010 at 10:55 AM, Sean Owen wrote: > > > Agree that we should start planning 0.3, as it will take over a > > month I bet to actually be ready. > > > > +1 to releasing within a month or so. +1 here as well. I think it would be great to reach a shorter

Re: Release thinking

2010-02-01 Thread Isabel Drost
On Mon Ted Dunning wrote: > 240 can be WONT-FIX'ed. +1 > I think that Isabel may have something for 241. Nothing that I see as ready to go into 0.3. Isabel

Re: Release thinking

2010-02-01 Thread Isabel Drost
On Mon Grant Ingersoll wrote: > >> MAHOUT-231 Upgrade QM reports to use Clover 2.6 > >> > > > > No idea on this one. > > That should be independent of a release, I would think. It is. What would be needed is adjusting our pom and the Hudson job that builds the reports. Isabel

Re: [jira] Commented: (MAHOUT-238) Further Dependency Cleanup

2010-01-25 Thread Isabel Drost
On Mon Grant Ingersoll wrote: > We put it up there. http://www.lucidimagination.com/search/document/621471200d2182bb/dependencies_outside_maven_central_was_oh_joy#621471200d2182bb is the link to the posting by Jukka explaining exactly how it was done. Isabel

[jira] Commented: (MAHOUT-262) Writable for labeled vectors for supervised learning algorithms

2010-01-22 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803690#action_12803690 ] Isabel Drost commented on MAHOUT-262: - Should be possible to apply the patch with

[jira] Updated: (MAHOUT-246) upgrade to new lucene TokenStream API to cleanup deprecation warnings

2010-01-21 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost updated MAHOUT-246: Resolution: Fixed Assignee: Olivier Grisel Status: Resolved (was: Patch Available

[jira] Commented: (MAHOUT-242) LLR Collocation Identifier

2010-01-21 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803381#action_12803381 ] Isabel Drost commented on MAHOUT-242: - {quote} I am not worried about them at

Re: Status, IoC, Random numbers, etc.

2010-01-21 Thread Isabel Drost
On Mon Jake Mannix wrote: > I'm down with IoC, it's a great way to program to interfaces and > abstract away your deep coupling, but open-source libraries I think > aren't the best place for it. +1 I agree with your assessment of DI containers: Spring is very powerful and can simplify wiring larg

[jira] Commented: (MAHOUT-264) Make mahout-math compatible with Java 1.5 (bytecode and standard library).

2010-01-21 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803281#action_12803281 ] Isabel Drost commented on MAHOUT-264: - The changes to the pom look good. But why

[jira] Commented: (MAHOUT-217) Tidy up generated data after unit tests are run

2010-01-21 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803276#action_12803276 ] Isabel Drost commented on MAHOUT-217: - The test files I found creating but

[jira] Commented: (MAHOUT-237) Map/Reduce Implementation of Document Vectorizer

2010-01-21 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803275#action_12803275 ] Isabel Drost commented on MAHOUT-237: - Hmm, Robin your last comment is "

[jira] Commented: (MAHOUT-242) LLR Collocation Identifier

2010-01-21 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803274#action_12803274 ] Isabel Drost commented on MAHOUT-242: - First of all, thanks for the patch. The

Re: Tapioca anyone (fisheye)

2010-01-20 Thread Isabel Drost
On Sun Benson Margulies wrote: > http://fisheye6.atlassian.com/browse/mahout Thanks for fisheye integration. Isabel

[jira] Commented: (MAHOUT-153) Implement kmeans++ for initial cluster selection in kmeans

2010-01-16 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12801280#action_12801280 ] Isabel Drost commented on MAHOUT-153: - Welcome to Mahout. Thanks for stepping up

Re: New MEAP: Mahout in Action

2010-01-14 Thread Isabel Drost
On 15.01.2010 Grant Ingersoll wrote: > (BTW, great read so far, I've got 3 more chapters to go in the first > 6!) Can second that: Great book indeed. > We should state up front, just like in Lucene land, that anyone who has a > book on Mahout is welcome to link it on the page. The more books

[jira] Updated: (MAHOUT-244) Add root log-likelihood method to LogLikehood class.

2010-01-14 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost updated MAHOUT-244: Resolution: Fixed Status: Resolved (was: Patch Available) Patch applies cleanly and looks

[jira] Assigned: (MAHOUT-244) Add root log-likelihood method to LogLikehood class.

2010-01-14 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost reassigned MAHOUT-244: --- Assignee: Drew Farris > Add root log-likelihood method to LogLikehood cl

Re: [math] no-such-integer value

2010-01-14 Thread Isabel Drost
On Mon Grant Ingersoll wrote: > I'm sensing a theme. I think for this stuff we should prune fairly > aggressively, then add back in places once we have a need. +1 Isabel

Re: Fisheye?

2010-01-14 Thread Isabel Drost
On Wed Benson Margulies wrote: > Are we set up? If we are, than at least I am not aware of it. Isabel

Re: Welcome Benson Marguiles as Mahout Committer

2010-01-14 Thread Isabel Drost
On Wed Grant Ingersoll wrote: > The Lucene PMC is pleased to welcome the addition of Benson Marguiles > as a committer on Mahout. Welcome Benson - thanks to all the great work you have done so far for the mahout-math stuff. Looking forward to working together with you. Isabel

[jira] Commented: (MAHOUT-85) Perceptron/Winnow Trainer

2010-01-10 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-85?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12798524#action_12798524 ] Isabel Drost commented on MAHOUT-85: No, sorry. That was me committing a change th

[jira] Created: (MAHOUT-241) Example for perceptron

2010-01-10 Thread Isabel Drost (JIRA)
: Isabel Drost Fix For: 0.3 The goal is to provide an end-to-end example based on the 20-newsgroups dataset to show how to get from a set of labelled training examples to a trained model that can later be reused. -- This message is automatically generated by JIRA. - You can reply

[jira] Created: (MAHOUT-240) Parallel version of Perceptron

2010-01-10 Thread Isabel Drost (JIRA)
Reporter: Isabel Drost Fix For: 0.3 So far Perceptron (as well as Winnow) training is still implemented to run w/o parallelization. The goal of this issue is to explore ways for parallelization and if possible to provide a parallel version, that is one that is based on map

[jira] Resolved: (MAHOUT-85) Perceptron/Winnow Trainer

2010-01-10 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-85?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost resolved MAHOUT-85. Resolution: Fixed Finally committed. > Perceptron/Winnow Trai

Re: Mahout on Hudson

2010-01-03 Thread Isabel Drost
On Wednesday 30 December 2009 16:25:43 Grant Ingersoll wrote: > > Also, are we publishing nightly snapshots anywhere? > > To answer my own question: yes, yes we are: > https://repository.apache.org/index.html#nexus-search;quick~mahout http://hudson.zones.apache.org/hudson/job/Mahout nightly/ is t

Re: [math]: how to test sorts

2009-12-28 Thread Isabel Drost
On Wednesday 23 December 2009 22:09:48 Grant Ingersoll wrote: > Beyond that, we could start implementing Clover test coverage, I suppose. It comes with the code quality reports added earlier. They are generated on a daily basis through Hudson and are linked to in the dev section of our web page.

Re: [math] boolean collections

2009-12-28 Thread Isabel Drost
On Wednesday 23 December 2009 13:15:09 Benson Margulies wrote: > I spent some time last night on the question of 'boolean'. I > concluded, to begin with, that a BooleanArrayList is just flat-out > silly, insofar as there is a BitVector class. > > I am also inclined to stop trying thinking about thi

Re: Hadoop dependency on confluence

2009-12-28 Thread Isabel Drost
On Saturday 19 December 2009 20:41:19 Benson Margulies wrote: > http://cwiki.apache.org/MAHOUT/quickstart.html#FootnoteMarker2 > > Talks about parent/pom.xml. > > In fact, the version is in mahout-core. Now, it *could* be in > maven/pom.xml. And maven/pom.xml *could* be parent/pom.xml, and that > w

Re: Eclipse and checkstyle

2009-12-28 Thread Isabel Drost
On Saturday 19 December 2009 16:30:31 Benson Margulies wrote: > Since you've got a checkstyle set that you like, can I go ahead and > build the profile for setting up eclipse to use it? Sure. There should already be a checkstyle file checked in (maven module) - feel free to use that or replace by

Re: unit test failures on my mac

2009-12-28 Thread Isabel Drost
On Saturday 19 December 2009 16:29:40 Benson Margulies wrote: > Running org.apache.mahout.fpm.pfpgrowth.FPGrowthTest > Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0.322 > sec <<< FAILURE! > Running org.apache.mahout.df.mapreduce.partial.TreeIDTest > Tests run: 1, Failures: 0, Er

Re: How to apply these patches

2009-12-28 Thread Isabel Drost
On Saturday 19 December 2009 16:15:46 Drew Farris wrote: > Gang, should the wiki > (http://cwiki.apache.org/MAHOUT/howtocontribute.html) be updated to > include -E? Sure*. Isabel * The wiki is open for edits by anyone. All you need is a wiki account which you can create without being a committ

[jira] Created: (MAHOUT-231) Upgrade QM reports to use Clover 2.6

2009-12-27 Thread Isabel Drost (JIRA)
Reporter: Isabel Drost Priority: Minor Fix For: 0.3 Atlassian has donated a license for a new Clover version. The reports provide more information and are easier to read. We should upgrade to site reports to use that version. -- This message is automatically

[jira] Updated: (MAHOUT-85) Perceptron/Winnow Trainer

2009-12-26 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-85?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost updated MAHOUT-85: --- Attachment: MAHOUT-85.patch The patch has tests added to the implementation. The additional

[jira] Updated: (MAHOUT-85) Perceptron/Winnow Trainer

2009-12-26 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-85?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost updated MAHOUT-85: --- Attachment: MAHOUT-85.patch The patch has tests added to the implementation. The additional

[jira] Commented: (MAHOUT-210) Publish code quality reports through maven

2009-12-18 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12792449#action_12792449 ] Isabel Drost commented on MAHOUT-210: - Forgot to include what I changed to mak

[jira] Resolved: (MAHOUT-210) Publish code quality reports through maven

2009-12-18 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost resolved MAHOUT-210. - Resolution: Fixed Links are working now and accessible without logging into hudson. What remains

[jira] Commented: (MAHOUT-210) Publish code quality reports through maven

2009-12-17 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12792019#action_12792019 ] Isabel Drost commented on MAHOUT-210: - Update: Clover tests are up now as well.

[jira] Commented: (MAHOUT-210) Publish code quality reports through maven

2009-12-17 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12791887#action_12791887 ] Isabel Drost commented on MAHOUT-210: - Checked in the current status of the re

[jira] Commented: (MAHOUT-224) Dependency Cleanup

2009-12-15 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12790658#action_12790658 ] Isabel Drost commented on MAHOUT-224: - Maven supports marking dependencies as &qu

[jira] Commented: (MAHOUT-217) Tidy up generated data after unit tests are run

2009-12-15 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12790656#action_12790656 ] Isabel Drost commented on MAHOUT-217: - Not only fpgrowth. I will take a closer loo

[jira] Commented: (MAHOUT-220) Mahout Bayes Code cleanup

2009-12-15 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12790653#action_12790653 ] Isabel Drost commented on MAHOUT-220: - Before reorganizing code - could someone wh

Re: SVM algo, code, etc.

2009-12-15 Thread Isabel Drost
On Fri Sean Owen wrote: > Sure is there a mailing list or something for this? I'd like to be > looped into talking about issues like this. d...@community.apache.org Isabel

Re: SVM algo, code, etc.

2009-12-11 Thread Isabel Drost
On Fri Sean Owen wrote: > On Fri Isabel Drost wrote: > > If you are interested in a broader discussion, it might make sense > > to include the people over at the newly founded community > > development project in the discussion? > > What's this? Attractin

[jira] Updated: (MAHOUT-210) Publish code quality reports through maven

2009-12-11 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost updated MAHOUT-210: Attachment: MAHOUT-210.patch The patch adds clover, findbugs, pmd, cpd and maven dependency

Re: SVM algo, code, etc.

2009-12-11 Thread Isabel Drost
On Fri Sean Owen wrote: > 1) Is SVM in scope for Mahout? (I am guessing so.) Yes. > 2) Who is nominally committing to shepherd the code into the code base > and fix bugs and answer questions? (Jake?) > > I'm not really bothered about this particular patch, but the more > general question. I

[jira] Commented: (MAHOUT-85) Perceptron/Winnow Trainer

2009-12-11 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-85?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789312#action_12789312 ] Isabel Drost commented on MAHOUT-85: I am about to add tests currently. I guess, I

[jira] Created: (MAHOUT-217) Tidy up generated data after unit tests are run

2009-12-11 Thread Isabel Drost (JIRA)
Reporter: Isabel Drost Fix For: 0.3 I tried to compile Mahout on people.apache.org yesterday: The build failed at first, because tests could not generate test data. The reason: Some tests tried to generate test data at /tmp//... - but those directories did exist already and

[jira] Assigned: (MAHOUT-210) Publish code quality reports through maven

2009-12-10 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost reassigned MAHOUT-210: --- Assignee: Isabel Drost > Publish code quality reports through ma

Re: [jira] Assigned: (MAHOUT-11) Static fields used throughout clustering code (Canopy, K-Means).

2009-12-10 Thread Isabel Drost
On Thu Sean Owen wrote: > Looks like Hudson is saying that broke the build but looks like easily > addressable stuff. Fixed it - but only shortly *after* Hudson had already started building the project :/ Triggered the build on Hudson manually a few minutes ago - now it runs successfully again.

[jira] Assigned: (MAHOUT-11) Static fields used throughout clustering code (Canopy, K-Means).

2009-12-10 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-11?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost reassigned MAHOUT-11: -- Assignee: Drew Farris (was: Isabel Drost) Thanks. > Static fields used throughout cluster

[jira] Assigned: (MAHOUT-11) Static fields used throughout clustering code (Canopy, K-Means).

2009-12-10 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-11?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost reassigned MAHOUT-11: -- Assignee: Isabel Drost > Static fields used throughout clustering code (Canopy, K-Me

[jira] Updated: (MAHOUT-11) Static fields used throughout clustering code (Canopy, K-Means).

2009-12-10 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-11?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost updated MAHOUT-11: --- Resolution: Fixed Status: Resolved (was: Patch Available) Committed. Thanks Drew for your

[jira] Commented: (MAHOUT-11) Static fields used throughout clustering code (Canopy, K-Means).

2009-12-09 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-11?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12788129#action_12788129 ] Isabel Drost commented on MAHOUT-11: I'll make the changes before committing

[jira] Resolved: (MAHOUT-90) Adding all scripts (for nightly build) to SVN repository.

2009-12-07 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-90?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost resolved MAHOUT-90. Resolution: Later Marked as "Later" - currently snapshots are published to the ap

[jira] Commented: (MAHOUT-85) Perceptron/Winnow Trainer

2009-12-06 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-85?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786679#action_12786679 ] Isabel Drost commented on MAHOUT-85: It is just a sequential version of the algor

[jira] Commented: (MAHOUT-90) Adding all scripts (for nightly build) to SVN repository.

2009-12-06 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-90?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786678#action_12786678 ] Isabel Drost commented on MAHOUT-90: I did add a hudson job to upload maven snaps

[jira] Assigned: (MAHOUT-90) Adding all scripts (for nightly build) to SVN repository.

2009-12-06 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-90?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost reassigned MAHOUT-90: -- Assignee: (was: Isabel Drost) > Adding all scripts (for nightly build) to SVN reposit

[jira] Commented: (MAHOUT-11) Static fields used throughout clustering code (Canopy, K-Means).

2009-12-04 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-11?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12785985#action_12785985 ] Isabel Drost commented on MAHOUT-11: Applies cleanly and builds w/o unit test fail

Re: Packaging target + dependencies in one .jar with Maven?

2009-12-03 Thread Isabel Drost
On Thu Sean Owen wrote: > Anyone know if there is an easy way to package a build target with all > its dependencies with Maven? I can't find the formula with the > assembly plugin but guess it is there. Hmm, judging from the poms in our repo, we are currently doing that through an ant-script. Ju

Re: Publish code quality reports on web-site?

2009-12-03 Thread Isabel Drost
On Thu Sean Owen wrote: > I suggest our current stance be that we use 0.20.x, with the old APIs. > When 0.21 comes out and stabilizes, we move. So I suggest keeping > these and deleting 'mapred' at that point. Sounds good to me. Isabel

Re: Publish code quality reports on web-site?

2009-12-03 Thread Isabel Drost
On Sun deneche abdelhakim wrote: > df/mapred works with the old hadoop API > df/mapreduce works with hadoop 0.20 API Hmm. Maybe it would still be possible to factor that code out that is common to both implementations? That step might make migrating to a future Hadoop version easier as well as o

Re: Publish code quality reports on web-site?

2009-11-28 Thread Isabel Drost
On Saturday 28 November 2009 21:29:05 Drew Farris wrote: > It will be be interesting to see the reports for the other modules as > well. examples, utils, matrix. As a little preview: Just substitute mahout-core with mahout- in the url below: http://people.apache.org/~isabel/mahout_site/mahout-co

[jira] Created: (MAHOUT-210) Publish code quality reports through maven

2009-11-28 Thread Isabel Drost (JIRA)
Versions: 0.1, 0.2 Reporter: Isabel Drost Fix For: 0.3 We should use mvn site:site to generate code reports and publish them online for users to review and developers to easily spot problems. First version that still needs checks adjusted to our needs is available online

Re: Publish code quality reports on web-site?

2009-11-28 Thread Isabel Drost
On Saturday 28 November 2009 08:30:26 Sean Owen wrote: > I'm all for generating and publishing this. Great. Than I will go an tweak the checks to match our guidelines, twiddle a bit with the output format and than integrate the stuff into our nightly build. > I didn't see anything big flagged,

Publish code quality reports on web-site?

2009-11-27 Thread Isabel Drost
Hello, I just ran several code analysis reports over the Mahout source code. Results are published at http://people.apache.org/~isabel/mahout_site/mahout-core/project-reports.html It includes several reports on code quality, test coverage, java docs and the like. When generated regularly say on

[jira] Commented: (MAHOUT-11) Static fields used throughout clustering code (Canopy, K-Means).

2009-11-25 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-11?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12782470#action_12782470 ] Isabel Drost commented on MAHOUT-11: Drew, go ahead then. > Static fiel

Re: SVM algo, code, etc.

2009-11-25 Thread Isabel Drost
On Fri Grant Ingersoll wrote: > On Nov 19, 2009, at 1:15 PM, Sean Owen wrote: > > Post a patch if you'd like to proceed, IMHO. > +1 +1 from me as well. I would love to see solid svm support in Mahout. Isabel

[jira] Commented: (MAHOUT-11) Static fields used throughout clustering code (Canopy, K-Means).

2009-11-19 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-11?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12780476#action_12780476 ] Isabel Drost commented on MAHOUT-11: First of all, thanks for the review. Passing

[jira] Updated: (MAHOUT-11) Static fields used throughout clustering code (Canopy, K-Means).

2009-11-19 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-11?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost updated MAHOUT-11: --- Attachment: MAHOUT-11.patch Not the original author of the source, but still managed to get the

Re: Trunk is now open

2009-11-18 Thread Isabel Drost
On Wed Grant Ingersoll wrote: > Trunk is now open for commits. Yeah! > Seems like we have some good things in store for 0.3, so have at it! +1 Isabel

[jira] Resolved: (MAHOUT-200) Update information on Mahout site

2009-11-18 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost resolved MAHOUT-200. - Resolution: Fixed Fix Version/s: (was: 0.3) 0.2 Updated web page

Re: [jira] Commented: (MAHOUT-18) Embrace interoperability with other softwares

2009-11-17 Thread Isabel Drost
On Tue Andrew Wang wrote: > As you know, i am new guy about the Mahout. suppose i have one model > trained in WEKA using distinct classifiers, if the Mahout have some > port to import the model, and using the model in the up-coming > process, it will be very cool. Could you please explain exactly

Re: [VOTE] Release 0.2

2009-11-16 Thread Isabel Drost
On Monday 16 November 2009 19:44:38 Ted Dunning wrote: > Congrats. Congratulations from me as well! Isabel -- |\ _,,,---,,_ Web: /,`.-'`'-. ;-;;,_ |,4- ) )-,_..;\ ( `'-' '---''(_/--' `-'\_) (fL) IM: signature.asc Description: Thi

  1   2   3   4   5   >