[jira] Updated: (MAHOUT-157) Frequent Pattern Mining using Parallel FP-Growth

2009-09-30 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robin Anil updated MAHOUT-157: -- Component/s: Frequent Itemset/Association Rule Mining > Frequent Pattern Mining using Parallel FP-Growt

[jira] Created: (MAHOUT-181) DistanceMeasure is broken: iteration is done over nonZeroElements of v1.plus(v2), not v1.minus(v2)

2009-09-30 Thread Jake Mannix (JIRA)
DistanceMeasure is broken: iteration is done over nonZeroElements of v1.plus(v2), not v1.minus(v2) -- Key: MAHOUT-181 URL: https://issues.apache.org/jira/browse/MAHOUT

[jira] Updated: (MAHOUT-181) DistanceMeasure is broken: iteration is done over nonZeroElements of v1.plus(v2), not v1.minus(v2)

2009-09-30 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jake Mannix updated MAHOUT-181: --- Status: Patch Available (was: Open) Fix for all but TanimotoDistanceMeasure, as well as (currently f

[jira] Updated: (MAHOUT-181) DistanceMeasure is broken: iteration is done over nonZeroElements of v1.plus(v2), not v1.minus(v2)

2009-09-30 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jake Mannix updated MAHOUT-181: --- Attachment: MAHOUT-181.patch Hmm... missed the patch the first time. > DistanceMeasure is broken: it

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12760879#action_12760879 ] Jake Mannix commented on MAHOUT-165: Hey Ted, I tried bringing your patch up to current

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12760886#action_12760886 ] Jake Mannix commented on MAHOUT-165: One test which is failing is the basic VectorTest

[jira] Commented: (MAHOUT-170) Enable Java compile optimize flag during build

2009-09-30 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12760890#action_12760890 ] Robin Anil commented on MAHOUT-170: --- A Quick Test, uses some parameters which i felt woul

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12760893#action_12760893 ] Sean Owen commented on MAHOUT-165: -- It's a good, important question and one I think needs

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12760898#action_12760898 ] Grant Ingersoll commented on MAHOUT-165: There are some thoughts on equals, etc. in

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12760902#action_12760902 ] Grant Ingersoll commented on MAHOUT-165: There are some thoughts on equals, etc. in

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12760956#action_12760956 ] Sean Owen commented on MAHOUT-165: -- Are my conclusions sound then: We agree that equals()

Re: [jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Jake Mannix
Regarding having equals() effectively delegate to getName().equals(other.getName()) && equivalent(other) means that we need to be extra special careful about implementations of hashCode() : If we are not going to break the contract between equals() and hashCode(), and we're having equals() *only*

Re: [jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Grant Ingersoll
On Sep 30, 2009, at 4:03 PM, Jake Mannix wrote: Regarding having equals() effectively delegate to getName().equals(other.getName()) && equivalent(other) means that we need to be extra special careful about implementations of hashCode() : If we are not going to break the contract between equ

Re: [jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Sean Owen
No I don't hear anyone wanting to make equals() ignore the name. (Otherwise, hashCode() would have to ignore it as well.) JIRA also seems pretty laggy to me. On Wed, Sep 30, 2009 at 9:03 PM, Jake Mannix wrote: > If we are not going to break the contract between equals() and hashCode(), > and we'

Re: [jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Jake Mannix
On Wed, Sep 30, 2009 at 1:16 PM, Grant Ingersoll wrote: > > On Sep 30, 2009, at 4:03 PM, Jake Mannix wrote: > > Regarding having equals() effectively delegate to >> getName().equals(other.getName()) && equivalent(other) means that we need >> to >> be extra special careful about implementations of

Re: [jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Jake Mannix
I didn't say that equals() should ignore name, I said the opposite - equals and hashCode() should *only* take into account the contents and the name, and not implementation (which means that hashCode() needs to stay in one place and not get monkeyed with in subclasses. On Wed, Sep 30, 2009 at 1:18

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761002#action_12761002 ] Jake Mannix commented on MAHOUT-165: Good luck with the "quick" part - there seem to be

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761000#action_12761000 ] Ted Dunning commented on MAHOUT-165: I will take a quick look this evening at the patc

[jira] Updated: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jake Mannix updated MAHOUT-165: --- Attachment: MAHOUT-165-updated.patch > Using better primitives hash for sparse vector for performance

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761018#action_12761018 ] Jake Mannix commented on MAHOUT-165: Ted, some notes on your patch: * with the two

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761021#action_12761021 ] Ted Dunning commented on MAHOUT-165: THanks Jake, that could be very helpful. The thr

[jira] Created: (MAHOUT-182) New helper methods for Matrix: times(Vector), timesSquared(Vector), numRows() and numCols()

2009-09-30 Thread Jake Mannix (JIRA)
New helper methods for Matrix: times(Vector), timesSquared(Vector), numRows() and numCols() --- Key: MAHOUT-182 URL: https://issues.apache.org/jira/browse/MAHOUT-182

[jira] Updated: (MAHOUT-182) New helper methods for Matrix: times(Vector), timesSquared(Vector), numRows() and numCols()

2009-09-30 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jake Mannix updated MAHOUT-182: --- Attachment: matrixTimes.patch Patch to add these methods with unit tests. > New helper methods for M

Re: [ANNOUNCEMENT] Apache Commons Math 2.0 Released

2009-09-30 Thread Jake Mannix
So what's the status on integration of commons-math-2.0 in Mahout? Do we need that stuff? Some of their apis are pretty ugly (look at the number of methods you need to implement to qualify to be a "RealVector"), but piggybacking on some of their functionality would be pretty useful (especially st

Re: [ANNOUNCEMENT] Apache Commons Math 2.0 Released

2009-09-30 Thread Ted Dunning
No motion. I was pushing that integration because it looked like MTJ was integrating with them. That would give some pretty high performance linear algebra to commons-math. That hasn't gone anywhere lately as far as I know. The only other integration point is that every time we have needed some

Re: [ANNOUNCEMENT] Apache Commons Math 2.0 Released

2009-09-30 Thread Jake Mannix
On Wed, Sep 30, 2009 at 8:26 PM, Ted Dunning wrote: > No motion. I was pushing that integration because it looked like MTJ was > integrating with them. That would give some pretty high performance linear > algebra to commons-math. > MTJ is LGPL, how was that ever going anywhere? Luc has been