[jira] Created: (MAHOUT-388) Upgrade Lucene

2010-04-28 Thread Grant Ingersoll (JIRA)
Upgrade Lucene -- Key: MAHOUT-388 URL: https://issues.apache.org/jira/browse/MAHOUT-388 Project: Mahout Issue Type: Improvement Reporter: Grant Ingersoll Priority: Minor Upgrade Lucene version used to

[jira] Commented: (MAHOUT-379) SequentialAccessSparseVector.equals does not agree with AbstractVector.equivalent

2010-04-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12856835#action_12856835 ] Grant Ingersoll commented on MAHOUT-379: I think we probably should have a

[jira] Created: (MAHOUT-342) [GSOC] Implement Map/Reduce Enabled Neural Networks

2010-03-19 Thread Grant Ingersoll (JIRA)
[GSOC] Implement Map/Reduce Enabled Neural Networks --- Key: MAHOUT-342 URL: https://issues.apache.org/jira/browse/MAHOUT-342 Project: Mahout Issue Type: New Feature Reporter:

[jira] Created: (MAHOUT-343) [GSOC] Implement Integration of Mahout Clustering or Classification with Apache Solr

2010-03-19 Thread Grant Ingersoll (JIRA)
[GSOC] Implement Integration of Mahout Clustering or Classification with Apache Solr Key: MAHOUT-343 URL: https://issues.apache.org/jira/browse/MAHOUT-343 Project:

[jira] Commented: (MAHOUT-335) Mahout Logo tweak

2010-03-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12845079#action_12845079 ] Grant Ingersoll commented on MAHOUT-335: Can we see #2 w/o the hair on the person?

[jira] Commented: (MAHOUT-301) Improve command-line shell script by allowing default properties files

2010-02-26 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12838834#action_12838834 ] Grant Ingersoll commented on MAHOUT-301: Just capturing something longer term here,

[jira] Created: (MAHOUT-290) Make SequenceFileFromDirectory input args consistent with others

2010-02-13 Thread Grant Ingersoll (JIRA)
Make SequenceFileFromDirectory input args consistent with others Key: MAHOUT-290 URL: https://issues.apache.org/jira/browse/MAHOUT-290 Project: Mahout Issue Type: Bug

[jira] Assigned: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

2010-02-11 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned MAHOUT-185: -- Assignee: Grant Ingersoll Add mahout shell script for easy launching of various

[jira] Commented: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

2010-02-11 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12832626#action_12832626 ] Grant Ingersoll commented on MAHOUT-185: Looks like a good start. Longer term, we

[jira] Updated: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

2010-02-11 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-185: --- Affects Version/s: (was: 0.2) Fix Version/s: (was: 0.4)

[jira] Commented: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

2010-02-11 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12832661#action_12832661 ] Grant Ingersoll commented on MAHOUT-185: Committed revision 909120. Add mahout

[jira] Commented: (MAHOUT-215) Provide jars with mahout release.

2010-01-28 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12805931#action_12805931 ] Grant Ingersoll commented on MAHOUT-215: Just an FYI, we need to make sure we can

[jira] Commented: (MAHOUT-153) Implement kmeans++ for initial cluster selection in kmeans

2010-01-18 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12801755#action_12801755 ] Grant Ingersoll commented on MAHOUT-153: Please keep the same issue. That way the

[jira] Commented: (MAHOUT-85) Perceptron/Winnow Trainer

2010-01-10 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-85?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12798489#action_12798489 ] Grant Ingersoll commented on MAHOUT-85: --- Why is PerceptronTrainingMapper empty? Are

[jira] Assigned: (MAHOUT-235) GenericSorting.java also needs replacing

2010-01-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned MAHOUT-235: -- Assignee: Grant Ingersoll GenericSorting.java also needs replacing

[jira] Commented: (MAHOUT-106) PLSI/EM in pig based on hofmann's ACM 04 paper.

2010-01-03 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795996#action_12795996 ] Grant Ingersoll commented on MAHOUT-106: I still intend to review this for 0.3.

[jira] Created: (MAHOUT-236) Cluster Evaluation Tools

2010-01-03 Thread Grant Ingersoll (JIRA)
Cluster Evaluation Tools Key: MAHOUT-236 URL: https://issues.apache.org/jira/browse/MAHOUT-236 Project: Mahout Issue Type: New Feature Components: Clustering Reporter: Grant Ingersoll Per

[jira] Commented: (MAHOUT-220) Mahout Bayes Code cleanup

2009-12-31 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795603#action_12795603 ] Grant Ingersoll commented on MAHOUT-220: Yeah, I don't think Utils should need to

[jira] Commented: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-12-31 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795604#action_12795604 ] Grant Ingersoll commented on MAHOUT-163: Yep, I committed a change w/ those things

[jira] Updated: (MAHOUT-230) Replace org.apache.mahout.math.Sorting with code of clear provenance

2009-12-31 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-230: --- Resolution: Fixed Status: Resolved (was: Patch Available) Anyone can. Replace

[jira] Commented: (MAHOUT-220) Mahout Bayes Code cleanup

2009-12-30 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795320#action_12795320 ] Grant Ingersoll commented on MAHOUT-220: FWIW, I'd say stuff that converts text,

[jira] Updated: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-12-30 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-163: --- Attachment: MAHOUT-163.patch Cleans up some issues, adds license headers. Gives more

[jira] Updated: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-12-30 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-163: --- Attachment: MAHOUT-163.patch a little more clean up. Get (better) cluster labels using Log

[jira] Commented: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-12-30 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795360#action_12795360 ] Grant Ingersoll commented on MAHOUT-163: Committed revision 894684. Get (better)

[jira] Commented: (MAHOUT-230) Replace org.apache.mahout.math.Sorting with code of clear provenance

2009-12-29 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795048#action_12795048 ] Grant Ingersoll commented on MAHOUT-230: I think we need to commit and then worry

[jira] Commented: (MAHOUT-230) Replace org.apache.mahout.math.Sorting with code of clear provenance

2009-12-29 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795053#action_12795053 ] Grant Ingersoll commented on MAHOUT-230: Committed revision 894390. Replace

[jira] Updated: (MAHOUT-225) Rename mahout-matrix to mahout-math

2009-12-17 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-225: --- Resolution: Fixed Status: Resolved (was: Patch Available) Committed revision

[jira] Commented: (MAHOUT-204) Better integration of Mahout matrix capabilities with Colt Matrix additions

2009-11-26 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12782898#action_12782898 ] Grant Ingersoll commented on MAHOUT-204: +1 on aggressive pruning and cleanup.

[jira] Commented: (MAHOUT-204) Better integration of Mahout matrix capabilities with Colt Matrix additions

2009-11-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12781896#action_12781896 ] Grant Ingersoll commented on MAHOUT-204: Yeah, go ahead and submit the patch, then

[jira] Assigned: (MAHOUT-207) AbstractVector.hashCode() should not care about the order of iteration over elements

2009-11-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned MAHOUT-207: -- Assignee: Grant Ingersoll AbstractVector.hashCode() should not care about the order

[jira] Commented: (MAHOUT-207) AbstractVector.hashCode() should not care about the order of iteration over elements

2009-11-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12782041#action_12782041 ] Grant Ingersoll commented on MAHOUT-207: How does this all relate to

[jira] Assigned: (MAHOUT-206) Separate and clearly label different SparseVector implementations

2009-11-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned MAHOUT-206: -- Assignee: Grant Ingersoll Separate and clearly label different SparseVector

[jira] Commented: (MAHOUT-206) Separate and clearly label different SparseVector implementations

2009-11-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12782054#action_12782054 ] Grant Ingersoll commented on MAHOUT-206: Jake, there's something weird in this

[jira] Commented: (MAHOUT-207) AbstractVector.hashCode() should not care about the order of iteration over elements

2009-11-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12782064#action_12782064 ] Grant Ingersoll commented on MAHOUT-207: Aren't we loosing some of the benefits of

[jira] Commented: (MAHOUT-207) AbstractVector.hashCode() should not care about the order of iteration over elements

2009-11-24 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12782080#action_12782080 ] Grant Ingersoll commented on MAHOUT-207: All makes sense. Per the refactoring in

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-11-23 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12781427#action_12781427 ] Grant Ingersoll commented on MAHOUT-165: OK, I am committing the Matrix module.

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-11-23 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12781436#action_12781436 ] Grant Ingersoll commented on MAHOUT-165: OK, I moved over the matrix module, but

[jira] Created: (MAHOUT-204) Better integration of Mahout matrix capabilities with Colt Matrix additions

2009-11-23 Thread Grant Ingersoll (JIRA)
Better integration of Mahout matrix capabilities with Colt Matrix additions --- Key: MAHOUT-204 URL: https://issues.apache.org/jira/browse/MAHOUT-204 Project: Mahout

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-11-23 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12781438#action_12781438 ] Grant Ingersoll commented on MAHOUT-165: d'oh, missed the correct package names.

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-11-23 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12781467#action_12781467 ] Grant Ingersoll commented on MAHOUT-165: OK, I committed Shashi's patch and fixed

[jira] Commented: (MAHOUT-204) Better integration of Mahout matrix capabilities with Colt Matrix additions

2009-11-23 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12781640#action_12781640 ] Grant Ingersoll commented on MAHOUT-204: Command is good, but patch would be useful

[jira] Commented: (MAHOUT-206) Separate and clearly label different SparseVector implementations

2009-11-23 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12781701#action_12781701 ] Grant Ingersoll commented on MAHOUT-206: Sorry, yes, I missed that and I agree we

[jira] Updated: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-11-22 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-165: --- Attachment: MAHOUT-165-colt.patch The Colt stuff looks good, my only concern, legally, is

[jira] Commented: (MAHOUT-182) New helper methods for Matrix: times(Vector), timesSquared(Vector), numRows() and numCols()

2009-11-22 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12781146#action_12781146 ] Grant Ingersoll commented on MAHOUT-182: reviewing this morning. New helper

[jira] Resolved: (MAHOUT-182) New helper methods for Matrix: times(Vector), timesSquared(Vector), numRows() and numCols()

2009-11-22 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved MAHOUT-182. Resolution: Fixed Fix Version/s: 0.3 Committed revision 883094. New helper

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-11-18 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12779456#action_12779456 ] Grant Ingersoll commented on MAHOUT-165: All sounding pretty good. If you don't

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-11-17 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12778832#action_12778832 ] Grant Ingersoll commented on MAHOUT-165: Shashi, can you make sure the patch is up

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-11-17 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12778831#action_12778831 ] Grant Ingersoll commented on MAHOUT-165: Yep, I think we are all agreed on Colt.

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-11-17 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12779111#action_12779111 ] Grant Ingersoll commented on MAHOUT-165: bq. So I found Wolfgang Hoschek, the

[jira] Commented: (MAHOUT-198) Cleanup pom, remove lib dependencies, etc.

2009-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12777003#action_12777003 ] Grant Ingersoll commented on MAHOUT-198: Yep, still require mvn install As for

[jira] Commented: (MAHOUT-198) Cleanup pom, remove lib dependencies, etc.

2009-11-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12777005#action_12777005 ] Grant Ingersoll commented on MAHOUT-198: OK, I have a fix for the mail thing.

[jira] Commented: (MAHOUT-198) Cleanup pom, remove lib dependencies, etc.

2009-11-11 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12776733#action_12776733 ] Grant Ingersoll commented on MAHOUT-198: Committed revision 835150. Please test.

[jira] Commented: (MAHOUT-190) Make all instance fields private

2009-10-27 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12770740#action_12770740 ] Grant Ingersoll commented on MAHOUT-190: -1 on a blanket move to private nor final,

[jira] Updated: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-10-19 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-163: --- Fix Version/s: (was: 0.2) 0.3 Moving to 0.3, I'd like to see this be

[jira] Commented: (MAHOUT-114) Release Process Needs to sign published dependencies such as Hadoop, etc.

2009-10-17 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12766884#action_12766884 ] Grant Ingersoll commented on MAHOUT-114: Yeah, they are, to some extent our

[jira] Commented: (MAHOUT-114) Release Process Needs to sign published dependencies such as Hadoop, etc.

2009-10-16 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12766540#action_12766540 ] Grant Ingersoll commented on MAHOUT-114: Every artifact we release, we have to sign

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-10-15 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12766231#action_12766231 ] Grant Ingersoll commented on MAHOUT-165: Shashi's vectors are at:

[jira] Updated: (MAHOUT-155) ARFF VectorIterable

2009-10-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-155: --- Fix Version/s: (was: 0.2) 0.3 ARFF VectorIterable

[jira] Updated: (MAHOUT-181) DistanceMeasure is broken: iteration is done over nonZeroElements of v1.plus(v2), not v1.minus(v2)

2009-10-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-181: --- Resolution: Fixed Status: Resolved (was: Patch Available) Committed revision

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-10-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12765554#action_12765554 ] Grant Ingersoll commented on MAHOUT-165: Shashi, can you share your test vectors?

[jira] Assigned: (MAHOUT-181) DistanceMeasure is broken: iteration is done over nonZeroElements of v1.plus(v2), not v1.minus(v2)

2009-10-10 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned MAHOUT-181: -- Assignee: Grant Ingersoll DistanceMeasure is broken: iteration is done over

[jira] Commented: (MAHOUT-181) DistanceMeasure is broken: iteration is done over nonZeroElements of v1.plus(v2), not v1.minus(v2)

2009-10-10 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12764369#action_12764369 ] Grant Ingersoll commented on MAHOUT-181: Jake, can you bring this patch up to date?

[jira] Commented: (MAHOUT-138) Convert main() methods to use Commons CLI for argument processing

2009-10-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12763265#action_12763265 ] Grant Ingersoll commented on MAHOUT-138: I think we just need to go through the

[jira] Commented: (MAHOUT-138) Convert main() methods to use Commons CLI for argument processing

2009-10-06 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12762776#action_12762776 ] Grant Ingersoll commented on MAHOUT-138: I don't understand why this was moved to

[jira] Commented: (MAHOUT-138) Convert main() methods to use Commons CLI for argument processing

2009-10-06 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12762811#action_12762811 ] Grant Ingersoll commented on MAHOUT-138: I've been just committing as I go, so I

[jira] Updated: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-10-01 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-165: --- Attachment: mahout-165.patch This gets the VectorTest testEquals to pass. Also fixes an

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-10-01 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12761201#action_12761201 ] Grant Ingersoll commented on MAHOUT-165: The exception in the test is: {quote}

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12760898#action_12760898 ] Grant Ingersoll commented on MAHOUT-165: There are some thoughts on equals, etc. in

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-30 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12760902#action_12760902 ] Grant Ingersoll commented on MAHOUT-165: There are some thoughts on equals, etc. in

[jira] Resolved: (MAHOUT-160) ClusterDumper utility to output all the clusters in all sequence files and points

2009-09-17 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved MAHOUT-160. Resolution: Fixed ClusterDumper utility to output all the clusters in all sequence files

[jira] Resolved: (MAHOUT-146) Make Wikipedia Example Classifier more generic

2009-09-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved MAHOUT-146. Resolution: Fixed Make Wikipedia Example Classifier more generic

[jira] Commented: (MAHOUT-138) Convert main() methods to use Commons CLI for argument processing

2009-09-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12755029#action_12755029 ] Grant Ingersoll commented on MAHOUT-138: I think for these, you can just start

[jira] Commented: (MAHOUT-171) Move deployment to repository.apache.org

2009-09-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12755153#action_12755153 ] Grant Ingersoll commented on MAHOUT-171: I think you can request an account. Move

[jira] Assigned: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned MAHOUT-165: -- Assignee: Grant Ingersoll Using better primitives hash for sparse vector for

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12754586#action_12754586 ] Grant Ingersoll commented on MAHOUT-165: Ted, can you bring your patch up to date

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12754585#action_12754585 ] Grant Ingersoll commented on MAHOUT-165: Shashi, did you try Ted's patch? If that

[jira] Commented: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-09-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12754590#action_12754590 ] Grant Ingersoll commented on MAHOUT-163: Hmm, deleting the out of cluster docs from

[jira] Updated: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-09-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-163: --- Attachment: MAHOUT-163.patch Updates some of the Lucene code a wee bit. Get (better)

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-11 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12754147#action_12754147 ] Grant Ingersoll commented on MAHOUT-165: I think we will want doubles, but perhaps

[jira] Created: (MAHOUT-176) Remove VectorIterable in favor of just using IterableVector

2009-09-09 Thread Grant Ingersoll (JIRA)
Remove VectorIterable in favor of just using IterableVector - Key: MAHOUT-176 URL: https://issues.apache.org/jira/browse/MAHOUT-176 Project: Mahout Issue Type: Improvement

[jira] Updated: (MAHOUT-176) Remove VectorIterable in favor of just using IterableVector

2009-09-09 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-176: --- Fix Version/s: 0.2 Remove VectorIterable in favor of just using IterableVector

[jira] Assigned: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-09-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned MAHOUT-163: -- Assignee: Grant Ingersoll Get (better) cluster labels using Log Likelihood Ratio

[jira] Commented: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-09-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12752861#action_12752861 ] Grant Ingersoll commented on MAHOUT-163: Shashi, I'm having trouble applying the

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-09-04 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12751484#action_12751484 ] Grant Ingersoll commented on MAHOUT-165: Yes, Sean is correct. _IF_ the part of

[jira] Resolved: (MAHOUT-159) SparseVector and DenseVector hashCode does not conform to the Java standard

2009-09-01 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved MAHOUT-159. Resolution: Fixed Fix Version/s: 0.2 Committed revision 810184. SparseVector and

[jira] Commented: (MAHOUT-168) Need integer compression routines

2009-08-31 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12749703#action_12749703 ] Grant Ingersoll commented on MAHOUT-168: Can we leverage some of Lucene's

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-08-28 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12748870#action_12748870 ] Grant Ingersoll commented on MAHOUT-165: Shashi, Any thoughts on whether we can

[jira] Commented: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-08-20 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12745415#action_12745415 ] Grant Ingersoll commented on MAHOUT-165:

[jira] Issue Comment Edited: (MAHOUT-165) Using better primitives hash for sparse vector for performance gains

2009-08-20 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12745415#action_12745415 ] Grant Ingersoll edited comment on MAHOUT-165 at 8/20/09 5:05 AM:

[jira] Commented: (MAHOUT-121) Speed up distance calculations for sparse vectors

2009-08-19 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12745015#action_12745015 ] Grant Ingersoll commented on MAHOUT-121: Let's open a new issue for this one, as

[jira] Commented: (MAHOUT-121) Speed up distance calculations for sparse vectors

2009-08-19 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12745097#action_12745097 ] Grant Ingersoll commented on MAHOUT-121: Hi Rob, Thanks! Here's the ASF's stance

[jira] Commented: (MAHOUT-123) Implement Latent Dirichlet Allocation

2009-08-17 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12744057#action_12744057 ] Grant Ingersoll commented on MAHOUT-123: Committed revision 804979. Implement

[jira] Commented: (MAHOUT-124) Online Classification using HBase

2009-08-17 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12744078#action_12744078 ] Grant Ingersoll commented on MAHOUT-124: A few comments after a quick scan: 1.

[jira] Commented: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-08-13 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12742842#action_12742842 ] Grant Ingersoll commented on MAHOUT-163: I only briefly scanned the patch, but I've

[jira] Resolved: (MAHOUT-147) Wikipedia Example improvements

2009-08-13 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved MAHOUT-147. Resolution: Fixed Wikipedia Example improvements --

[jira] Updated: (MAHOUT-83) Mahout/Hama Integration

2009-08-13 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-83?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-83: -- Fix Version/s: (was: 0.2) Marking as unknown release date, since there is no patch for

[jira] Updated: (MAHOUT-163) Get (better) cluster labels using Log Likelihood Ratio

2009-08-13 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-163: --- Fix Version/s: 0.2 Get (better) cluster labels using Log Likelihood Ratio

[jira] Updated: (MAHOUT-121) Speed up distance calculations for sparse vectors

2009-08-12 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-121: --- Resolution: Fixed Status: Resolved (was: Patch Available) I think we have this one

[jira] Commented: (MAHOUT-159) SparseVector and DenseVector hashCode does not conform to the Java standard

2009-08-09 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12741065#action_12741065 ] Grant Ingersoll commented on MAHOUT-159: My only suggestion is that the hashCode

[jira] Commented: (MAHOUT-123) Implement Latent Dirichlet Allocation

2009-08-09 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12741064#action_12741064 ] Grant Ingersoll commented on MAHOUT-123: bq. The problem was that the edited patch

  1   2   3   4   >