[jira] Commented: (MAHOUT-236) Cluster Evaluation Tools

2010-04-26 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12860981#action_12860981 ] Jeff Eastman commented on MAHOUT-236: - Ok, the above patch was committed on the 21st

[jira] Commented: (MAHOUT-297) Canopy and Kmeans clustering slows down on using SeqAccVector for center

2010-04-26 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12861194#action_12861194 ] Jeff Eastman commented on MAHOUT-297: - I don't understand why the constructors for

[jira] Issue Comment Edited: (MAHOUT-297) Canopy and Kmeans clustering slows down on using SeqAccVector for center

2010-04-26 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12861194#action_12861194 ] Jeff Eastman edited comment on MAHOUT-297 at 4/26/10 9:15 PM: --

[jira] Commented: (MAHOUT-236) Cluster Evaluation Tools

2010-04-20 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12859027#action_12859027 ] Jeff Eastman commented on MAHOUT-236: - I'm running into a challenge integrating Fuzzy

[jira] Updated: (MAHOUT-236) Cluster Evaluation Tools

2010-04-20 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-236: Attachment: MAHOUT-236.patch Added a mean shift clustering job and now it works for CDbw too. On

[jira] Commented: (MAHOUT-270) Make ClusterDumper dump Dirichlet clusters too

2010-04-07 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854742#action_12854742 ] Jeff Eastman commented on MAHOUT-270: - r931372 renames Printable to Cluster and adds

[jira] Commented: (MAHOUT-339) Class Cast Exception Running Synthetic Control MeanShift Clustering Job

2010-04-02 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852948#action_12852948 ] Jeff Eastman commented on MAHOUT-339: - Problem with example was introduced by a recent

[jira] Created: (MAHOUT-339) Class Cast Exception Running Synthetic Control MeanShift Clustering Job

2010-03-17 Thread Jeff Eastman (JIRA)
Class Cast Exception Running Synthetic Control MeanShift Clustering Job --- Key: MAHOUT-339 URL: https://issues.apache.org/jira/browse/MAHOUT-339 Project: Mahout Issue

[jira] Commented: (MAHOUT-270) Make ClusterDumper dump Dirichlet clusters too

2010-02-09 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12831678#action_12831678 ] Jeff Eastman commented on MAHOUT-270: - r908235 commits the Printable interface and

[jira] Issue Comment Edited: (MAHOUT-270) Make ClusterDumper dump Dirichlet clusters too

2010-02-09 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12831678#action_12831678 ] Jeff Eastman edited comment on MAHOUT-270 at 2/9/10 9:39 PM: -

[jira] Created: (MAHOUT-276) Alpha_0 mixture parameter is not implemented correctly in Dirichlet

2010-02-07 Thread Jeff Eastman (JIRA)
Alpha_0 mixture parameter is not implemented correctly in Dirichlet --- Key: MAHOUT-276 URL: https://issues.apache.org/jira/browse/MAHOUT-276 Project: Mahout Issue Type: Bug

[jira] Commented: (MAHOUT-276) Alpha_0 mixture parameter is not implemented correctly in Dirichlet

2010-02-07 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12830740#action_12830740 ] Jeff Eastman commented on MAHOUT-276: - The fix involves adding alpha_0 as an argument

[jira] Commented: (MAHOUT-270) Make ClusterDumper dump Dirichlet clusters too

2010-02-07 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12830741#action_12830741 ] Jeff Eastman commented on MAHOUT-270: - I'd like to deprecate the asFormatString()

[jira] Commented: (MAHOUT-270) Make ClusterDumper dump Dirichlet clusters too

2010-01-31 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12806873#action_12806873 ] Jeff Eastman commented on MAHOUT-270: - In the beginning, vectors, canopies and clusters

[jira] Created: (MAHOUT-270) Make ClusterDumper dump Dirichlet clusters too

2010-01-27 Thread Jeff Eastman (JIRA)
Make ClusterDumper dump Dirichlet clusters too -- Key: MAHOUT-270 URL: https://issues.apache.org/jira/browse/MAHOUT-270 Project: Mahout Issue Type: Improvement Components: Clustering

[jira] Resolved: (MAHOUT-251) Generalize Dirichlet models and model distributions to handle n-d and sparse vectors

2010-01-18 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman resolved MAHOUT-251. - Resolution: Fixed r900519 wrapped up loose ends in the patch, adding new command line arguments

[jira] Created: (MAHOUT-251) Generalize Dirichlet models and model distributions to handle n-d and sparse vectors

2010-01-15 Thread Jeff Eastman (JIRA)
Generalize Dirichlet models and model distributions to handle n-d and sparse vectors Key: MAHOUT-251 URL: https://issues.apache.org/jira/browse/MAHOUT-251 Project:

[jira] Updated: (MAHOUT-251) Generalize Dirichlet models and model distributions to handle n-d and sparse vectors

2010-01-15 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-251: Attachment: MAHOUT-251.patch This patch generalizes the 2-d dense models by introducing a new

[jira] Updated: (MAHOUT-167) Convert clustering code to Hadoop 0.20 API

2009-10-21 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-167: Attachment: MAHOUT-167.patch Work in progress patch which compiles most Canopy changes needed for

[jira] Commented: (MAHOUT-136) Change Canopy MR Implementation to use Vector Writable

2009-09-28 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12760462#action_12760462 ] Jeff Eastman commented on MAHOUT-136: - I think this issue has been completed and should

[jira] Commented: (MAHOUT-137) Convert Clustering Algs to use Vector Writable

2009-06-24 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12723678#action_12723678 ] Jeff Eastman commented on MAHOUT-137: - revision 788071 and 788116 implement Writable

[jira] Commented: (MAHOUT-137) Convert Clustering Algs to use Vector Writable

2009-06-23 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12723193#action_12723193 ] Jeff Eastman commented on MAHOUT-137: - Short term: have the examples job just convert

[jira] Commented: (MAHOUT-137) Convert Clustering Algs to use Vector Writable

2009-06-22 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12722663#action_12722663 ] Jeff Eastman commented on MAHOUT-137: - Here's some code (which depends upon the

[jira] Commented: (MAHOUT-137) Convert Clustering Algs to use Vector Writable

2009-06-22 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12722708#action_12722708 ] Jeff Eastman commented on MAHOUT-137: - Yes, I saw that and that was my original

[jira] Commented: (MAHOUT-137) Convert Clustering Algs to use Vector Writable

2009-06-21 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12722360#action_12722360 ] Jeff Eastman commented on MAHOUT-137: - You got bit by the fact that the reader is not

[jira] Commented: (MAHOUT-137) Convert Clustering Algs to use Vector Writable

2009-06-21 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12722367#action_12722367 ] Jeff Eastman commented on MAHOUT-137: - I find it a bit troubling that

[jira] Commented: (MAHOUT-137) Convert Clustering Algs to use Vector Writable

2009-06-21 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12722372#action_12722372 ] Jeff Eastman commented on MAHOUT-137: - How about we add a job argument to set whether

[jira] Commented: (MAHOUT-137) Convert Clustering Algs to use Vector Writable

2009-06-21 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12722382#action_12722382 ] Jeff Eastman commented on MAHOUT-137: - Evidently, Hadoop needs to know the concrete

[jira] Commented: (MAHOUT-137) Convert Clustering Algs to use Vector Writable

2009-06-21 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12722383#action_12722383 ] Jeff Eastman commented on MAHOUT-137: - This is a bit more efficient patch for the

[jira] Commented: (MAHOUT-137) Convert Clustering Algs to use Vector Writable

2009-06-20 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12722246#action_12722246 ] Jeff Eastman commented on MAHOUT-137: - MAHOUT-136 changed Canopy to use Writable

[jira] Created: (MAHOUT-136) Change Canopy MR Implementation to use Vector Writable

2009-06-19 Thread Jeff Eastman (JIRA)
Change Canopy MR Implementation to use Vector Writable -- Key: MAHOUT-136 URL: https://issues.apache.org/jira/browse/MAHOUT-136 Project: Mahout Issue Type: Improvement

[jira] Updated: (MAHOUT-121) Speed up distance calculations for sparse vectors

2009-06-17 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-121: Attachment: MAHOUT-121jfe.patch I still had problems applying the previous patch so here's another

[jira] Commented: (MAHOUT-121) Speed up distance calculations for sparse vectors

2009-06-17 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12720764#action_12720764 ] Jeff Eastman commented on MAHOUT-121: - A bit premature perhaps. There is still an

[jira] Commented: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2009-06-17 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12720768#action_12720768 ] Jeff Eastman commented on MAHOUT-65: Perhaps, though the Json representation of Sean's

[jira] Issue Comment Edited: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2009-06-17 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12720871#action_12720871 ] Jeff Eastman edited comment on MAHOUT-65 at 6/17/09 1:38 PM: - I

[jira] Commented: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2009-06-17 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12720886#action_12720886 ] Jeff Eastman commented on MAHOUT-65: I changed the code to substitute 10 random double

[jira] Updated: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2009-06-16 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-65: --- Attachment: MAHOUT-65d.patch Naming a Vector and having that be stateful - as opposed to bindings

[jira] Updated: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2009-06-15 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-65: --- Attachment: MAHOUT-65b.patch Here's the patch. Add Element Labels to Vectors and Matrices

[jira] Updated: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2009-06-15 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-65: --- Attachment: MAHOUT-65c.patch This patch goes a step further than 65b and changes Vector and Matrix

[jira] Commented: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2009-06-15 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12719863#action_12719863 ] Jeff Eastman commented on MAHOUT-65: Here's an issue that needs some further discussion:

[jira] Commented: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2009-06-13 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12719181#action_12719181 ] Jeff Eastman commented on MAHOUT-65: Will do, Jeff Add Element Labels to Vectors

[jira] Resolved: (MAHOUT-129) Kmeans sample does not expose numIterations control from KMeansDriver

2009-05-29 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman resolved MAHOUT-129. - Resolution: Fixed Rename completed Kmeans sample does not expose numIterations control from

[jira] Updated: (MAHOUT-109) Implementation of Cosine distance measure, plus unit test.

2009-05-13 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-109: Resolution: Fixed Status: Resolved (was: Patch Available) Committed to trunk r774521

[jira] Commented: (MAHOUT-66) EuclideanDistanceMeasure and ManhattanDistanceMeasure classes are not optimized for Sparse Vectors

2009-05-13 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-66?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709184#action_12709184 ] Jeff Eastman commented on MAHOUT-66: r 774566 implemented the SparseVector times

[jira] Commented: (MAHOUT-66) EuclideanDistanceMeasure and ManhattanDistanceMeasure classes are not optimized for Sparse Vectors

2009-05-13 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-66?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709265#action_12709265 ] Jeff Eastman commented on MAHOUT-66: Sure. Let's consider them individually: -

[jira] Resolved: (MAHOUT-118) Mahout needs to respect the file system type when getting a FileSystem for an input or output path

2009-04-19 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman resolved MAHOUT-118. - Resolution: Fixed Given Deneche's comment above I'm going to mark this issue fixed. Thanks

[jira] Assigned: (MAHOUT-118) Mahout needs to respect the file system type when getting a FileSystem for an input or output path

2009-04-16 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman reassigned MAHOUT-118: --- Assignee: Jeff Eastman Mahout needs to respect the file system type when getting a

[jira] Commented: (MAHOUT-116) Decode matrix methods

2009-04-14 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12698916#action_12698916 ] Jeff Eastman commented on MAHOUT-116: - We have been living with the ad-hoc

[jira] Updated: (MAHOUT-30) dirichlet process implementation

2009-03-15 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-30: --- Attachment: MAHOUT-30f.patch Final patch file is ready to commit. Need to add entry to pom for gson

[jira] Updated: (MAHOUT-30) dirichlet process implementation

2009-03-15 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-30: --- Resolution: Fixed Status: Resolved (was: Patch Available) Committed revision 754797.

[jira] Updated: (MAHOUT-30) dirichlet process implementation

2009-03-14 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-30: --- Attachment: screenshot-1.jpg generateSamples(500, 0, 0, 0.5); generateSamples(500, 2, 0,

[jira] Updated: (MAHOUT-30) dirichlet process implementation

2009-03-12 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-30: --- Attachment: dirichlet-2.tar - fixed bug in rBeta where arguments to rGamma were backwards - fixed

[jira] Updated: (MAHOUT-30) dirichlet process implementation

2009-01-28 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-30: --- Attachment: MAHOUT-30d.patch This patch moves the display-related classes into the examples subtree

[jira] Updated: (MAHOUT-30) dirichlet process implementation

2009-01-04 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-30: --- Attachment: (was: jeastman.vcf) dirichlet process implementation

[jira] Updated: (MAHOUT-30) dirichlet process implementation

2008-11-27 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-30: --- Attachment: MAHOUT-30c.patch This patch fixes a randomization problem caused by using two Random

[jira] Assigned: (MAHOUT-30) dirichlet process implementation

2008-11-15 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman reassigned MAHOUT-30: -- Assignee: Jeff Eastman dirichlet process implementation

[jira] Updated: (MAHOUT-30) dirichlet process implementation

2008-11-15 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-30: --- Attachment: MAHOUT-30b.patch This patch is a complete implementation of a non-M/R Dirichlet Process

[jira] Commented: (MAHOUT-30) dirichlet process implementation

2008-11-15 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647880#action_12647880 ] Jeff Eastman commented on MAHOUT-30: The above patch makes several improvements to the

[jira] Commented: (MAHOUT-30) dirichlet process implementation

2008-11-12 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647186#action_12647186 ] Jeff Eastman commented on MAHOUT-30: I refactored again and was able to eliminate

[jira] Issue Comment Edited: (MAHOUT-30) dirichlet process implementation

2008-11-12 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12647186#action_12647186 ] jeastman edited comment on MAHOUT-30 at 11/12/08 8:50 PM: -- I

[jira] Commented: (MAHOUT-30) dirichlet process implementation

2008-11-11 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12646790#action_12646790 ] Jeff Eastman commented on MAHOUT-30: I did some refactoring to better localize the major

[jira] Commented: (MAHOUT-82) Canopy map intermediate file structure should be keyed by canopyId.

2008-10-17 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-82?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12640559#action_12640559 ] Jeff Eastman commented on MAHOUT-82: I applied the patch and the unit tests continue to

[jira] Commented: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2008-10-17 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12640616#action_12640616 ] Jeff Eastman commented on MAHOUT-65: I did a test implementation of element labeling

[jira] Created: (MAHOUT-86) A New Vector Assignment Operator

2008-10-17 Thread Jeff Eastman (JIRA)
A New Vector Assignment Operator Key: MAHOUT-86 URL: https://issues.apache.org/jira/browse/MAHOUT-86 Project: Mahout Issue Type: New Feature Components: Matrix Reporter: Jeff Eastman

[jira] Resolved: (MAHOUT-82) Canopy map intermediate file structure should be keyed by canopyId.

2008-10-17 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-82?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman resolved MAHOUT-82. Resolution: Fixed Assignee: Jeff Eastman r705676 committed the change. Thanks Edward.

[jira] Resolved: (MAHOUT-86) A New Vector Assignment Operator

2008-10-17 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-86?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman resolved MAHOUT-86. Resolution: Fixed Assignee: Jeff Eastman r705702 committed these minor additions to the

[jira] Updated: (MAHOUT-30) dirichlet process implementation

2008-10-17 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-30?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-30: --- Attachment: MAHOUT-30.patch Here's a work-in-progress Dirichlet Process Clustering algorithm that Ted

[jira] Commented: (MAHOUT-82) Canopy map intermediate file structure should be keyed by canopyId.

2008-10-14 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-82?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12639691#action_12639691 ] Jeff Eastman commented on MAHOUT-82: This assertion needs a lot more justification

[jira] Commented: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2008-07-22 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12615560#action_12615560 ] Jeff Eastman commented on MAHOUT-65: This thread is beginning to diverge significantly

[jira] Commented: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2008-07-22 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-65?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12615869#action_12615869 ] Jeff Eastman commented on MAHOUT-65: Whether or not we decide to split this issue or

[jira] Created: (MAHOUT-65) Add Element Labels to Vectors and Matrices

2008-07-06 Thread Jeff Eastman (JIRA)
Add Element Labels to Vectors and Matrices -- Key: MAHOUT-65 URL: https://issues.apache.org/jira/browse/MAHOUT-65 Project: Mahout Issue Type: New Feature Components: Matrix

[jira] Commented: (MAHOUT-54) parallelize k-means sharing the predominance of canopies

2008-05-12 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-54?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12596266#action_12596266 ] Jeff Eastman commented on MAHOUT-54: What I get is you are concerned by kmeans comparing

[jira] Resolved: (MAHOUT-47) Point class is now redundant and should be removed

2008-04-23 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-47?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman resolved MAHOUT-47. Resolution: Fixed r651077 completed the removal of Point and its unit test from all clustering

[jira] Assigned: (MAHOUT-48) isConverged() and converge flag OK?

2008-04-23 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-48?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman reassigned MAHOUT-48: -- Assignee: Jeff Eastman isConverged() and converge flag OK?

[jira] Resolved: (MAHOUT-48) isConverged() and converge flag OK?

2008-04-23 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-48?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman resolved MAHOUT-48. Resolution: Fixed r651087 cleaned up isConverged methods in KMeansDriver and MeanShiftCanopyJob. I

[jira] Commented: (MAHOUT-47) Point class is now redundant and should be removed

2008-04-21 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-47?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12590973#action_12590973 ] Jeff Eastman commented on MAHOUT-47: r650209 added asFormatString() to Vector and Matrix

[jira] Created: (MAHOUT-47) Point class is now redundant and should be removed

2008-04-18 Thread Jeff Eastman (JIRA)
Point class is now redundant and should be removed -- Key: MAHOUT-47 URL: https://issues.apache.org/jira/browse/MAHOUT-47 Project: Mahout Issue Type: Improvement Components:

[jira] Commented: (MAHOUT-36) WeightedDistanceMeasure

2008-04-14 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-36?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12588606#action_12588606 ] Jeff Eastman commented on MAHOUT-36: The problem I see with including the weights in

[jira] Commented: (MAHOUT-39) Vector improvments

2008-04-14 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-39?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12588694#action_12588694 ] Jeff Eastman commented on MAHOUT-39: -1 Vector#assign already implements fill(double)

[jira] Resolved: (MAHOUT-15) Investigate Mean Shift Clustering

2008-04-14 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-15?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman resolved MAHOUT-15. Resolution: Fixed r648085 committed the latest patch, which additionally adds a method to the

[jira] Updated: (MAHOUT-23) Getting a row or column from a matrix view gives a row or column from the wrapped matrix.

2008-04-13 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-23?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-23: --- Component/s: Matrix Getting a row or column from a matrix view gives a row or column from the

[jira] Commented: (MAHOUT-36) WeightedDistanceMeasure

2008-04-13 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-36?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12588414#action_12588414 ] Jeff Eastman commented on MAHOUT-36: This should be straightforward, as DistanceMeasure

[jira] Updated: (MAHOUT-20) Migrate Canopy and KMeans Implementations to Vectors

2008-04-09 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-20?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-20: --- Attachment: jeastman.vcf +1 There are many, many opportunities to improve on the Vector and Matrix

[jira] Commented: (MAHOUT-23) Getting a row or column from a matrix view gives a row or column from the wrapped matrix.

2008-04-01 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-23?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12584343#action_12584343 ] Jeff Eastman commented on MAHOUT-23: Actually, there is a whole unit test class missing

[jira] Updated: (MAHOUT-23) Getting a row or column from a matrix view gives a row or column from the wrapped matrix.

2008-04-01 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-23?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-23: --- Attachment: MAHOUT-23b.patch This patch adds a new unit test class, TestMatrixView, and fixes several

[jira] Created: (MAHOUT-15) Investigate Mean Shift Clustering

2008-03-11 Thread Jeff Eastman (JIRA)
Investigate Mean Shift Clustering - Key: MAHOUT-15 URL: https://issues.apache.org/jira/browse/MAHOUT-15 Project: Mahout Issue Type: New Feature Components: Clustering Reporter: Jeff

[jira] Updated: (MAHOUT-15) Investigate Mean Shift Clustering

2008-03-11 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-15?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-15: --- Attachment: MAHOUT-15a.patch I've implemented a minimal, non-MR version of the algorithm below to see

[jira] Issue Comment Edited: (MAHOUT-6) Need a matrix implementation

2008-03-07 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-6?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12576037#action_12576037 ] jeastman edited comment on MAHOUT-6 at 3/7/08 8:20 AM: --- Boy, I am

[jira] Updated: (MAHOUT-6) Need a matrix implementation

2008-03-03 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-6?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-6: -- Attachment: MAHOUT-6j.diff Sorted out the two patches and added back my Vector unit tests that fell out

[jira] Updated: (MAHOUT-6) Need a matrix implementation

2008-03-01 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-6?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-6: -- Attachment: MAHOUT-6i.diff Initial implementation of Matrices and unit tests based upon the spirit of

[jira] Updated: (MAHOUT-6) Need a matrix implementation

2008-02-28 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-6?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-6: -- Attachment: MAHOUT-6g.diff - Renamed Matrix1D to Vector and Matrix2D to Matrix, in all interfaces and

[jira] Updated: (MAHOUT-6) Need a matrix implementation

2008-02-26 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-6?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-6: -- Attachment: MAHOUT-6e.diff Ok, I get the case for side-effects. It is a line to be crossed with eyes

[jira] Commented: (MAHOUT-6) Need a matrix implementation

2008-02-25 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-6?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12572112#action_12572112 ] Jeff Eastman commented on MAHOUT-6: --- On the point about interfaces, the current diff has

[jira] Updated: (MAHOUT-5) Implement a k-means clustering prototype

2008-02-25 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-5?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-5: -- Attachment: MAHOUT-5e.diff I did the merge with trunk r630688 and this diff runs all the unit tests.

[jira] Commented: (MAHOUT-6) Need a matrix implementation

2008-02-25 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-6?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12572120#action_12572120 ] Jeff Eastman commented on MAHOUT-6: --- Well, here's a story that suggests using checked

[jira] Updated: (MAHOUT-6) Need a matrix implementation

2008-02-25 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-6?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-6: -- Attachment: MAHOUT-6d.diff This patch adds a Matrix1DView wrapper and tests thereof. In order to avoid

[jira] Updated: (MAHOUT-6) Need a matrix implementation

2008-02-21 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-6?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Eastman updated MAHOUT-6: -- Attachment: MAHOUT-6a.diff +1, assuming you can come up with stories for all the leaves (grin), the overall

[jira] Issue Comment Edited: (MAHOUT-5) Implement a k-means clustering prototype

2008-02-19 Thread Jeff Eastman (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-5?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12570576#action_12570576 ] jeastman edited comment on MAHOUT-5 at 2/19/08 9:19 PM: This

[jira] Created: (MAHOUT-5) Implement a k-means clustering prototype

2008-02-17 Thread Jeff Eastman (JIRA)
Implement a k-means clustering prototype - Key: MAHOUT-5 URL: https://issues.apache.org/jira/browse/MAHOUT-5 Project: Mahout Issue Type: New Feature Components: Clustering Affects