Yeah, I was debating doing that. I don't like committing tests that fail. Might make sense to branch.

On Jun 22, 2009, at 11:20 PM, Jeff Eastman wrote:

Hi Grant,

For me it would be easier if you commit what you have now and all of us commit to work through the remaining issues. I think we understand the migration gotcha patterns, we just haven't found them all yet. Having to install/reinstall big patch wads doesn't help IMO.



Grant Ingersoll (JIRA) wrote:
[ https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll updated MAHOUT-137:
-----------------------------------

   Attachment: MAHOUT-137.patch

Fuzzy kMeans conversion, but tests fail. Some doubt in my mind about the validity of some of the tests, but still working through those. Could use some extra eyes from the authors of these pieces.


Convert Clustering Algs to use Vector Writable
----------------------------------------------

               Key: MAHOUT-137
               URL: https://issues.apache.org/jira/browse/MAHOUT-137
           Project: Mahout
        Issue Type: Improvement
          Reporter: Grant Ingersoll
          Assignee: Grant Ingersoll
           Fix For: 0.2

Attachments: MAHOUT-137.patch, MAHOUT-137.patch, MAHOUT-137.patch, MAHOUT-137.patch, MAHOUT-137.patch


All M/R jobs should use Vector writable instead of encoding and decoding strings. We can have a separate utility that converts serialized GSON, Strings, whatever into the appropriate vectors. See MAHOUT-136 and http://www.lucidimagination.com/search/document/6a55f260826fd77f/jira_commented_mahout_136_change_canopy_mr_implementation_to_use_vector_writable
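
To illustrate the idea above, here is a minimal sketch of binary vector serialization, as opposed to round-tripping through strings. It assumes only the JDK: DenseVectorSketch and its toBytes/fromBytes helpers are hypothetical names made up for this example, not Mahout classes. The write/readFields pair mirrors the shape of Hadoop's Writable contract, but the class deliberately does not implement the Hadoop interface so that it runs standalone.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.Arrays;

// Hypothetical stand-in for a writable dense vector; not a Mahout class.
public class DenseVectorSketch {
    private double[] values = new double[0];

    public DenseVectorSketch() {}

    public DenseVectorSketch(double[] values) {
        this.values = values.clone();
    }

    public double[] values() {
        return values.clone();
    }

    // Same shape as Hadoop's Writable.write(DataOutput): length, then raw doubles.
    public void write(DataOutput out) throws IOException {
        out.writeInt(values.length);
        for (double v : values) {
            out.writeDouble(v);
        }
    }

    // Same shape as Hadoop's Writable.readFields(DataInput): reads what write() wrote.
    public void readFields(DataInput in) throws IOException {
        int n = in.readInt();
        values = new double[n];
        for (int i = 0; i < n; i++) {
            values[i] = in.readDouble();
        }
    }

    // Illustrative helper: serialize a vector to a byte array.
    public static byte[] toBytes(DenseVectorSketch v) {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            v.write(new DataOutputStream(buffer));
            return buffer.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Illustrative helper: rebuild a vector from bytes produced by toBytes.
    public static DenseVectorSketch fromBytes(byte[] bytes) {
        try {
            DenseVectorSketch v = new DenseVectorSketch();
            v.readFields(new DataInputStream(new ByteArrayInputStream(bytes)));
            return v;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        DenseVectorSketch original = new DenseVectorSketch(new double[] {1.5, -2.0, 0.1});
        byte[] bytes = toBytes(original);
        DenseVectorSketch copy = fromBytes(bytes);
        // The round trip is exact: no string parsing or formatting is involved.
        System.out.println(Arrays.equals(original.values(), copy.values())); // prints "true"
        System.out.println(bytes.length); // prints "28" (4-byte length + 3 * 8-byte doubles)
    }
}
```

In a real M/R job the mapper and reducer would exchange such objects directly as keys or values, and a separate converter would handle any legacy string or GSON input, as the issue description suggests.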





--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using Solr/Lucene:
http://www.lucidimagination.com/search
