Hi Grant,

For me it would be easier if you commit what you have now and all of us commit to work through the remaining issues. I think we understand the migration gotcha patterns, we just haven't found them all yet. Having to install/reinstall big patch wads doesn't help IMO.



Grant Ingersoll (JIRA) wrote:
     [ 
https://issues.apache.org/jira/browse/MAHOUT-137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Ingersoll updated MAHOUT-137:
-----------------------------------

    Attachment: MAHOUT-137.patch

Fuzzy kMeans conversion, but tests fail.  Some doubt in my mind about the 
validity of some of the tests, but still working through those.  Could use some 
extra eyes from the authors of these pieces.

Convert Clustering Algs to use Vector Writable
----------------------------------------------

                Key: MAHOUT-137
                URL: https://issues.apache.org/jira/browse/MAHOUT-137
            Project: Mahout
         Issue Type: Improvement
           Reporter: Grant Ingersoll
           Assignee: Grant Ingersoll
            Fix For: 0.2

        Attachments: MAHOUT-137.patch, MAHOUT-137.patch, MAHOUT-137.patch, 
MAHOUT-137.patch, MAHOUT-137.patch


All M/R jobs should use Vector writable instead of encoding and decoding 
strings.  We can have a separate utility that converts serialized GSON, 
Strings, whatever into the appropriate vectors.  See MAHOUT-136 and 
http://www.lucidimagination.com/search/document/6a55f260826fd77f/jira_commented_mahout_136_change_canopy_mr_implementation_to_use_vector_writable


Attachment: PGP.sig
Description: PGP signature

Reply via email to