On Wed, Sep 22, 2010 at 7:05 AM, Sean Owen <[email protected]> wrote:

> And between them, if I squint, that (text and writable) answers the use
> cases.

Except when it takes 20GB of extra heap to serialize a model (it really can).

> Wild idea: is that about right? What happens if GSON is removed, Avro
> not used, Serializable not used?

Right. This migration sounds good to me. It gives us what we have with better memory feasibility.

> Mahout's nature will always be a bit of a 'bazaar' project, really a
> loose confederation of implementations that are not entirely
> consistent. I imagine though that taking targeted shots at chunky
> issues like this (and standardizing on Hadoop 0.20.x APIs for
> instance) gets rid of 80% of the divergence. And that's pretty fine
> for such a project. Better perhaps than many closed / proprietary code
> bases.

Sounds right to me.
