On Wed, Sep 22, 2010 at 7:05 AM, Sean Owen <[email protected]> wrote:

> And between them, if I squint, that (text and writable) answers the use
> cases.
>

Except when it takes 20GB of extra heap to serialize a model (it really
can).


>
> Wild idea: is that about right? What happens if GSON is removed, Avro
> not used, Serializable not used?
>

Right.  This migration sounds good to me.  It gives us what we have now,
with a far more manageable memory footprint.
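
A minimal sketch of the memory point, assuming a model that is nothing more than a weight vector. The class and method names are hypothetical, and it uses plain java.io rather than Hadoop so it stands alone. A Writable-style write streams each field straight to the output, so the extra heap needed is roughly the stream's buffer rather than a second in-memory copy of the whole model as text or as an object graph.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.Arrays;

// Hypothetical example class; not part of Mahout or Hadoop.
public class StreamingModelSketch {

    // Mirrors the shape of Writable.write(DataOutput): emit a length
    // prefix, then each value in turn. Nothing is accumulated in memory.
    static void write(DataOutput out, double[] weights) throws IOException {
        out.writeInt(weights.length);
        for (double w : weights) {
            out.writeDouble(w);
        }
    }

    // Mirrors the shape of Writable.readFields(DataInput): read the
    // length prefix, then fill the array value by value.
    static double[] readFields(DataInput in) throws IOException {
        int n = in.readInt();
        double[] weights = new double[n];
        for (int i = 0; i < n; i++) {
            weights[i] = in.readDouble();
        }
        return weights;
    }

    public static void main(String[] args) throws IOException {
        double[] model = {0.25, -1.5, 3.0};

        // A byte array stands in for a file or HDFS stream here.
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        write(new DataOutputStream(buffer), model);

        double[] restored = readFields(
            new DataInputStream(new ByteArrayInputStream(buffer.toByteArray())));

        System.out.println(Arrays.equals(model, restored));
    }
}
```

With a real file or HDFS stream in place of the byte array, the writer's overhead stays roughly constant regardless of model size, which is the property the JSON route lacks when it builds the whole document in memory first.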


>
> Mahout's nature will always be a bit of a 'bazaar' project, really a
> loose confederation of implementations that are not entirely
> consistent. I imagine though that taking targeted shots at chunky
> issues like this (and standardizing on Hadoop 0.20.x APIs for
> instance) gets rid of 80% of the divergence. And that's pretty fine
> for such a project. Better perhaps than many closed / proprietary code
> bases.
>

Sounds right to me.
