Good discussion. Avro looks very interesting and I'd like to play around
with it some other time, but I'll stick with Writables for this project
because everything else in Mahout uses them.

Here are some benchmarking results for different serialization frameworks,
including Avro:
https://github.com/eishay/jvm-serializers/wiki

Here is what a guest column on the Cloudera blog had to say about Avro:
http://www.cloudera.com/blog/2011/05/three-reasons-why-apache-avro-data-serialization-is-a-good-choice-for-openrtb/
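
For what it's worth, here is a minimal sketch of Sean's point below: a
Writable writes only its field values, and the reader has to know the class
up front to decode them. The Writable interface here is a local stand-in
with the same two methods as org.apache.hadoop.io.Writable, so the sketch
runs without Hadoop on the classpath. IndexedValue and WritableSketch are
hypothetical names for illustration, not anything from Mahout.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Local stand-in (assumption): same two methods as org.apache.hadoop.io.Writable.
interface Writable {
    void write(DataOutput out) throws IOException;
    void readFields(DataInput in) throws IOException;
}

// Hypothetical record: one (index, value) pair.
class IndexedValue implements Writable {
    int index;
    double value;

    IndexedValue() {}

    IndexedValue(int index, double value) {
        this.index = index;
        this.value = value;
    }

    @Override
    public void write(DataOutput out) throws IOException {
        // Only the raw field values are written: no class name, no schema.
        out.writeInt(index);     // 4 bytes
        out.writeDouble(value);  // 8 bytes
    }

    @Override
    public void readFields(DataInput in) throws IOException {
        // The reader must already know the field order and types.
        index = in.readInt();
        value = in.readDouble();
    }
}

class WritableSketch {
    static byte[] toBytes(Writable w) {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buf)) {
            w.write(out);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buf.toByteArray();
    }

    static IndexedValue fromBytes(byte[] bytes) {
        // The caller picks the class; nothing in the bytes says which one it is.
        IndexedValue w = new IndexedValue();
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(bytes))) {
            w.readFields(in);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return w;
    }

    public static void main(String[] args) {
        byte[] bytes = toBytes(new IndexedValue(7, 0.25));
        IndexedValue back = fromBytes(bytes);
        // prints "12 bytes, index=7, value=0.25"
        System.out.println(bytes.length + " bytes, index=" + back.index + ", value=" + back.value);
    }
}
```

The 12 bytes are just the int and the double. The flip side is that the
bytes are meaningless to a reader that does not already have the class.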

On Sun, May 29, 2011 at 8:08 AM, Sean Owen <[email protected]> wrote:

> Versus... Writable? No, the receiver has to know the Writable class in
> advance and therefore knows how to decode. It's not embedded in the
> serialization. Writable is nothing if not compact -- if you write a good
> encoder that is.
>
> On Sun, May 29, 2011 at 12:42 PM, Grant Ingersoll <[email protected]> wrote:
> >
> > > - data knows its own schema and will interoperate with any other
> > > compatible schema
> > >
> >
> > Isn't it also more compact because it doesn't have to send encoding info?
> >
> >
> >
>
