Good discussion. While I'd like to play around with Avro some other time as it is very interesting, I'll stick with Writables for this project because everything else in Mahout uses them.
Here are some benchmarking results for different serialization frameworks, including Avro: https://github.com/eishay/jvm-serializers/wiki Here is what a guest column on Cloudera had to say about Avro: http://www.cloudera.com/blog/2011/05/three-reasons-why-apache-avro-data-serialization-is-a-good-choice-for-openrtb/ On Sun, May 29, 2011 at 8:08 AM, Sean Owen <[email protected]> wrote: > Versus... Writable? no, the receiver has to know the Writable class in > advance and therefore knows how to decode. It's not embedded in the > serialization. Writable is nothing if not compact -- if you write a good > encoder that is. > > On Sun, May 29, 2011 at 12:42 PM, Grant Ingersoll <[email protected] > >wrote: > > > > > - data knows its own schema and will interoperate with any other > > compatible > > > schema > > > > > > > Isn't it also more compact due to it doesn't have to send encoding info? > > > > > > >
