Re: [PROPOSAL] new subproject: Avro

George Porter Fri, 03 Apr 2009 12:04:18 -0700


On Apr 3, 2009, at 11:37 AM, Doug Cutting wrote:

Field ids are not present in Avro data except in the schema. A record's fields are serialized in the order that the fields occur in the record's schema, with no per-field annotations whatsoever. For example, a record that contains a string and an int is serialized simply as a string followed by an int, nothing before, nothing between and nothing after. So, yes, it is a different data format.
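
To make the layout described above concrete, here is a minimal Python sketch of a record holding a string and an int, written in schema order with no field ids. The byte-level choices (a 4-byte big-endian length prefix for the string, a 4-byte big-endian int) and the function names are assumptions for illustration only; they are not Avro's actual wire encoding.

```python
import struct

def encode_record(name, count):
    """Write the fields in schema order: string first, then int.

    Hypothetical layout: 4-byte length + UTF-8 bytes, then a 4-byte int.
    No field ids or per-field tags are written.
    """
    data = name.encode("utf-8")
    return struct.pack(">I", len(data)) + data + struct.pack(">i", count)

def decode_record(buf):
    """Read the fields back; the reader must know the same field order."""
    (length,) = struct.unpack_from(">I", buf, 0)
    name = buf[4:4 + length].decode("utf-8")
    (count,) = struct.unpack_from(">i", buf, 4 + length)
    return name, count
```

The reader can only interpret the bytes because it knows the schema's field order; nothing in the data itself identifies which field is which.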

While this representation would certainly be as compact as possible, wouldn't it prevent evolving the data structure over time? One of the nice features of Google Protocol Buffers and Thrift is that you can evolve the set of fields over time, and older/newer clients can talk to older/newer services. If the proposed Avro is evolvable, then perhaps I'm misunderstanding your statement about the lack of IDs in the serialized data.
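
For contrast, the kind of evolvability I have in mind can be sketched as follows: each field carries an id and a length, so an older reader can skip fields it does not recognize. This is a simplified illustration in Python, not the actual Protocol Buffers or Thrift encoding; the 2-byte id, 4-byte length header and the function names are assumptions.

```python
import struct

def encode_tagged(fields):
    """fields maps a numeric field id to its already-serialized payload."""
    out = b""
    for field_id, payload in fields.items():
        # Hypothetical header: 2-byte field id, 4-byte payload length.
        out += struct.pack(">HI", field_id, len(payload)) + payload
    return out

def decode_tagged(buf, known_ids):
    """Keep fields whose id the reader knows; skip the rest by length."""
    result = {}
    pos = 0
    while pos < len(buf):
        field_id, length = struct.unpack_from(">HI", buf, pos)
        pos += 6  # size of the id + length header
        if field_id in known_ids:
            result[field_id] = buf[pos:pos + length]
        pos += length  # unknown fields are stepped over
    return result
```

A newer writer that adds field 3 can still be read by an older client that only knows fields 1 and 2, which is what the per-field ids buy you.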

I also agree with Bryan, in that it would be unfortunate to have two different Apache projects with overlapping goals. Regardless of features, both Protocol Buffers and Thrift have the advantage of having been debugged in mission-critical production environments.


-George
