Re: Google NIH generates yet another incompatible data transfer language

Andrew Lentvorski Thu, 10 Jul 2008 05:30:34 -0700

Chuck Esterbrook wrote:

It's not hard to imagine they needed something faster and slimmer than
XML. Their motivations over XML are compelling:

No argument. There is no question that what they created is better than XML for their purposes.

Regarding JSON and ASN, I would definitely lean towards them for their
self-description value, but obviously Protocol Buffers are going to be
faster and slimmer.


Faster.  Unlikely.  Take a look at the encoding:
http://code.google.com/apis/protocolbuffers/docs/encoding.html

*Lots* of bit munging. From my experience, ASN.1 is probably going to be faster.
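
To make the bit munging concrete, here is a minimal Python sketch of the base-128 varint scheme described on that encoding page, as I read it: seven payload bits per byte, least-significant group first, and the high bit set on every byte except the last. The function names are illustrative only and are not part of any Protocol Buffers API.

```python
def encode_varint(value):
    """Encode a non-negative integer as a base-128 varint."""
    if value < 0:
        raise ValueError("this sketch handles non-negative integers only")
    out = bytearray()
    while True:
        low7 = value & 0x7F          # take the lowest seven bits
        value >>= 7
        if value:
            out.append(low7 | 0x80)  # more bytes follow: set the high bit
        else:
            out.append(low7)         # last byte: high bit clear
            return bytes(out)


def decode_varint(data, pos=0):
    """Decode one varint starting at pos; return (value, next_pos)."""
    result = 0
    shift = 0
    while True:
        byte = data[pos]
        pos += 1
        result |= (byte & 0x7F) << shift  # splice seven bits into place
        if not byte & 0x80:               # high bit clear: this was the last byte
            return result, pos
        shift += 7


def encode_key(field_number, wire_type):
    """Field key: the field number shifted left three bits, OR'd with the wire type."""
    return encode_varint((field_number << 3) | wire_type)
```

With this sketch, encode_varint(300) yields the two bytes AC 02, and field 1 holding the varint 150 comes out as 08 96 01. Every integer costs a mask, a shift, and a branch per byte in both directions, which is the per-field work I am talking about.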

Comparisons against other binary protocols aren't shown. The only comparison that they make is against XML.

Also, almost all the examples use C++ because it's the only language that they use that has horrible serialization. Both Java and Python serialize very nicely via introspection. Also, they only show it against the most primitive DOM XML API, when most people use something that operates at a much higher level nowadays (PullDOM, for example, on Python).

Basically, I don't see the advantage over even something relatively generic like ASN.1. And, I'm not suggesting ASN.1 is necessarily a great solution because it suffers the same problem of not actually associating the data with a label.

I found http://code.google.com/apis/protocolbuffers/docs/overview.html
rather informative. These are engineers solving the problems of their
company, not pariahs on an NIH binge. They have "48,162 different
message types defined in the Google code tree". It's not hard to
imagine that a custom solution was worthwhile for them to pursue.

Is it really NIH if you get a set of pros and cons that are (a)
different, (b) what you wanted and (c) heavily reused?

I don't think so,

You missed my point. Yes, it's better than *XML* because it's a *binary* format.

The problem is that it doesn't seem to be any better than currently existing *binary* formats and seems to have many limitations that even something as generic as ASN.1 doesn't.

In addition, it loses the inline association between labeled delimiter and delimited data. That's a large loss that many people won't think about. Even associating a field with a one-byte label helps when everything goes FUBAR. "Oh, hang on. Your format placed label 'Q' as the second item whereas we don't even have a label 'Q' which means that we've completely lost a field that we should have. Did we grab the wrong IDL file? Or did we grab the wrong data file?"
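
A rough Python sketch of that kind of sanity check, assuming a made-up record layout in which every field is a one-byte ASCII label, a one-byte length, and then the payload. The layout and both function names are hypothetical and exist only to illustrate the point; they do not describe Protocol Buffers, ASN.1, or any real format.

```python
def parse_labeled(data):
    """Split a byte string into (label, payload) pairs.

    Hypothetical layout: label byte, length byte, then `length` payload bytes.
    """
    fields = []
    pos = 0
    while pos < len(data):
        if pos + 2 > len(data):
            raise ValueError("truncated field header at offset %d" % pos)
        label = chr(data[pos])
        length = data[pos + 1]
        payload = data[pos + 2:pos + 2 + length]
        if len(payload) != length:
            raise ValueError("truncated payload for label %r" % label)
        fields.append((label, payload))
        pos += 2 + length
    return fields


def check_against_schema(fields, expected_labels):
    """Return (unexpected, missing) labels relative to what the reader expects."""
    seen = [label for label, _ in fields]
    unexpected = [label for label in seen if label not in expected_labels]
    missing = [label for label in expected_labels if label not in seen]
    return unexpected, missing
```

Feeding it a record that carries labels A and Q while the reader expects A and B reports Q as unexpected and B as missing, which is exactly the "wrong IDL file or wrong data file?" clue described above.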

This feels like a bunch of CS folks who couldn't be bothered to go understand the tradeoffs of binary wire formats people have been using for, oh, the last 20+ years.


-a

--
KPLUG-LPSG@kernel-panic.org
http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-lpsg
