Re: Multi-language serialization discussion

Doug Cutting Fri, 24 Oct 2008 17:44:59 -0700

Chad Walters wrote:

Re-open that discussion and I imagine you might get some interested parties.


I think I just did, no?

Bumping up a level, rather than inventing a whole new set of Hadoop-specific 
RPC and serialization mechanisms

Whatever we use, we'd probably end up recycling much of Hadoop'sclient/server implementation, since it's been finely tuned for Hadoop'sperformance needs, and I've not yet seen a Thrift transport that looksappropriate. We also need to add authentication and authorizationlayers to Hadoop's RPC, which don't exist in Thrift either, as far as Ican tell. So mostly what we'd use from Thrift directly is objectserialization.

That said, if we use Thrift for object serialization then we'd probablyeventually contribute our transport, authentication and authorizationstuff to the Thrift project. We'd probably want to build it first inHadoop, since it's critical kernel stuff for Hadoop, but, once it'sstable, contribute it to Thrift if it seemed useful to others.

As a serialization layer, Thrift lacks the self-describing stuff that Ithink is critical. If JSON will be the primary text format, then itlooks to me that it would be easier and more natural to base a binaryself-describing format on JSON schema than on Thrift IDL, but perhaps Ican be convinced otherwise.


Doug

Re: Multi-language serialization discussion

Reply via email to