How fast do we expect the new serialization system to be when it replaces 
existing serialization mechanism in Hadoop RPC?

A clear description of the existing bottlenecks and the performance goals for 
this system would help developers interested in contributing.

-Ankur


-------- Original Message --------
Subject:        [PROPOSAL] new subproject: Avro
Date:   Thu, 02 Apr 2009 15:05:08 -0700
From:   Doug Cutting <[email protected]>
Reply-To:       [email protected]
To:     [email protected]



I propose we add a new Hadoop subproject for Avro, a serialization 
system.  My ambition is for Avro to replace both Hadoop's RPC and to be 
used for most Hadoop data files, e.g., by Pig, Hive, etc.

Initial committers would be Sharad Agarwal and me, both existing Hadoop 
committers.  We are the sole authors of this software to date.

The code is currently at:

http://people.apache.org/~cutting/avro.git/

To learn more:

git clone http://people.apache.org/~cutting/avro.git/ avro
cat avro/README.txt

Comments?  Questions?

Doug


Reply via email to