How fast do we expect the new serialization system to be when it replaces existing serialization mechanism in Hadoop RPC?
A clear description of the existing bottlenecks and the performance goals for this system would help developers interested in contributing. -Ankur -------- Original Message -------- Subject: [PROPOSAL] new subproject: Avro Date: Thu, 02 Apr 2009 15:05:08 -0700 From: Doug Cutting <[email protected]> Reply-To: [email protected] To: [email protected] I propose we add a new Hadoop subproject for Avro, a serialization system. My ambition is for Avro to replace both Hadoop's RPC and to be used for most Hadoop data files, e.g., by Pig, Hive, etc. Initial committers would be Sharad Agarwal and me, both existing Hadoop committers. We are the sole authors of this software to date. The code is currently at: http://people.apache.org/~cutting/avro.git/ To learn more: git clone http://people.apache.org/~cutting/avro.git/ avro cat avro/README.txt Comments? Questions? Doug
