Hi Uri, Once you post to the JIRA, I'd be happy to review it.
-- Philip On Mon, Apr 29, 2013 at 9:22 AM, Doug Cutting <[email protected]> wrote: > Uri, > > This sounds awesome! Is the API compatible with the existing API? If > it's incompatible and cannot easily be made compatible then perhaps we > can add it as the 'new' API and deprecate the old one. Regardless, > please file an issue in Jira (issues.apache.org/jira/browse/AVRO) and > attach your patch there. > > Thanks, > > Doug > > On Sun, Apr 28, 2013 at 10:24 PM, Uri Laserson <[email protected]> > wrote: > > Hi all, > > > > I rewrote some of the python code to read avro files. I was able to > > achieve a ~3x speedup over the current impl, and can probably do better > if > > it was cleaned up more. The main changes are: > > * Eliminated the object-oriented nature of the reader. It's just > functions > > now. Presumably this can be changed back, but it didn't really seem like > > there was any reason for it. > > * Given a reader and writer schema, it precomputes as much helpful info > as > > it can upfront and caches this in a dictionary that the read functions > use > > * The code is compiled with Cython for speedup. > > > > How can this be used to improve the current python api? Let me know how > I > > can be helpful... > > > > Uri > > > > -- > > Uri Laserson, PhD > > Data Scientist, Cloudera > > Twitter/GitHub: @laserson > > +1 617 910 0447 > > [email protected] >
