If I recall correctly, the Python write implementation does a recursive pass to check the data against the schema. This is sometimes necessary to choose which branch of a union to take when you're faced with typeless dicts, but it's done more often than necessary in the python implementation, and is very slow.
I think the right approach is to have a way for the user to tag the various dicts to indicate which branch of a union it'll represent. -- Philip On Fri, Jan 9, 2015 at 4:01 PM, Doug Cutting <[email protected]> wrote: > On Fri, Jan 9, 2015 at 6:05 AM, Bruce Mitchener > <[email protected]> wrote: > > Has anyone profiled the Python code or otherwise looked at the > performance? > > Not that I've heard of. Optimizing Avro Python would be a great project. > > Doug >
