If I recall correctly, the Python write implementation does a recursive
pass to check the data against the schema.  This is sometimes necessary to
choose which branch of a union to take when you're faced with typeless
dicts, but it's done more often than necessary in the python
implementation, and is very slow.

I think the right approach is to have a way for the user to tag the various
dicts to indicate which branch of a union it'll represent.

-- Philip


On Fri, Jan 9, 2015 at 4:01 PM, Doug Cutting <[email protected]> wrote:

> On Fri, Jan 9, 2015 at 6:05 AM, Bruce Mitchener
> <[email protected]> wrote:
> > Has anyone profiled the Python code or otherwise looked at the
> performance?
>
> Not that I've heard of.  Optimizing Avro Python would be a great project.
>
> Doug
>

Reply via email to