[
https://issues.apache.org/jira/browse/AVRO-673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12914729#action_12914729
]
Doug Cutting commented on AVRO-673:
-----------------------------------
It's not clear to me that we need to validate at all before we start writing.
The write should fail on invalid data.
AVRO-654 is also related. Recursive validation is also not required to select
a union branch, and, in the worst case, can result in exponentially bad
performance.
> Reduce time spent validating schemas
> ------------------------------------
>
> Key: AVRO-673
> URL: https://issues.apache.org/jira/browse/AVRO-673
> Project: Avro
> Issue Type: Improvement
> Components: python
> Reporter: Erik Frey
> Priority: Minor
> Attachments: AVRO-673.patch
>
>
> avro.io has a validate method that currently occupies around half the time it
> takes to serialize a fairly complex record through a datafile. validate()
> gets called repeatedly during an object's traversal, even though validate
> itself is already recursive. This introduces combinatorially excessive
> validation that has a significant impact on the performance of serializing
> complex records.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.