[ https://issues.apache.org/jira/browse/AVRO-673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12914729#action_12914729 ]

Doug Cutting commented on AVRO-673:
-----------------------------------

It's not clear to me that we need to validate at all before we start writing.  
The write should fail on invalid data.

AVRO-654 is related.  Recursive validation is not required to select a 
union branch and, in the worst case, can result in exponentially bad 
performance.
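
To illustrate the point about union branches, here is a minimal sketch of 
selecting a branch from the datum's top-level type only, without descending 
into fields or items.  The schema model (strings for primitives, dicts for 
named or complex types, a list for a union) and the names shallow_match and 
select_branch are assumptions made for this example; this is not the avro.io 
API.

```python
# Hypothetical, simplified schema model (not the avro.io API):
# a primitive is a string such as "int"; a complex type is a dict with a
# "type" key; a union is a list of such schemas.

def shallow_match(schema, datum):
    """Return True if datum's top-level Python type fits schema.

    Does not descend into record fields, array items, or map values.
    """
    kind = schema if isinstance(schema, str) else schema["type"]
    if kind == "null":
        return datum is None
    if kind == "boolean":
        return isinstance(datum, bool)
    if kind in ("int", "long"):
        # bool is a subclass of int in Python, so exclude it explicitly.
        return isinstance(datum, int) and not isinstance(datum, bool)
    if kind in ("float", "double"):
        return isinstance(datum, (int, float)) and not isinstance(datum, bool)
    if kind == "string":
        return isinstance(datum, str)
    if kind == "bytes":
        return isinstance(datum, bytes)
    if kind in ("record", "map"):
        return isinstance(datum, dict)
    if kind == "array":
        return isinstance(datum, list)
    return False


def select_branch(union, datum):
    """Return the index of the first branch whose top-level type matches."""
    for index, branch in enumerate(union):
        if shallow_match(branch, datum):
            return index
    raise TypeError("datum matches no branch of the union")
```

A shallow check like this cannot tell two record branches apart, or a record 
from a map, so a real implementation would need some further rule for those 
cases.  Invalid nested data would then surface as a failure during the write 
itself.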

> Reduce time spent validating schemas
> ------------------------------------
>
>                 Key: AVRO-673
>                 URL: https://issues.apache.org/jira/browse/AVRO-673
>             Project: Avro
>          Issue Type: Improvement
>          Components: python
>            Reporter: Erik Frey
>            Priority: Minor
>         Attachments: AVRO-673.patch
>
>
> avro.io has a validate() method that currently occupies around half the time 
> it takes to serialize a fairly complex record through a datafile.  validate() 
> gets called repeatedly during an object's traversal, even though validate() 
> is itself recursive.  This introduces combinatorially excessive validation 
> that has a significant impact on the performance of serializing complex 
> records.
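
The effect described in the issue can be sketched with a toy model that 
counts node visits.  This is not the avro.io code: the datum layout (nested 
dicts with int leaves) and the names validate, write_revalidating, nested, 
and count_visits are assumptions made for this example.

```python
# Toy model, not avro.io: a datum is a nested dict whose values are either
# ints (leaves) or further dicts (nested records).

def validate(datum, counter):
    """Recursively check a datum, counting every node visited."""
    counter[0] += 1
    if isinstance(datum, dict):
        return all(validate(value, counter) for value in datum.values())
    return isinstance(datum, int) and not isinstance(datum, bool)


def write_revalidating(datum, counter):
    """Traverse a datum, re-running the recursive validate at every level."""
    if not validate(datum, counter):
        raise TypeError("invalid datum")
    if isinstance(datum, dict):
        for value in datum.values():
            write_revalidating(value, counter)
    # A real writer would encode the leaf value here.


def nested(depth):
    """Build a chain of single-field records `depth` levels deep."""
    datum = 0
    for _ in range(depth):
        datum = {"child": datum}
    return datum


def count_visits(func, datum):
    """Run func on datum and return how many nodes validate visited."""
    counter = [0]
    func(datum, counter)
    return counter[0]


single_pass = count_visits(validate, nested(9))
revalidating = count_visits(write_revalidating, nested(9))
```

In this model a chain of depth n has n + 1 nodes.  One validation pass 
visits each node once, while the revalidating traversal visits 
(n + 1)(n + 2) / 2 nodes, so the redundant work grows faster than the datum 
itself.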
