Michael Skells created AVRO-4249:
------------------------------------

             Summary: Avoid parsing a schema if we have a cache
                 Key: AVRO-4249
                 URL: https://issues.apache.org/jira/browse/AVRO-4249
             Project: Apache Avro
          Issue Type: Improvement
          Components: java
    Affects Versions: 1.12.1
            Reporter: Michael Skells


In a environment where we have many small avro files, and large schemas we see 
multiple TB of schemas being generated when we open files

As the schema is a well defined structure, and the transform from a serialised 
transform is expensive, is seems logical to provide a cache. If the serialised 
form of the schema is the same, then the parsed version can be cached

 

I can provide a PR for discussion



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to