[ 
https://issues.apache.org/jira/browse/IMPALA-3722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong updated IMPALA-3722:
----------------------------------
    Priority: Minor  (was: Major)

> Avro codegen can be unnecessarily disabled
> ------------------------------------------
>
>                 Key: IMPALA-3722
>                 URL: https://issues.apache.org/jira/browse/IMPALA-3722
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>    Affects Versions: Impala 2.6.0
>            Reporter: Skye Wanderman-Milne
>            Priority: Minor
>              Labels: avro, codegen, ramp-up
>
> We use avro_schema_equal() from the Avro C library to determine if a file's 
> schema matches the table schema, and if they don't match we disable codegen 
> for that file 
> (https://github.com/cloudera/Impala/blob/cdh5-trunk/be/src/exec/hdfs-avro-scanner.cc#L153).
>  However, avro_schema_equal() is unnecessarily restrictive, because it 
> compares the records' names and namespaces, which don't have to be the same 
> to enable codegen. There are probably other checks we don't need as well, 
> e.g. default values. We should write our own schema comparison function that 
> is tailored to what must match for codegen specifically.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to