[
https://issues.apache.org/jira/browse/IMPALA-9821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Smith updated IMPALA-9821:
----------------------------------
Epic Link: IMPALA-12674 (was: IMPALA-9593)
> Rewrite ds_hll_sketch() and ds_hll_union() and other datasketch generating
> functions to return Binary
> -----------------------------------------------------------------------------------------------------
>
> Key: IMPALA-9821
> URL: https://issues.apache.org/jira/browse/IMPALA-9821
> Project: IMPALA
> Issue Type: New Feature
> Components: Backend
> Reporter: Gabor Kaszab
> Priority: Major
>
> Until Binary implementation is ongoing ds_hll_sketch() and ds_hll_union()
> functions return serialized sketches in String format. Once Binary is
> available in Impala these can return the serialized sketches in Binary format.
> Currently when sketches are written by Hive as BINARY to ORC table and this
> table is loaded to Impala where the sketch columns are STRINGs then we get an
> error
> {code:java}
> ERROR: Type mismatch: table column STRING is map to column binary in ORC file
> {code}
> Interestingly the works with Parquet format.
> Once we have binary support make sure to add coverage for ORC table where the
> table is created and populated by Hive and read for estimating by Impala.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]