[
https://issues.apache.org/jira/browse/IMPALA-9821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zoltán Borók-Nagy updated IMPALA-9821:
--------------------------------------
Target Version: Impala 5.0.0
Priority: Blocker (was: Major)
> Rewrite ds_hll_sketch() and ds_hll_union() and other datasketch generating
> functions to return Binary
> -----------------------------------------------------------------------------------------------------
>
> Key: IMPALA-9821
> URL: https://issues.apache.org/jira/browse/IMPALA-9821
> Project: IMPALA
> Issue Type: New Feature
> Components: Backend
> Reporter: Gabor Kaszab
> Priority: Blocker
>
> While the Binary implementation is ongoing, the ds_hll_sketch() and
> ds_hll_union() functions return serialized sketches in String format. Once
> Binary is available in Impala, these can return the serialized sketches in
> Binary format.
> Currently, when sketches are written by Hive as BINARY to an ORC table and this
> table is loaded into Impala with the sketch columns declared as STRING, we get
> the following error:
> {code:java}
> ERROR: Type mismatch: table column STRING is map to column binary in ORC file
> {code}
> Interestingly, this works with the Parquet format.
> Once we have Binary support, make sure to add coverage for an ORC table that
> is created and populated by Hive and read by Impala for estimation.
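> A rough sketch of that coverage scenario follows. The table and column names
> (sketch_src, hll_orc, id, sketch) are hypothetical, and it assumes a Hive
> build that ships the DataSketches functions and an Impala build where
> ds_hll_estimate() accepts BINARY input. It is an illustration only, not the
> actual test.
> {code:sql}
> -- Hive: create an ORC table with a BINARY sketch column and populate it.
> CREATE EXTERNAL TABLE hll_orc (sketch BINARY) STORED AS ORC;
> INSERT INTO hll_orc SELECT ds_hll_sketch(id) FROM sketch_src;
> -- Impala: pick up the table written by Hive, then estimate from the sketch.
> INVALIDATE METADATA hll_orc;
> SELECT ds_hll_estimate(sketch) FROM hll_orc;
> {code}
> With the sketch column declared as BINARY on both sides, the ORC read should
> no longer hit the STRING/binary type mismatch shown above.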