[
https://issues.apache.org/jira/browse/HIVE-27475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Akshat Mathur updated HIVE-27475:
---------------------------------
Description:
After the parquet version was upgraded downstream, hive-ql needed to shade a
version of zstd-jni that is compatible with parquet; otherwise Tez could not
work with parquet and zstd compression. Parquet and kafka-client both use
zstd-jni, and in hive-ql zstd-jni arrives as a transitive dependency of
kafka-client.
The zstd-jni version in kafka-client 2.5.0 was not compatible with parquet, so
in [PR-4082|https://github.com/apache/hive/pull/4082] kafka-client was upgraded
to the latest version, which uses a zstd-jni version close to the one in
parquet.
While the upgrade fixed the zstd-jni issue, it introduced multiple
compatibility issues in kafka-handler, and because of missing test cases and
disabled tests these issues went undetected.
The recent refactor in [HIVE-27402|http://example.com/] moved the kafka-client
dependency out of hive-ql, which fixed the original zstd-jni issue (confirmed
by [~difin]).
Hence, it makes more sense to downgrade Kafka back to 2.5.0, stabilise
kafka-handler, enable the tests, and then upgrade.
The discussion can be found here:
[#4436|https://github.com/apache/hive/pull/4436]
was:
After the parquet version was upgraded downstream, hive-ql needed to shade a
version of zstd-jni that is compatible with parquet; otherwise Tez could not
work with parquet and zstd compression. Parquet and kafka-client both use
zstd-jni, and in hive-ql zstd-jni arrives as a transitive dependency of
kafka-client.
The zstd-jni version in kafka-client 2.5.0 was not compatible with parquet, so
in [PR-4082|https://github.com/apache/hive/pull/4082] kafka-client was upgraded
to the latest version, which uses a zstd-jni version close to the one in
parquet.
While the upgrade fixed the zstd-jni issue, it introduced multiple
compatibility issues in kafka-handler.
The recent refactor in [HIVE-27402|http://example.com/] moved the kafka-client
dependency out of hive-ql, which fixed the original zstd-jni issue (confirmed
by [~difin]).
Hence, it makes more sense to downgrade Kafka back to 2.5.0, stabilise
kafka-handler, enable the tests, and then upgrade.
The discussion can be found here:
[#4436|https://github.com/apache/hive/pull/4436]
> Revert Kafka version to stabilise Kafka handler
> ------------------------------------------------
>
> Key: HIVE-27475
> URL: https://issues.apache.org/jira/browse/HIVE-27475
> Project: Hive
> Issue Type: Task
> Components: kafka integration
> Affects Versions: 4.0.0-alpha-2
> Reporter: Akshat Mathur
> Assignee: Akshat Mathur
> Priority: Major
> Labels: pull-request-available
--
This message was sent by Atlassian Jira
(v8.20.10#820010)