[
https://issues.apache.org/jira/browse/HIVE-21737?focusedWorklogId=527110&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-527110
]
ASF GitHub Bot logged work on HIVE-21737:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 22/Dec/20 09:01
Start Date: 22/Dec/20 09:01
Worklog Time Spent: 10m
Work Description: iemejia commented on pull request #1635:
URL: https://github.com/apache/hive/pull/1635#issuecomment-749430903
@sunchao Great to know the cut is happening soon!
Whether this is absolutely needed is a question of tradeoffs. The Avro binary
format has not changed since version 1.8.x, when it introduced Logical Types,
but the public API has since dropped third-party dependencies (Jackson, Guava,
JodaTime, etc.), so I suppose catching up to the latest version can carry the
same risks as catching up to 1.8.x, which Hive already did for the binary part.
I suppose Hive users rarely use Avro directly through the transitive dependency
and mostly rely on the Hive APIs (which I hope don't leak Avro), so this might
diminish the risk, but of course there is still a risk in that particular case.
I am probably biased towards the upgrade because I come from the Avro side and
I would like the whole Big Data ecosystem to be up to date. Otherwise Hive
contributors may introduce changes that are API-incompatible with more recent
versions of Avro and break downstream projects (like the current parallel work
on Spark).
As usual in software, it is all about tradeoffs. This decision is up to you
guys as the maintainers, and I may be missing some other side effects in my
analysis because I don't know Hive deeply.
In any case, if you guys decide to jump to the latest Avro version, e.g.
1.10.1, and any issue comes up, I commit from the Avro side to make any fix
and get a release out if required.
Issue Time Tracking
-------------------
Worklog Id: (was: 527110)
Time Spent: 5h (was: 4h 50m)
> Upgrade Avro to version 1.10.1
> ------------------------------
>
> Key: HIVE-21737
> URL: https://issues.apache.org/jira/browse/HIVE-21737
> Project: Hive
> Issue Type: Improvement
> Components: Hive
> Reporter: Ismaël Mejía
> Assignee: Fokko Driesprong
> Priority: Major
> Labels: pull-request-available
> Attachments:
> 0001-HIVE-21737-Make-Avro-use-in-Hive-compatible-with-Avr.patch
>
> Time Spent: 5h
> Remaining Estimate: 0h
>
> Avro >= 1.9.x brings a lot of fixes, including a leaner version of Avro without
> Jackson in the public API and without Guava as a dependency. Worth the update.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)