[
https://issues.apache.org/jira/browse/SPARK-18673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17364409#comment-17364409
]
Neelesh Srinivas Salian commented on SPARK-18673:
-------------------------------------------------
Hitting this in Spark 3.1.1 using Hadoop 3.2.1 with Hive 1.2.2.
I realize the solution is using Hive 2.3.x.
But this looks like an opportunity to improve the documentation here:
[https://spark.apache.org/docs/3.1.1/sql-data-sources-hive-tables.html#interacting-with-different-versions-of-hive-metastore]
with regards to Hive versions and Hadoop versions.
> Dataframes doesn't work on Hadoop 3.x; Hive rejects Hadoop version
> ------------------------------------------------------------------
>
> Key: SPARK-18673
> URL: https://issues.apache.org/jira/browse/SPARK-18673
> Project: Spark
> Issue Type: Sub-task
> Components: SQL
> Affects Versions: 2.1.0
> Environment: Spark built with -Dhadoop.version=3.0.0-alpha2-SNAPSHOT
> Reporter: Steve Loughran
> Assignee: Yuming Wang
> Priority: Major
> Fix For: 3.0.0
>
>
> Spark Dataframes fail to run on Hadoop 3.0.x, because hive.jar's shimloader
> considers 3.x to be an unknown Hadoop version.
> Hive itself will have to fix this; as Spark uses its own hive 1.2.x JAR, it
> will need to be updated to match.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]