nevgin opened a new issue #2832:
URL: https://github.com/apache/hudi/issues/2832
I have installed vanilla versions of hive and spark.
Put the jar hoodie spark bundle in the spark. Put
hudi-hadoop-mr-bundle-x.y.z-SNAPSHOT.jar in aux hive dir and to classpath
hadoop on all datanodes.
When a query is executed in hive with the MP engine, the queries are
executed. Requests also be executed from spark.
However, when running a query to Hive with Engine Spark, an error is thrown
Error: org.apache.hive.service.cli.HiveSQLException: Error while processing
statement: FAILED: Execution Error, return code 3 from
org.apache.hadoop.hive.ql.exec.spark.SparkTask. java.l
ang.NoClassDefFoundError: org / apache / hadoop / hive / ql / io / parquet /
MapredParquetInputFormat
**Expected behavior**
Executed query
**Environment Description**
* Hudi version :
0.7.0
* Spark version :
2.4.7
* Hive version :
2.3.8
* Hadoop version :
2.7.3
* Storage (HDFS/S3/GCS..) :
HDFS
* Running on Docker? (yes/no) :
NO
**Additional context**
Add any other context about the problem here.
**Stacktrace**
java.lang.ClassNotFoundException:
org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat
```Add the stacktrace of the error.```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]