[GitHub] [hudi] nevgin opened a new issue #2832: [SUPPORT]

GitBox Thu, 15 Apr 2021 15:43:32 -0700


nevgin opened a new issue #2832:
URL: https://github.com/apache/hudi/issues/2832



   I have installed vanilla versions of hive and spark.
   Put the jar hoodie spark bundle in the spark. Put 
hudi-hadoop-mr-bundle-x.y.z-SNAPSHOT.jar in aux hive dir and to classpath 
hadoop on all datanodes.
   When a query is executed in hive with the MP engine, the queries are 
executed. Requests also be executed from spark.
   However, when running a query to Hive with Engine Spark, an error is thrown
   Error: org.apache.hive.service.cli.HiveSQLException: Error while processing 
statement: FAILED: Execution Error, return code 3 from 
org.apache.hadoop.hive.ql.exec.spark.SparkTask. java.l
   ang.NoClassDefFoundError: org / apache / hadoop / hive / ql / io / parquet / 
MapredParquetInputFormat 
   **Expected behavior**
   
   Executed query
   
   **Environment Description**
   
   * Hudi version :
    0.7.0
   
   * Spark version :
   2.4.7
   * Hive version :
   2.3.8
   * Hadoop version :
   2.7.3
   * Storage (HDFS/S3/GCS..) :
   HDFS
   * Running on Docker? (yes/no) :
   NO
   
   **Additional context**
   
   Add any other context about the problem here.
   
   **Stacktrace**
   java.lang.ClassNotFoundException: 
org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat
   ```Add the stacktrace of the error.```
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [hudi] nevgin opened a new issue #2832: [SUPPORT]

Reply via email to