mattssll opened a new issue, #10968:
URL: https://github.com/apache/hudi/issues/10968

   **Describe the problem you faced**
   
   Fail to use Glue Catalog in a Hudi Delta Streamer job on EKS with Spark 
Operator.
   
   **To Reproduce**
   
   Steps to reproduce the behavior:
   
   1.Start job using hudi delta streamer configured to use glue catalog
   2.Using scala 2.12, Hudi 0.13.1, Spark 3.1.3
   
   
   **Expected behavior**
   Table is sinked to glue DB
   **Environment Description**
   
   * Hudi version :
   0.13.1
   * Spark version :
   3.1.3
   * Hive version :
   ?
   * Hadoop version :
   3.2.0
   * Storage (HDFS/S3/GCS..) :
   S3
   * Running on Docker? (yes/no) :
   DOcker image is built
   
   **Additional context**
   Dependencies on pom.xml
   ```
   <project xmlns="http://maven.apache.org/POM/4.0.0";
            xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance";
            xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 
http://maven.apache.org/xsd/maven-4.0.0.xsd";>
       <modelVersion>4.0.0</modelVersion>
   
       <groupId>com.recharge</groupId>
       <artifactId>spark-hudi-app</artifactId>
       <version>1.0</version>
   
       <properties>
           <aws_java_sdk>1.12.694</aws_java_sdk>
           <hadoop_version>3.2.0</hadoop_version>
           <hudi_version>0.13.1</hudi_version>
           <scala_version>2.12</scala_version>
       </properties>
   
       <dependencies>
           <dependency>
               <groupId>com.amazonaws</groupId>
               <artifactId>aws-java-sdk-bundle</artifactId>
               <version>${aws_java_sdk}</version>
           </dependency>
           <dependency>
               <groupId>org.apache.hadoop</groupId>
               <artifactId>hadoop-aws</artifactId>
               <version>${hadoop_version}</version>
           </dependency>
           <dependency>
               <groupId>org.apache.hudi</groupId>
               <artifactId>hudi-aws</artifactId>
               <version>${hudi_version}</version>
           </dependency>
           <dependency>
               <groupId>org.apache.hudi</groupId>
               <artifactId>hudi-utilities-bundle_${scala_version}</artifactId>
               <version>${hudi_version}</version>
           </dependency>
       </dependencies>
   
   </project>
   
   ```
   Add any other context about the problem here.
   
   **Stacktrace**
   24/04/06 20:53:18 INFO SparkContext: Successfully stopped SparkContext
   Exception in thread "main" org.apache.hudi.exception.HoodieException: 
org.apache.hudi.exception.HoodieException: Could not sync using the meta sync 
class org.apache.hudi.aws.sync.AwsGlueCatalogSyncTool
           at 
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer.lambda$sync$1(HoodieDeltaStreamer.java:206)
           at org.apache.hudi.common.util.Option.ifPresent(Option.java:97)
           at 
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer.sync(HoodieDeltaStreamer.java:201)
           at 
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer.main(HoodieDeltaStreamer.java:592)
           at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
           at 
java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
           at 
java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(Unknown 
Source)
           at java.base/java.lang.reflect.Method.invoke(Unknown Source)
           at 
org.apache.spark.deploy.JavaMainApplication.start(SparkApplication.scala:52)
           at 
org.apache.spark.deploy.SparkSubmit.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:951)
           at 
org.apache.spark.deploy.SparkSubmit.doRunMain$1(SparkSubmit.scala:180)
           at org.apache.spark.deploy.SparkSubmit.submit(SparkSubmit.scala:203)
           at org.apache.spark.deploy.SparkSubmit.doSubmit(SparkSubmit.scala:90)
           at 
org.apache.spark.deploy.SparkSubmit$$anon$2.doSubmit(SparkSubmit.scala:1039)
           at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:1048)
           at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
   ```
   Caused by: java.util.concurrent.ExecutionException: 
org.apache.hudi.exception.HoodieException: Could not sync using the meta sync 
class org.apache.hudi.aws.sync.AwsGlueCatalogSyncTool
           at 
java.base/java.util.concurrent.CompletableFuture.reportGet(Unknown Source)
           at java.base/java.util.concurrent.CompletableFuture.get(Unknown 
Source)
           at 
org.apache.hudi.async.HoodieAsyncService.waitForShutdown(HoodieAsyncService.java:103)
           at 
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer.lambda$sync$1(HoodieDeltaStreamer.java:204)
           ... 15 more
   Caused by: org.apache.hudi.exception.HoodieException: Could not sync using 
the meta sync class org.apache.hudi.aws.sync.AwsGlueCatalogSyncTool
           at 
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer$DeltaSyncService.lambda$startService$1(HoodieDeltaStreamer.java:778)
           at 
java.base/java.util.concurrent.CompletableFuture$AsyncSupply.run(Unknown Source)
           at 
java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
           at 
java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
           at java.base/java.lang.Thread.run(Unknown Source)
   Caused by: org.apache.hudi.exception.HoodieException: Could not sync using 
the meta sync class org.apache.hudi.aws.sync.AwsGlueCatalogSyncTool
           at 
org.apache.hudi.sync.common.util.SyncUtilHelpers.runHoodieMetaSync(SyncUtilHelpers.java:61)
           at 
org.apache.hudi.utilities.deltastreamer.DeltaSync.runMetaSync(DeltaSync.java:808)
           at 
org.apache.hudi.utilities.deltastreamer.DeltaSync.writeToSink(DeltaSync.java:723)
           at 
org.apache.hudi.utilities.deltastreamer.DeltaSync.syncOnce(DeltaSync.java:395)
           at 
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer$DeltaSyncService.lambda$startService$1(HoodieDeltaStreamer.java:735)
           ... 4 more
   Caused by: java.lang.NoClassDefFoundError: org/apache/hudi/hive/HiveSyncTool
           at java.base/java.lang.ClassLoader.defineClass1(Native Method)
           at java.base/java.lang.ClassLoader.defineClass(Unknown Source)
           at java.base/java.security.SecureClassLoader.defineClass(Unknown 
Source)
           at 
java.base/jdk.internal.loader.BuiltinClassLoader.defineClass(Unknown Source)
           at 
java.base/jdk.internal.loader.BuiltinClassLoader.findClassOnClassPathOrNull(Unknown
 Source)
           at 
java.base/jdk.internal.loader.BuiltinClassLoader.loadClassOrNull(Unknown Source)
           at 
java.base/jdk.internal.loader.BuiltinClassLoader.loadClass(Unknown Source)
           at 
java.base/jdk.internal.loader.ClassLoaders$AppClassLoader.loadClass(Unknown 
Source)
           at java.base/java.lang.ClassLoader.loadClass(Unknown Source)
           at java.base/java.lang.ClassLoader.loadClass(Unknown Source)
           at java.base/java.lang.Class.forName0(Native Method)
           at java.base/java.lang.Class.forName(Unknown Source)
           at 
org.apache.hudi.common.util.ReflectionUtils.getClass(ReflectionUtils.java:55)
           at 
org.apache.hudi.common.util.ReflectionUtils.hasConstructor(ReflectionUtils.java:95)
           at 
org.apache.hudi.sync.common.util.SyncUtilHelpers.instantiateMetaSyncTool(SyncUtilHelpers.java:83)
           at 
org.apache.hudi.sync.common.util.SyncUtilHelpers.runHoodieMetaSync(SyncUtilHelpers.java:58)
           ... 8 more
   Caused by: java.lang.ClassNotFoundException: 
org.apache.hudi.hive.HiveSyncTool
           at 
java.base/jdk.internal.loader.BuiltinClassLoader.loadClass(Unknown Source)
           at 
java.base/jdk.internal.loader.ClassLoaders$AppClassLoader.loadClass(Unknown 
Source)
           at java.base/java.lang.ClassLoader.loadClass(Unknown Source)
           ... 24 more
   ```
   ```Add the stacktrace of the error.```
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to