Lokesh Jain created HUDI-7983:
---------------------------------
Summary: CDC query fails with ParanamerAnnotationIntrospector
class not found
Key: HUDI-7983
URL: https://issues.apache.org/jira/browse/HUDI-7983
Project: Apache Hudi
Issue Type: Bug
Reporter: Lokesh Jain
Upon trying out CDC query, following error is seen.
java.lang.NoClassDefFoundError:
org/apache/hudi/com/fasterxml/jackson/module/paranamer/ParanamerAnnotationIntrospector
{code:java}
scala> spark.read.option("hoodie.datasource.read.begin.instanttime", 0).
| option("hoodie.datasource.query.type", "incremental").
| option("hoodie.datasource.query.incremental.format", "cdc").
| format("hudi").load(basePath).show(false)
24/07/12 16:16:49 ERROR Executor: Exception in task 0.0 in stage 127.0 (TID 227)
java.lang.NoClassDefFoundError:
org/apache/hudi/com/fasterxml/jackson/module/paranamer/ParanamerAnnotationIntrospector
at
org.apache.hudi.cdc.InternalRowToJsonStringConverter.mapper$lzycompute(InternalRowToJsonStringConverter.scala:36)
at
org.apache.hudi.cdc.InternalRowToJsonStringConverter.mapper(InternalRowToJsonStringConverter.scala:32)
at
org.apache.hudi.cdc.InternalRowToJsonStringConverter.convert(InternalRowToJsonStringConverter.scala:50)
at
org.apache.hudi.cdc.CDCFileGroupIterator.convertRowToJsonString(CDCFileGroupIterator.scala:515)
at
org.apache.hudi.cdc.CDCFileGroupIterator.loadNext(CDCFileGroupIterator.scala:250)
at
org.apache.hudi.cdc.CDCFileGroupIterator.hasNextInternal(CDCFileGroupIterator.scala:218)
at
org.apache.hudi.cdc.CDCFileGroupIterator.hasNext(CDCFileGroupIterator.scala:239)
at
org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1.hasNext(FileScanRDD.scala:104)
at
org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1.nextIterator(FileScanRDD.scala:191)
at
org.apache.spark.sql.execution.datasources.FileScanRDD$$anon$1.hasNext(FileScanRDD.scala:104)
at scala.collection.Iterator$$anon$10.hasNext(Iterator.scala:460)
at
org.apache.spark.sql.execution.SparkPlan.$anonfun$getByteArrayRdd$1(SparkPlan.scala:349)
at
org.apache.spark.rdd.RDD.$anonfun$mapPartitionsInternal$2(RDD.scala:898)
at
org.apache.spark.rdd.RDD.$anonfun$mapPartitionsInternal$2$adapted(RDD.scala:898)
at
org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)
at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:373)
at org.apache.spark.rdd.RDD.iterator(RDD.scala:337)
at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:90)
at org.apache.spark.scheduler.Task.run(Task.scala:131)
at
org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$3(Executor.scala:506)
at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:1462)
at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:509)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)
Caused by: java.lang.ClassNotFoundException:
org.apache.hudi.com.fasterxml.jackson.module.paranamer.ParanamerAnnotationIntrospector
at java.net.URLClassLoader.findClass(URLClassLoader.java:387)
at java.lang.ClassLoader.loadClass(ClassLoader.java:418)
at java.lang.ClassLoader.loadClass(ClassLoader.java:351)
... 25 more
{code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)