sivabalan narayanan created HUDI-3621:
-----------------------------------------
Summary: DeltaStreamer throws NPE when there is no new data to
consume and clustering is enabled
Key: HUDI-3621
URL: https://issues.apache.org/jira/browse/HUDI-3621
Project: Apache Hudi
Issue Type: Task
Components: deltastreamer
Reporter: sivabalan narayanan
In continuous mode when async clustering is enabled, and if there is no new
data to consume for the first time, deltastreamer throws NPE.
{code:java}
22/03/14 14:10:02 INFO DebeziumSource: About to read 0 from Kafka for topic
:glueserver.public.onehuse_id_as_primary_key_siva
22/03/14 14:10:02 INFO DeltaSync: No new data, source checkpoint has not
changed. Nothing to commit. Old
checkpoint=(Option{val=glueserver.public.onehuse_id_as_primary_key_siva,0:10000}).
New Checkpoint=(glueserver.public.onehuse_id_as_primary_key_siva,0:10000)
22/03/14 14:10:02 ERROR HoodieDeltaStreamer: Shutting down delta-sync due to
exception
java.lang.NullPointerException
at
org.apache.hudi.utilities.deltastreamer.DeltaSync.getClusteringInstantOpt(DeltaSync.java:913)
at
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer$DeltaSyncService.lambda$startService$0(HoodieDeltaStreamer.java:668)
at
java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)
22/03/14 14:10:02 INFO HoodieDeltaStreamer: Delta Sync shutdown. Error ?true
22/03/14 14:10:02 INFO HoodieDeltaStreamer: DeltaSync shutdown. Closing write
client. Error?true
22/03/14 14:10:02 ERROR HoodieAsyncService: Service shutdown with error
java.util.concurrent.ExecutionException:
org.apache.hudi.exception.HoodieException
at
java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
at
java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1908)
at
org.apache.hudi.async.HoodieAsyncService.waitForShutdown(HoodieAsyncService.java:103)
at
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer.lambda$sync$1(HoodieDeltaStreamer.java:182)
at org.apache.hudi.common.util.Option.ifPresent(Option.java:96)
at
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer.sync(HoodieDeltaStreamer.java:179)
at
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer.main(HoodieDeltaStreamer.java:530)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at
org.apache.spark.deploy.JavaMainApplication.start(SparkApplication.scala:52)
at
org.apache.spark.deploy.SparkSubmit.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:955)
at
org.apache.spark.deploy.SparkSubmit.doRunMain$1(SparkSubmit.scala:180)
at org.apache.spark.deploy.SparkSubmit.submit(SparkSubmit.scala:203)
at org.apache.spark.deploy.SparkSubmit.doSubmit(SparkSubmit.scala:90)
at
org.apache.spark.deploy.SparkSubmit$$anon$2.doSubmit(SparkSubmit.scala:1043)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:1052)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
Caused by: org.apache.hudi.exception.HoodieException
at
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer$DeltaSyncService.lambda$startService$0(HoodieDeltaStreamer.java:690)
at
java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)
Caused by: java.lang.NullPointerException
at
org.apache.hudi.utilities.deltastreamer.DeltaSync.getClusteringInstantOpt(DeltaSync.java:913)
at
org.apache.hudi.utilities.deltastreamer.HoodieDeltaStreamer$DeltaSyncService.lambda$startService$0(HoodieDeltaStreamer.java:668)
... 4 more
22/03/14 14:10:02 INFO DeltaSync: Shutting down embedded timeline server {code}
--
This message was sent by Atlassian Jira
(v8.20.1#820001)