brianbushree opened a new issue #9845:
URL: https://github.com/apache/druid/issues/9845


   ### Affected Version
   
   0.17.0
   
   ### Description
   
   We're ingesting data from Kafka and have hit this exception a couple of times now. Here's how it normally surfaces:
   
   We see Druid metrics indicating that 2 tasks failed, and we use the Druid UI to find the IDs of the failed tasks. Around the same time, we notice a spike in ingest lag.
   
   After some investigation, we find the exception below in one of the tasks. However, the other task that the UI listed as failed has no exceptions in its log.
   
   Also, for both tasks, we've noticed that report.json says the task was COMPLETED with no error message.
   
   ```
   org.apache.druid.java.util.common.ISE: Attempting to publish with empty segment set, but total row count was not 0: [2202792].
       at org.apache.druid.segment.realtime.appenderator.BaseAppenderatorDriver.publishInBackground(BaseAppenderatorDriver.java:566)
       at org.apache.druid.segment.realtime.appenderator.StreamAppenderatorDriver.lambda$publish$1(StreamAppenderatorDriver.java:281)
       at org.apache.druid.java.util.common.concurrent.ListenableFutures$1.onSuccess(ListenableFutures.java:50)
       at com.google.common.util.concurrent.Futures$4.run(Futures.java:1181)
       at com.google.common.util.concurrent.MoreExecutors$SameThreadExecutorService.execute(MoreExecutors.java:297)
       at com.google.common.util.concurrent.ExecutionList.executeListener(ExecutionList.java:156)
       at com.google.common.util.concurrent.ExecutionList.execute(ExecutionList.java:145)
       at com.google.common.util.concurrent.AbstractFuture.set(AbstractFuture.java:185)
       at com.google.common.util.concurrent.Futures$ChainingListenableFuture$1.run(Futures.java:872)
       at com.google.common.util.concurrent.MoreExecutors$SameThreadExecutorService.execute(MoreExecutors.java:297)
       at com.google.common.util.concurrent.Futures$ImmediateFuture.addListener(Futures.java:102)
       at com.google.common.util.concurrent.Futures$ChainingListenableFuture.run(Futures.java:868)
       at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
       at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
       at java.lang.Thread.run(Thread.java:748)
   ```
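
For anyone trying to check whether their own task logs contain the same failure, here is a minimal sketch that scans log text for the ISE message in the stack trace above and pulls out the reported row count. The helper name `find_empty_publish_rows` is hypothetical and not part of Druid; the sketch only assumes the message wording matches the one quoted in this issue.

```python
import re

# Matches the ISE message quoted in this issue. \s+ tolerates the message
# being wrapped across lines by a log viewer or mail client.
EMPTY_PUBLISH = re.compile(
    r"Attempting to publish with empty\s+segment set,\s+"
    r"but total row count was not 0:\s+\[(\d+)\]"
)


def find_empty_publish_rows(log_text):
    """Return the row count reported by each matching exception in log_text."""
    return [int(match.group(1)) for match in EMPTY_PUBLISH.finditer(log_text)]
```

Running it over the text of each failed task's log shows which tasks actually hit this exception and how many rows were involved; an empty list means the message does not appear in that log.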
   
   A couple of questions:
   1. What does this exception mean, and what's the best way to troubleshoot it?
   2. Why does report.json indicate that there were no errors?
   3. Could a task failure caused by an exception like this cause a separate task to fail, for example if both tasks were running on the same node?

