tvm18860 opened a new issue, #15514:
URL: https://github.com/apache/druid/issues/15514

   ### Affected Version
   
   27.0.0
   
   ### Description
   
   I'm ingesting HLL sketches directly from Kafka (serialized as base64 
strings) with a 10 minute task duration, and it runs successfully until the 
task ends and it tries to publish the segment. There appears to be a case where 
the HLL sketch merging can NPE. It worked for a while but then started hitting 
this error. Other sketch types, including theta and quantiles sketches work 
fine, so it seems to be particular to HLL. 
   
   Cluster size: Small, 2x data nodes, ingesting ~ 1000 msgs/sec
   
   Stack trace:
   ```
   2023-12-07T23:18:26,111 WARN 
[[index_kafka_test-datasource_9ae14c85aba4cc8_fcibdggb]-appenderator-merge] 
org.apache.druid.segment.realtime.appenderator.StreamAppenderator - Failed to 
push merged index for 
segment[test-datasource_2023-12-07T23:00:00.000Z_2023-12-08T00:00:00.000Z_2023-12-07T23:00:20.539Z_21].
   java.lang.NullPointerException: null
        at 
org.apache.druid.query.aggregation.datasketches.hll.HllSketchAggregatorFactory$1.fold(HllSketchAggregatorFactory.java:176)
 ~[?:?]
        at 
org.apache.druid.segment.RowCombiningTimeAndDimsIterator.foldMetrics(RowCombiningTimeAndDimsIterator.java:256)
 ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.RowCombiningTimeAndDimsIterator.combineToCurrentTimeAndDims(RowCombiningTimeAndDimsIterator.java:243)
 ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.RowCombiningTimeAndDimsIterator.moveToNext(RowCombiningTimeAndDimsIterator.java:191)
 ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.IndexMergerV9.mergeIndexesAndWriteColumns(IndexMergerV9.java:606)
 ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.IndexMergerV9.makeIndexFiles(IndexMergerV9.java:234) 
~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.IndexMergerV9.merge(IndexMergerV9.java:1156) 
~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.IndexMergerV9.multiphaseMerge(IndexMergerV9.java:973) 
~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.IndexMergerV9.mergeQueryableIndex(IndexMergerV9.java:915)
 ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.realtime.appenderator.StreamAppenderator.mergeAndPush(StreamAppenderator.java:866)
 ~[druid-server-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.realtime.appenderator.StreamAppenderator.lambda$push$1(StreamAppenderator.java:755)
 ~[druid-server-2023.11.0-iap.jar:2023.11.0-iap]
        at 
com.google.common.util.concurrent.AbstractTransformFuture$TransformFuture.doTransform(AbstractTransformFuture.java:250)
 ~[guava-31.1-jre.jar:?]
        at 
com.google.common.util.concurrent.AbstractTransformFuture$TransformFuture.doTransform(AbstractTransformFuture.java:240)
 ~[guava-31.1-jre.jar:?]
        at 
com.google.common.util.concurrent.AbstractTransformFuture.run(AbstractTransformFuture.java:122)
 ~[guava-31.1-jre.jar:?]
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) 
~[?:?]
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) 
~[?:?]
        at java.lang.Thread.run(Thread.java:829) ~[?:?]
   2023-12-07T23:18:26,118 ERROR 
[[index_kafka_test-datasource_9ae14c85aba4cc8_fcibdggb]-publish] 
org.apache.druid.indexing.seekablestream.SeekableStreamIndexTaskRunner - Error 
while publishing segments for sequenceNumber[SequenceMetadata{sequenceId=0, 
sequenceName='index_kafka_test-datasource_9ae14c85aba4cc8_0', assignments=[], 
startOffsets={KafkaTopicPartition{partition=6, topic='null', 
multiTopicPartition=false}=165491326}, exclusiveStartPartitions=[], 
endOffsets={KafkaTopicPartition{partition=6, topic='null', 
multiTopicPartition=false}=165596186}, sentinel=false, checkpointed=true}]
   java.lang.RuntimeException: java.lang.NullPointerException
        at 
org.apache.druid.segment.realtime.appenderator.StreamAppenderator.mergeAndPush(StreamAppenderator.java:930)
 ~[druid-server-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.realtime.appenderator.StreamAppenderator.lambda$push$1(StreamAppenderator.java:755)
 ~[druid-server-2023.11.0-iap.jar:2023.11.0-iap]
        at 
com.google.common.util.concurrent.AbstractTransformFuture$TransformFuture.doTransform(AbstractTransformFuture.java:250)
 ~[guava-31.1-jre.jar:?]
        at 
com.google.common.util.concurrent.AbstractTransformFuture$TransformFuture.doTransform(AbstractTransformFuture.java:240)
 ~[guava-31.1-jre.jar:?]
        at 
com.google.common.util.concurrent.AbstractTransformFuture.run(AbstractTransformFuture.java:122)
 ~[guava-31.1-jre.jar:?]
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) 
~[?:?]
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) 
~[?:?]
        at java.lang.Thread.run(Thread.java:829) ~[?:?]
   Caused by: java.lang.NullPointerException
        at 
org.apache.druid.query.aggregation.datasketches.hll.HllSketchAggregatorFactory$1.fold(HllSketchAggregatorFactory.java:176)
 ~[?:?]
        at 
org.apache.druid.segment.RowCombiningTimeAndDimsIterator.foldMetrics(RowCombiningTimeAndDimsIterator.java:256)
 ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.RowCombiningTimeAndDimsIterator.combineToCurrentTimeAndDims(RowCombiningTimeAndDimsIterator.java:243)
 ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.RowCombiningTimeAndDimsIterator.moveToNext(RowCombiningTimeAndDimsIterator.java:191)
 ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.IndexMergerV9.mergeIndexesAndWriteColumns(IndexMergerV9.java:606)
 ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.IndexMergerV9.makeIndexFiles(IndexMergerV9.java:234) 
~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.IndexMergerV9.merge(IndexMergerV9.java:1156) 
~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.IndexMergerV9.multiphaseMerge(IndexMergerV9.java:973) 
~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.IndexMergerV9.mergeQueryableIndex(IndexMergerV9.java:915)
 ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
        at 
org.apache.druid.segment.realtime.appenderator.StreamAppenderator.mergeAndPush(StreamAppenderator.java:866)
 ~[druid-server-2023.11.0-iap.jar:2023.11.0-iap]
        ... 7 more
   ```
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to