tvm18860 opened a new issue, #15514:
URL: https://github.com/apache/druid/issues/15514
### Affected Version
27.0.0
### Description
I'm ingesting HLL sketches directly from Kafka (serialized as base64
strings) with a 10-minute task duration. The task runs successfully until it
ends and tries to publish the segment. There appears to be a case where HLL
sketch merging can throw a NullPointerException. Ingestion worked for a while
and then started hitting this error. Other sketch types, including theta and
quantiles sketches, work fine, so the problem seems to be particular to HLL.
Cluster size: small, 2 data nodes, ingesting ~1000 msgs/sec
Stack trace:
```
2023-12-07T23:18:26,111 WARN [[index_kafka_test-datasource_9ae14c85aba4cc8_fcibdggb]-appenderator-merge] org.apache.druid.segment.realtime.appenderator.StreamAppenderator - Failed to push merged index for segment[test-datasource_2023-12-07T23:00:00.000Z_2023-12-08T00:00:00.000Z_2023-12-07T23:00:20.539Z_21].
java.lang.NullPointerException: null
    at org.apache.druid.query.aggregation.datasketches.hll.HllSketchAggregatorFactory$1.fold(HllSketchAggregatorFactory.java:176) ~[?:?]
    at org.apache.druid.segment.RowCombiningTimeAndDimsIterator.foldMetrics(RowCombiningTimeAndDimsIterator.java:256) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.RowCombiningTimeAndDimsIterator.combineToCurrentTimeAndDims(RowCombiningTimeAndDimsIterator.java:243) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.RowCombiningTimeAndDimsIterator.moveToNext(RowCombiningTimeAndDimsIterator.java:191) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.IndexMergerV9.mergeIndexesAndWriteColumns(IndexMergerV9.java:606) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.IndexMergerV9.makeIndexFiles(IndexMergerV9.java:234) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.IndexMergerV9.merge(IndexMergerV9.java:1156) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.IndexMergerV9.multiphaseMerge(IndexMergerV9.java:973) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.IndexMergerV9.mergeQueryableIndex(IndexMergerV9.java:915) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.realtime.appenderator.StreamAppenderator.mergeAndPush(StreamAppenderator.java:866) ~[druid-server-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.realtime.appenderator.StreamAppenderator.lambda$push$1(StreamAppenderator.java:755) ~[druid-server-2023.11.0-iap.jar:2023.11.0-iap]
    at com.google.common.util.concurrent.AbstractTransformFuture$TransformFuture.doTransform(AbstractTransformFuture.java:250) ~[guava-31.1-jre.jar:?]
    at com.google.common.util.concurrent.AbstractTransformFuture$TransformFuture.doTransform(AbstractTransformFuture.java:240) ~[guava-31.1-jre.jar:?]
    at com.google.common.util.concurrent.AbstractTransformFuture.run(AbstractTransformFuture.java:122) ~[guava-31.1-jre.jar:?]
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) ~[?:?]
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) ~[?:?]
    at java.lang.Thread.run(Thread.java:829) ~[?:?]
2023-12-07T23:18:26,118 ERROR [[index_kafka_test-datasource_9ae14c85aba4cc8_fcibdggb]-publish] org.apache.druid.indexing.seekablestream.SeekableStreamIndexTaskRunner - Error while publishing segments for sequenceNumber[SequenceMetadata{sequenceId=0, sequenceName='index_kafka_test-datasource_9ae14c85aba4cc8_0', assignments=[], startOffsets={KafkaTopicPartition{partition=6, topic='null', multiTopicPartition=false}=165491326}, exclusiveStartPartitions=[], endOffsets={KafkaTopicPartition{partition=6, topic='null', multiTopicPartition=false}=165596186}, sentinel=false, checkpointed=true}]
java.lang.RuntimeException: java.lang.NullPointerException
    at org.apache.druid.segment.realtime.appenderator.StreamAppenderator.mergeAndPush(StreamAppenderator.java:930) ~[druid-server-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.realtime.appenderator.StreamAppenderator.lambda$push$1(StreamAppenderator.java:755) ~[druid-server-2023.11.0-iap.jar:2023.11.0-iap]
    at com.google.common.util.concurrent.AbstractTransformFuture$TransformFuture.doTransform(AbstractTransformFuture.java:250) ~[guava-31.1-jre.jar:?]
    at com.google.common.util.concurrent.AbstractTransformFuture$TransformFuture.doTransform(AbstractTransformFuture.java:240) ~[guava-31.1-jre.jar:?]
    at com.google.common.util.concurrent.AbstractTransformFuture.run(AbstractTransformFuture.java:122) ~[guava-31.1-jre.jar:?]
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) ~[?:?]
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) ~[?:?]
    at java.lang.Thread.run(Thread.java:829) ~[?:?]
Caused by: java.lang.NullPointerException
    at org.apache.druid.query.aggregation.datasketches.hll.HllSketchAggregatorFactory$1.fold(HllSketchAggregatorFactory.java:176) ~[?:?]
    at org.apache.druid.segment.RowCombiningTimeAndDimsIterator.foldMetrics(RowCombiningTimeAndDimsIterator.java:256) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.RowCombiningTimeAndDimsIterator.combineToCurrentTimeAndDims(RowCombiningTimeAndDimsIterator.java:243) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.RowCombiningTimeAndDimsIterator.moveToNext(RowCombiningTimeAndDimsIterator.java:191) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.IndexMergerV9.mergeIndexesAndWriteColumns(IndexMergerV9.java:606) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.IndexMergerV9.makeIndexFiles(IndexMergerV9.java:234) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.IndexMergerV9.merge(IndexMergerV9.java:1156) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.IndexMergerV9.multiphaseMerge(IndexMergerV9.java:973) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.IndexMergerV9.mergeQueryableIndex(IndexMergerV9.java:915) ~[druid-processing-2023.11.0-iap.jar:2023.11.0-iap]
    at org.apache.druid.segment.realtime.appenderator.StreamAppenderator.mergeAndPush(StreamAppenderator.java:866) ~[druid-server-2023.11.0-iap.jar:2023.11.0-iap]
    ... 7 more
```
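To illustrate the kind of failure the top frame (`fold`) suggests, here is a minimal sketch. It is not Druid's code and I have not confirmed this is the actual cause: it assumes the combiner is handed a null value for some row and dereferences it. `NullSafeFoldSketch`, `SetUnion`, `foldUnguarded`, and `foldGuarded` are hypothetical names, and an exact `Set<String>` stands in for an HLL sketch.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class NullSafeFoldSketch {

    /** Hypothetical stand-in for a sketch union: an exact set instead of an HLL sketch. */
    static final class SetUnion {
        private final Set<String> seen = new HashSet<String>();

        void update(Set<String> other) {
            seen.addAll(other); // throws NullPointerException if other is null
        }

        int estimate() {
            return seen.size();
        }
    }

    /** Unguarded fold: dereferences whatever the row holds, null included. */
    static void foldUnguarded(SetUnion union, Set<String> rowValue) {
        union.update(rowValue);
    }

    /** Guarded fold: treats a null row value as an empty sketch and skips it. */
    static void foldGuarded(SetUnion union, Set<String> rowValue) {
        if (rowValue == null) {
            return;
        }
        union.update(rowValue);
    }

    public static void main(String[] args) {
        // Three rows sharing the same time and dimensions; the middle one has no sketch.
        List<Set<String>> rows = Arrays.asList(
            new HashSet<String>(Arrays.asList("a", "b")),
            null,
            new HashSet<String>(Arrays.asList("b", "c")));

        SetUnion guarded = new SetUnion();
        for (Set<String> row : rows) {
            foldGuarded(guarded, row);
        }
        System.out.println("guarded estimate: " + guarded.estimate());

        SetUnion unguarded = new SetUnion();
        try {
            for (Set<String> row : rows) {
                foldUnguarded(unguarded, row);
            }
        } catch (NullPointerException e) {
            System.out.println("unguarded fold threw NullPointerException");
        }
    }
}
```

If the assumption holds, this would also be consistent with ingestion working for a while and then failing: the merge only breaks once a row with a missing sketch is combined with another row.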