kgyrtkirk commented on PR #17861: URL: https://github.com/apache/druid/pull/17861#issuecomment-2792465860
downloaded the heapdump - the issue was caused by huge exceptions 5M each - and surefire tried to hold all of them in a map....which have lead to an OOM the underlying exception is a diff related exception; but it contains the following exception(s): ``` > WHERE time_floor(__time, 'PT1H') BETWEEN timestamp '2019-08-25 00:00:00' AND timestamp '2019-08-25 06:00:00'": Remote driver error: QueryInterruptedException: Leaks happened, each suppressed exception represents one code path that checked out an object and didn't return it. -> RuntimeException: Leaks happened, each suppressed exception represents one code path that checked out an object and didn't return it. > at org.apache.calcite.avatica.Helper.createException(Helper.java:54) > at org.apache.calcite.avatica.Helper.createException(Helper.java:41) [...] > Suppressed: org.apache.druid.collections.StupidPool$LeakedException: Originally checked out by thread [Test-runner-processing-pool-msq-worker[00000000-0000-0000-0000-000000000000_2_0]] > at org.apache.druid.collections.StupidPool.take(StupidPool.java:162) > at org.apache.druid.segment.CompressedPools.getByteBuf(CompressedPools.java:110) > at org.apache.druid.segment.data.DecompressingByteBufferObjectStrategy.fromByteBuffer(DecompressingByteBufferObjectStrategy.java:70) > at org.apache.druid.segment.data.DecompressingByteBufferObjectStrategy.fromByteBuffer(DecompressingByteBufferObjectStrategy.java:30) > at org.apache.druid.segment.data.GenericIndexed$BufferIndexed.get(GenericIndexed.java:598) > at org.apache.druid.segment.data.BlockLayoutColumnarLongsSupplier$1.loadBuffer(BlockLayoutColumnarLongsSupplier.java:103) > at org.apache.druid.segment.data.BlockLayoutColumnarLongsSupplier$1.get(BlockLayoutColumnarLongsSupplier.java:90) > at org.apache.druid.segment.column.LongsColumn.getLongSingleValueRow(LongsColumn.java:77) > at org.apache.druid.segment.QueryableIndexCursorHolder.asCursor(QueryableIndexCursorHolder.java:185) > at org.apache.druid.msq.querykit.scan.ScanQueryFrameProcessor.runWithSegment(ScanQueryFrameProcessor.java:271) > at org.apache.druid.msq.querykit.BaseLeafFrameProcessor.runIncrementally(BaseLeafFrameProcessor.java:88) > at org.apache.druid.msq.querykit.scan.ScanQueryFrameProcessor.runIncrementally(ScanQueryFrameProcessor.java:157) > at org.apache.druid.msq.counters.CpuTimeAccumulatingFrameProcessor.runIncrementally(CpuTimeAccumulatingFrameProcessor.java:66) > at org.apache.druid.frame.processor.FrameProcessors$1FrameProcessorWithBaggage.runIncrementally(FrameProcessors.java:72) > at org.apache.druid.frame.processor.FrameProcessorExecutor$1ExecutorRunnable.runProcessorNow(FrameProcessorExecutor.java:239) > at org.apache.druid.frame.processor.FrameProcessorExecutor$1ExecutorRunnable.run(FrameProcessorExecutor.java:141) > at org.apache.druid.msq.exec.WorkerImpl$2$2.run(WorkerImpl.java:900) > ... 3 more [this repeats around a 40 times] ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
