JiajianQtl opened a new issue #11310:
URL: https://github.com/apache/druid/issues/11310
### Affected Version
0.20 and 0.21
### Description
org.apache.druid.server.coordinator.duty.CompactSegments will not run if
org.apache.druid.server.coordinator.KillStalePendingSegments encounter any
exception.
DruidCoordinator.java
`for (CoordinatorDuty duty : duties) {
// Don't read state and run state in the same duty otherwise racy
conditions may exist
if (!coordinationPaused
&& coordLeaderSelector.isLeader()
&& startingLeaderCounter == coordLeaderSelector.localTerm()) {
final long start = System.nanoTime();
params = duty.run(params); // =====================>
exceptions may interrupt the following duty
final long end = System.nanoTime();
if (params == null) {
// This duty wanted to cancel the run. No log message, since
the duty should have logged a reason.
return;
} else {
params.getCoordinatorStats().addToDutyStat("runtime",
duty.getClass().getName(), TimeUnit.NANOSECONDS.toMillis(end - start));
}
}`
Error messages:
`2021-05-27 04:45:58,633 ERROR
org.apache.druid.server.coordinator.DruidCoordinator: Caught exception,
ignoring so that schedule keeps going.:
{class=org.apache.druid.server.coordinator.DruidCoordinator,
exceptionType=class java.lang.RuntimeException,
exceptionMessage=org.apache.druid.java.util.common.ISE: Error while killing
pendingSegments of dataSource[foo] created until [2021-05-25T13:04:17.774Z]}
java.lang.RuntimeException: org.apache.druid.java.util.common.ISE: Error
while killing pendingSegments of dataSource[foo] created until
[2021-05-25T13:04:17.774Z]
at
org.apache.druid.client.indexing.HttpIndexingServiceClient.killPendingSegments(HttpIndexingServiceClient.java:389)
~[druid-server-0.21.0.jar:0.21.0]
at
org.apache.druid.server.coordinator.KillStalePendingSegments.run(KillStalePendingSegments.java:79)
~[druid-server-0.21.0.jar:0.21.0]
at
org.apache.druid.server.coordinator.DruidCoordinator$DutiesRunnable.run(DruidCoordinator.java:815)
~[druid-server-0.21.0.jar:0.21.0]
at
org.apache.druid.server.coordinator.DruidCoordinator$2.call(DruidCoordinator.java:638)
~[druid-server-0.21.0.jar:0.21.0]
at
org.apache.druid.server.coordinator.DruidCoordinator$2.call(DruidCoordinator.java:631)
~[druid-server-0.21.0.jar:0.21.0]
at
org.apache.druid.java.util.common.concurrent.ScheduledExecutors$4.run(ScheduledExecutors.java:163)
~[druid-core-0.21.0.jar:0.21.0]
at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
~[?:1.8.0_191]
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
~[?:1.8.0_191]
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
~[?:1.8.0_191]
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
~[?:1.8.0_191]
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
[?:1.8.0_191]
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
[?:1.8.0_191]
at java.lang.Thread.run(Thread.java:748) [?:1.8.0_191]
Caused by: org.apache.druid.java.util.common.ISE: Error while killing
pendingSegments of dataSource[foo] created until [2021-05-25T13:04:17.774Z]
at
org.apache.druid.client.indexing.HttpIndexingServiceClient.killPendingSegments(HttpIndexingServiceClient.java:377)
~[druid-server-0.21.0.jar:0.21.0]
... 12 more
`
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]