huijunw opened a new issue #2927: [dhalion] null exception for previousCheckpoint URL: https://github.com/apache/incubator-heron/issues/2927 We saw the below exception in the HealthMgr log. The exception was from Dhalion library. The exception caused the HealthMgr process to quit. ``` [2018-06-20 20:44:11 +0000] [STDERR] stderr: Exception in thread "main" [2018-06-20 20:44:11 +0000] [STDERR] stderr: java.util.concurrent.ExecutionException: java.lang.NullPointerException [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.concurrent.FutureTask.report(FutureTask.java:122) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.concurrent.FutureTask.get(FutureTask.java:192) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at org.apache.heron.healthmgr.HealthManager.main(HealthManager.java:241) [2018-06-20 20:44:11 +0000] [STDERR] stderr: Caused by: java.lang.NullPointerException [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.time.Instant.compareTo(Instant.java:1255) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.time.Instant.isBefore(Instant.java:1285) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at com.microsoft.dhalion.policy.PoliciesExecutor.lambda$null$0(PoliciesExecutor.java:83) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.stream.ReferencePipeline$2$1.accept(ReferencePipeline.java:174) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.ArrayList$ArrayListSpliterator.forEachRemaining(ArrayList.java:1382) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:481) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:471) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.stream.ForEachOps$ForEachOp.evaluateSequential(ForEachOps.java:151) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateSequential(ForEachOps.java:174) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.stream.ReferencePipeline.forEach(ReferencePipeline.java:418) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at com.microsoft.dhalion.policy.PoliciesExecutor.lambda$start$2(PoliciesExecutor.java:84) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [2018-06-20 20:44:11 +0000] [STDERR] stderr: at java.lang.Thread.run(Thread.java:748) ``` I doubt the Null was from https://github.com/Microsoft/Dhalion/blob/0.2.1/src/main/java/com/microsoft/dhalion/policy/PoliciesExecutor.java#L74? Btw, we have several jobs and saw this issue in only one job.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
