[ https://issues.apache.org/jira/browse/STORM-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated STORM-3321: ---------------------------------- Labels: pull-request-available (was: ) > Tests are flaky due to long timeouts in Nimbus and supervisor when using > LocalCluster > ------------------------------------------------------------------------------------- > > Key: STORM-3321 > URL: https://issues.apache.org/jira/browse/STORM-3321 > Project: Apache Storm > Issue Type: Bug > Affects Versions: 2.0.0 > Reporter: Stig Rohde Døssing > Assignee: Stig Rohde Døssing > Priority: Major > Labels: pull-request-available > > Tests will sometimes fail with timeout when using e.g. > Testing.completeTopology. > The issue is that the timeout is 10 seconds, and Nimbus and the supervisor > both have timers that monitor for new deployments that are also set to 10 > seconds. This causes tests to time out because a lot of the test time is > wasted waiting for Nimbus/the supervisors to catch that the test topology is > deployed. > We should reduce these timeouts to their minimums. > There is also a race in Nimbus that can cause test failures > {quote} > 2019-01-21 02:00:19.587 [main] WARN org.apache.storm.daemon.nimbus.Nimbus - > Topology submission exception. (topology > name='topologytest-45f5ad59-ec16-45a4-ba4a-eea992411cc1') > java.lang.RuntimeException: not a leader, current leader is > NimbusInfo{host='DESKTOP-AGC8TKM', port=6627, isLeader=true} > at > org.apache.storm.daemon.nimbus.Nimbus.assertIsLeader(Nimbus.java:1525) > ~[classes/:?] > at > org.apache.storm.daemon.nimbus.Nimbus.submitTopologyWithOpts(Nimbus.java:2982) > ~[classes/:?] > at > org.apache.storm.daemon.nimbus.Nimbus.submitTopology(Nimbus.java:2965) > ~[classes/:?] > at org.apache.storm.LocalCluster.submitTopology(LocalCluster.java:444) > ~[classes/:?] > at org.apache.storm.LocalCluster.submitTopology(LocalCluster.java:125) > ~[classes/:?] > at org.apache.storm.Testing.completeTopology(Testing.java:424) > ~[classes/:?] > {quote} > The issue is that Nimbus has to acquire leadership in order to submit > topologies, but LocalCluster doesn't wait for the Nimbus instance it creates > to gain leadership. > We should make LocalCluster wait for Nimbus to gain leadership. -- This message was sent by Atlassian JIRA (v7.6.3#76005)