[jira] [Commented] (STORM-1377) nimbus_auth_test: very short timeouts causing spurious failures

ASF GitHub Bot (JIRA) Wed, 16 Dec 2015 09:55:42 -0800

    [ https://issues.apache.org/jira/browse/STORM-1377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15060401#comment-15060401 ]


ASF GitHub Bot commented on STORM-1377:
---------------------------------------

Github user d2r commented on the pull request:

    https://github.com/apache/storm/pull/941#issuecomment-165193668
  
    > It's an unlikely scenario and I might be paranoid, but what if the tests 
were running on a cluster under heavy traffic? For instance, what if the tests 
were to be run on OpenStack, and the test just happened to run when there is 
unrelated heavy traffic (say, Hadoop shuffle) pressuring the switches?
    
    This should be using the loopback device.  I think the main problem here is 
that the CPU was too busy for the Thrift server to reply within 30 ms.  If 
traffic were the issue, we would see many more of our tests fail similarly.
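
The effect described in this comment can be reproduced with a minimal sketch. It is written in Python rather than against the Storm test itself, and everything in it is an assumption made for illustration: the function name `call_slow_server`, the 0.2 s delay standing in for a busy CPU, and the plain TCP socket standing in for the Thrift server. All traffic stays on the loopback interface, so a timeout here cannot be caused by network load.

```python
import socket
import threading
import time


def call_slow_server(timeout_secs, server_delay_secs):
    """Talk to a loopback server that replies only after a delay.

    Returns the reply bytes, or None if the client timed out first.
    Hypothetical helper for illustration; not part of Storm.
    """
    listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    listener.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
    listener.listen(1)
    port = listener.getsockname()[1]

    def serve():
        conn, _ = listener.accept()
        time.sleep(server_delay_secs)  # stands in for a server starved of CPU
        try:
            conn.sendall(b"ok")
        except OSError:
            pass  # the client may already have given up and closed
        finally:
            conn.close()

    worker = threading.Thread(target=serve, daemon=True)
    worker.start()

    client = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    client.settimeout(timeout_secs)
    try:
        client.connect(("127.0.0.1", port))
        return client.recv(16)
    except socket.timeout:
        return None
    finally:
        client.close()
        worker.join(timeout=5)
        listener.close()
```

With a 0.2 s server delay, `call_slow_server(0.03, 0.2)` returns `None` because the 30 ms timeout expires first, while `call_slow_server(30.0, 0.2)` returns `b"ok"`.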


> nimbus_auth_test: very short timeouts causing spurious failures
> ---------------------------------------------------------------
>
>                 Key: STORM-1377
>                 URL: https://issues.apache.org/jira/browse/STORM-1377
>             Project: Apache Storm
>          Issue Type: Bug
>          Components: storm-core
>    Affects Versions: 0.10.0, 0.11.0
>            Reporter: Derek Dagit
>            Assignee: Derek Dagit
>            Priority: Minor
>
> The failures are caused by a units mismatch.  We are waiting 30 ms for the 
> Thrift server to reply when we thought we were waiting 30 s.  This means 
> that sometimes when we expect NotAliveException, we instead get 
> TTransportException(SocketTimeoutException), and this fails the assertions.
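
The units mismatch in the description can be sketched as follows. This is an illustration in Python, not the actual test code: `client_timeout` is a hypothetical helper, and it assumes a client API whose timeout argument is in milliseconds.

```python
from datetime import timedelta


def client_timeout(timeout_ms):
    """Hypothetical helper: interprets its argument as milliseconds."""
    return timedelta(milliseconds=timeout_ms)


# What the test author intended to wait for a reply.
intended = timedelta(seconds=30)

# Bug: the bare number 30 is read as milliseconds, not seconds.
buggy = client_timeout(30)

# Fix: convert seconds to milliseconds explicitly at the call site.
fixed = client_timeout(30 * 1000)
```

Here `buggy` is a thousand times shorter than `intended`, while `fixed` equals it. Carrying the unit in the name (`timeout_ms`) or in the type (`timedelta`) makes this kind of mistake easier to spot in review.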



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
