surahman opened a new issue #3733:
URL: https://github.com/apache/incubator-heron/issues/3733


   **Describe the bug**
   `stmgr_unittest` consistently fails with timeouts in CI, and when the tests 
run with few execution threads available.
   
   **To Reproduce**
   Steps to reproduce the behaviour:
   1. Trigger the TravisCI pipeline.
   
    or
   
   1. Run in an environment with 4 or fewer cores available (no hyper-threading).
   2. Build in an Ubuntu Docker container using the provided build script.
   
   This will not occur if you have more cores and threads at your disposal to 
run the tests.
   
   **Expected behavior**
   Ideally: a passing `stmgr_unittest`.
   
   **Isolated Issue**
   The following three tests in the test suite cause the timeouts, and 
disabling them alleviates the problem. I believe a deadlock or livelock is 
responsible, but I am still investigating to confirm the exact source of the 
problem.
   
https://github.com/apache/incubator-heron/blob/c32c09a0f09caca74dbf5715aa0632f0a50356c9/heron/stmgr/tests/cpp/server/stmgr_unittest.cpp#L960-L1068
   
https://github.com/apache/incubator-heron/blob/c32c09a0f09caca74dbf5715aa0632f0a50356c9/heron/stmgr/tests/cpp/server/stmgr_unittest.cpp#L1071-L1205
   
https://github.com/apache/incubator-heron/blob/c32c09a0f09caca74dbf5715aa0632f0a50356c9/heron/stmgr/tests/cpp/server/stmgr_unittest.cpp#L1328-L1441
   
   

