Benno Evers created MESOS-9465:
----------------------------------
Summary: ProcessRemoteLinkTest.RemoteStaleLinkRelink is flaky again
Key: MESOS-9465
URL: https://issues.apache.org/jira/browse/MESOS-9465
Project: Mesos
Issue Type: Bug
Environment: Mac OSX with SSL enabled
Reporter: Benno Evers
The test failed with the following error in our internal CI:
{noformat}
[ RUN ] ProcessRemoteLinkTest.RemoteStaleLinkRelink
[warn] kq_init: detected broken kqueue; not using.: No such process
WARNING: Logging before InitGoogleLogging() is written to STDERR
I1210 10:34:07.134811 351110592 process.cpp:1239] libprocess is initialized on 10.0.49.4:58630 with 8 worker threads
I1210 10:34:07.137801 109821952 test_linkee.cpp:73] EXIT with status 0:
../../../3rdparty/libprocess/src/tests/process_tests.cpp:1176: Failure
Mock function called more times than expected - returning directly.
Function call: exited(@0x7f9ef7f0d888 (1)@10.0.49.4:58631)
Expected: to be called once
Actual: called twice - over-saturated and active
W1210 10:34:07.139040 95457280 process.cpp:838] Failed to recv on socket 8 to peer 'unknown': Connection reset by peer
[ FAILED ] ProcessRemoteLinkTest.RemoteStaleLinkRelink (22 ms)
{noformat}
Interestingly, other output from the same CI run shows that several similar
tests also hit serious errors (connection resets and, in one case, "Too many
open files") but still passed:
{noformat}
[ RUN ] ProcessRemoteLinkTest.RemoteDoubleLinkRelink
[warn] kq_init: detected broken kqueue; not using.: No such process
WARNING: Logging before InitGoogleLogging() is written to STDERR
I1210 10:34:06.945520 368641472 process.cpp:1239] libprocess is initialized on 10.0.49.4:58618 with 8 worker threads
W1210 10:34:06.948437 95457280 process.cpp:838] Failed to recv on socket 8 to peer 'unknown': Connection reset by peer
W1210 10:34:06.948755 95457280 process.cpp:1423] Failed to recv on socket 11 to peer 'unknown': Connection reset by peer
[ OK ] ProcessRemoteLinkTest.RemoteDoubleLinkRelink (21 ms)
[ RUN ] ProcessRemoteLinkTest.RemoteLinkLeak
[warn] kq_init: detected broken kqueue; not using.: No such process
WARNING: Logging before InitGoogleLogging() is written to STDERR
I1210 10:34:06.966291 379131328 process.cpp:1239] libprocess is initialized on 10.0.49.4:58623 with 8 worker threads
W1210 10:34:07.055934 300283328 process.cpp:1587] Failed to link to '10.0.49.4:58624', create socket: Failed to create socket: Too many open files
W1210 10:34:07.096643 95457280 process.cpp:838] Failed to recv on socket 8 to peer 'unknown': Connection reset by peer
[ OK ] ProcessRemoteLinkTest.RemoteLinkLeak (148 ms)
[ RUN ] ProcessRemoteLinkTest.RemoteUseStaleLink
[warn] kq_init: detected broken kqueue; not using.: No such process
WARNING: Logging before InitGoogleLogging() is written to STDERR
I1210 10:34:07.114372 219854272 process.cpp:1239] libprocess is initialized on 10.0.49.4:58626 with 8 worker threads
W1210 10:34:07.117367 95457280 process.cpp:838] Failed to recv on socket 8 to peer 'unknown': Connection reset by peer
[ OK ] ProcessRemoteLinkTest.RemoteUseStaleLink (20 ms)
{noformat}