James Peach created MESOS-9212: ---------------------------------- Summary: Subprocess tests fail with libev 4.24 Key: MESOS-9212 URL: https://issues.apache.org/jira/browse/MESOS-9212 Project: Mesos Issue Type: Bug Reporter: James Peach
On Fedora 28, building against the system version of libev (version 4.24) causes the following tests to fail: The following tests fail: {noformat} [ FAILED ] ReapTest.NonChildProcess [ FAILED ] ReapTest.ChildProcess [ FAILED ] ReapTest.TerminatedChildProcess [ FAILED ] SubprocessTest.PipeOutputToFileDescriptor [ FAILED ] SubprocessTest.PipeOutputToPath [ FAILED ] SubprocessTest.EnvironmentEcho [ FAILED ] SubprocessTest.Status [ FAILED ] SubprocessTest.PipeOutput [ FAILED ] SubprocessTest.PipeLargeOutput [ FAILED ] SubprocessTest.PipeInput [ FAILED ] SubprocessTest.PipeRedirect [ FAILED ] SubprocessTest.PathOutput [ FAILED ] SubprocessTest.PathInput [ FAILED ] SubprocessTest.FdOutput [ FAILED ] SubprocessTest.FdInput [ FAILED ] SubprocessTest.Default [ FAILED ] SubprocessTest.Flags [ FAILED ] SubprocessTest.Environment [ FAILED ] SubprocessTest.EnvironmentWithSpaces [ FAILED ] SubprocessTest.EnvironmentWithSpacesAndQuotes [ FAILED ] SubprocessTest.EnvironmentOverride {noformat} This build configuration succeeds: {noformat} $ ../configure --disable-java --disable-python --enable-silent-rules --disable-hardening --disable-werror --disable-libtool-wrappers --enable-xfs-disk-isolator --enable-install-module-dependencies --enable-port-mapping-isolator --enable-network-ports-isolator --with-protobuf=/usr --with-curl=/usr --with-libarchive=/usr --with-zookeeper=/usr --prefix=/opt/mesos "CXXFLAGS=-O0 -ggdb3 -fno-omit-frame-pointer -fvisibility-inlines-hidden -Wno-unused-local-typedefs -Wno-deprecated" "CFLAGS=-O0 -ggdb3 -fno-omit-frame-pointer -Wno-unused-local-typedefs -Wno-deprecated" LDFLAGS= CXX=/home/jpeach/src/asf-mesos/build/c++ CC=/home/jpeach/src/asf-mesos/build/cc LD=/home/jpeach/src/asf-mesos/build/ld {noformat} This build configuration fails: {noformat} $ ../configure --disable-java --disable-python --enable-silent-rules --disable-hardening --disable-werror --disable-libtool-wrappers --enable-xfs-disk-isolator --enable-install-module-dependencies --enable-port-mapping-isolator --enable-network-ports-isolator --with-protobuf=/usr --with-curl=/usr --with-libarchive=/usr --with-zookeeper=/usr --prefix=/opt/mesos "CXXFLAGS=-O0 -ggdb3 -fno-omit-frame-pointer -fvisibility-inlines-hidden -Wno-unused-local-typedefs -Wno-deprecated" "CFLAGS=-O0 -ggdb3 -fno-omit-frame-pointer -Wno-unused-local-typedefs -Wno-deprecated" LDFLAGS= CXX=/home/jpeach/src/asf-mesos/build/c++ CC=/home/jpeach/src/asf-mesos/build/cc LD=/home/jpeach/src/asf-mesos/build/ld --with-libev=/usr {noformat} I think what happens here is that the child process gets reaped wrongly somehow: {noformat} [==========] Running 1 test from 1 test case. [----------] Global test environment set-up. [----------] 1 test from SubprocessTest [ RUN ] SubprocessTest.EnvironmentWithSpaces [pid 25909] clone(child_stack=NULL, flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD, child_tidptr=0x7fa11881fcd0) = 25923 strace: Process 25923 attached [pid 25923] execve("/usr/bin/sh", ["sh", "-c", "echo $MESSAGE"], 0x1ff3950 /* 1 var */) = 0 [pid 25923] arch_prctl(ARCH_SET_FS, 0x7f24561c5740) = 0 [pid 25923] exit_group(0) = ? [pid 25923] +++ exited with 0 +++ [pid 25909] --- SIGCHLD {si_signo=SIGCHLD, si_code=CLD_EXITED, si_pid=25923, si_uid=9306, si_status=0, si_utime=0, si_stime=0} --- [pid 25922] wait4(-1, [{WIFEXITED(s) && WEXITSTATUS(s) == 0}], WNOHANG|WSTOPPED|WCONTINUED, NULL) = 25923 [pid 25922] wait4(-1, 0x7fa10a74da44, WNOHANG|WSTOPPED|WCONTINUED, NULL) = -1 ECHILD (No child processes) [pid 25919] wait4(25923, 0x7fa10bf50548, WNOHANG, NULL) = -1 ECHILD (No child processes) ../../../3rdparty/libprocess/src/tests/subprocess_tests.cpp:977: Failure (s->status()).get() is NONE [ FAILED ] SubprocessTest.EnvironmentWithSpaces (12 ms) [----------] 1 test from SubprocessTest (12 ms total) [----------] Global test environment tear-down [==========] 1 test from 1 test case ran. (12 ms total) [ PASSED ] 0 tests. [ FAILED ] 1 test, listed below: [ FAILED ] SubprocessTest.EnvironmentWithSpaces {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)