Till Toenshoff created MESOS-3975:
-------------------------------------

             Summary: SSL build of mesos causes flaky testsuite.
                 Key: MESOS-3975
                 URL: https://issues.apache.org/jira/browse/MESOS-3975
             Project: Mesos
          Issue Type: Bug
    Affects Versions: 0.26.0
         Environment: CentOS 7.1, Kernel 3.10.0-229.20.1.el7.x86_64, gcc 4.8.3, 
Docker 1.9
            Reporter: Till Toenshoff
            Assignee: Joris Van Remoortere


When running the tests of an SSL build of Mesos on CentOS 7.1, I see spurious 
test failures that are, so far, not reproducible.

The following tests did fail for me in complete runs but did seem fine when 
running them individually, in repetition.  

{noformat}
DockerTest.ROOT_DOCKER_CheckPortResource
{noformat}

{noformat}
ContainerizerTest.ROOT_CGROUPS_BalloonFramework
{noformat}

{noformat}
[ RUN      ] 
LinuxFilesystemIsolatorTest.ROOT_ChangeRootFilesystemCommandExecutor
2015-11-20 
19:08:38,826:21380(0x7fa10d5f2700):ZOO_ERROR@handle_socket_error_msg@1697: 
Socket [127.0.0.1:53444] zk retcode=-4, errno=111(Connection refused): server 
refused to accept the client
+ /home/vagrant/mesos/build/src/mesos-containerizer mount --help=false 
--operation=make-rslave --path=/
+ grep -E 
/tmp/LinuxFilesystemIsolatorTest_ROOT_ChangeRootFilesystemCommandExecutor_Tz7P8c/.+
 /proc/self/mountinfo
+ grep -v 2b98025c-74f1-41d2-b35a-ce2cdfae347e
+ cut '-d ' -f5
+ xargs --no-run-if-empty umount -l
+ mount -n --rbind 
/tmp/LinuxFilesystemIsolatorTest_ROOT_ChangeRootFilesystemCommandExecutor_Tz7P8c/provisioner/containers/2b98025c-74f1-41d2-b35a-ce2cdfae347e/backends/copy/rootfses/bed11080-474b-4c69-8e7f-0ab85e895b0d
 
/tmp/LinuxFilesystemIsolatorTest_ROOT_ChangeRootFilesystemCommandExecutor_Tz7P8c/slaves/830e842e-c36a-4e4c-bff4-5b9568d7df12-S0/frameworks/830e842e-c36a-4e4c-bff4-5b9568d7df12-0000/executors/c735be54-c47f-4645-bfc1-2f4647e2cddb/runs/2b98025c-74f1-41d2-b35a-ce2cdfae347e/.rootfs
Could not load cert file
../../src/tests/containerizer/filesystem_isolator_tests.cpp:354: Failure
Value of: statusRunning.get().state()
  Actual: TASK_FAILED
Expected: TASK_RUNNING
2015-11-20 
19:08:42,164:21380(0x7fa10d5f2700):ZOO_ERROR@handle_socket_error_msg@1697: 
Socket [127.0.0.1:53444] zk retcode=-4, errno=111(Connection refused): server 
refused to accept the client
2015-11-20 
19:08:45,501:21380(0x7fa10d5f2700):ZOO_ERROR@handle_socket_error_msg@1697: 
Socket [127.0.0.1:53444] zk retcode=-4, errno=111(Connection refused): server 
refused to accept the client
2015-11-20 
19:08:48,837:21380(0x7fa10d5f2700):ZOO_ERROR@handle_socket_error_msg@1697: 
Socket [127.0.0.1:53444] zk retcode=-4, errno=111(Connection refused): server 
refused to accept the client
2015-11-20 
19:08:52,174:21380(0x7fa10d5f2700):ZOO_ERROR@handle_socket_error_msg@1697: 
Socket [127.0.0.1:53444] zk retcode=-4, errno=111(Connection refused): server 
refused to accept the client
../../src/tests/containerizer/filesystem_isolator_tests.cpp:355: Failure
Failed to wait 15secs for statusFinished
../../src/tests/containerizer/filesystem_isolator_tests.cpp:349: Failure
Actual function call count doesn't match EXPECT_CALL(sched, 
statusUpdate(&driver, _))...
         Expected: to be called twice
           Actual: called once - unsatisfied and active
2015-11-20 
19:08:55,511:21380(0x7fa10d5f2700):ZOO_ERROR@handle_socket_error_msg@1697: 
Socket [127.0.0.1:53444] zk retcode=-4, errno=111(Connection refused): server 
refused to accept the client
*** Aborted at 1448046536 (unix time) try "date -d @1448046536" if you are 
using GNU date ***
PC: @                0x0 (unknown)
*** SIGSEGV (@0x0) received by PID 21380 (TID 0x7fa1549e68c0) from PID 0; stack 
trace: ***
    @     0x7fa141796fbb (unknown)
    @     0x7fa14179b341 (unknown)
    @     0x7fa14f096130 (unknown)
{noformat}


Vagrantfile generator:
{noformat}
cat << EOF > Vagrantfile
# -*- mode: ruby -*-" >
# vi: set ft=ruby :
Vagrant.configure(2) do |config|
  # Disable shared folder to prevent certain kernel module dependencies.
  config.vm.synced_folder ".", "/vagrant", disabled: true

  config.vm.hostname = "centos71"

  config.vm.box = "bento/centos-7.1"

  config.vm.provider "virtualbox" do |vb|
    vb.memory = 16384
    vb.cpus = 8
  end

  config.vm.provider "vmware_fusion" do |vb|
    vb.memory = 9216
    vb.cpus = 4
  end

  config.vm.provision "shell", inline: <<-SHELL

     sudo yum -y update systemd

     sudo yum install -y tar wget
     sudo wget 
http://repos.fedorapeople.org/repos/dchen/apache-maven/epel-apache-maven.repo 
-O /etc/yum.repos.d/epel-apache-maven.repo

     sudo yum groupinstall -y "Development Tools"
     sudo yum install -y apache-maven python-devel java-1.7.0-openjdk-devel 
zlib-devel libcurl-devel openssl-devel cyrus-sasl-devel cyrus-sasl-md5 
apr-devel subversion-devel apr-util-devel

     sudo yum install libevent-devel

     sudo yum install -y git

     sudo yum install -y docker
     sudo service docker start
     sudo docker info

     #sudo wget -qO- https://get.docker.com/ | sh

  SHELL
end
EOF

vagrant up
vagrant reload

vagrant ssh -c "
git clone  https://github.com/apache/mesos.git mesos
cd mesos
git checkout -b 0.26.0-rc1 0.26.0-rc1

./bootstrap
mkdir build
cd build

../configure --enable-libevent --enable-ssl
GTEST_FILTER="" make check
sudo ./bin/mesos-tests.sh
"
{noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to