Alexander Rojas created MESOS-4044:
--------------------------------------
Summary: SlaveRecoveryTest/0.Reboot is flaky
Key: MESOS-4044
URL: https://issues.apache.org/jira/browse/MESOS-4044
Project: Mesos
Issue Type: Bug
Components: slave
Environment: Debian 8 on VirtualBox
{{configure --enable-debug --enable-ssl --enable-libevent}}
Reporter: Alexander Rojas
Running the test program as:
{code}
sudo src/mesos-tests --gtest_filter="SlaveRecoveryTest/0.Reboot"
--gtest_repeat=100 --verbose --gtest_break_on_failure
{code}
ends up every time at some point with the failure:
{noformat}
[ RUN ] SlaveRecoveryTest/0.Reboot
I1202 15:18:00.036594 26328 leveldb.cpp:176] Opened db in 12.924775ms
I1202 15:18:00.037643 26328 leveldb.cpp:183] Compacted db in 980477ns
I1202 15:18:00.037693 26328 leveldb.cpp:198] Created db iterator in 15079ns
I1202 15:18:00.037706 26328 leveldb.cpp:204] Seeked to beginning of db in 1356ns
I1202 15:18:00.037716 26328 leveldb.cpp:273] Iterated through 0 keys in the db
in 313ns
I1202 15:18:00.037753 26328 replica.cpp:780] Replica recovered with log
positions 0 -> 0 with 1 holes and 0 unlearned
I1202 15:18:00.038360 26346 recover.cpp:449] Starting replica recovery
I1202 15:18:00.040987 26346 master.cpp:367] Master
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8 (debian-vm.localdomain) started on
127.0.1.1:33625
I1202 15:18:00.040998 26346 master.cpp:369] Flags at startup: --acls=""
--allocation_interval="1secs" --allocator="HierarchicalDRF"
--authenticate="true" --authenticate_slaves="true" --authenticators="crammd5"
--authorizers="local" --credentials="/tmp/xt1N2F/credentials"
--framework_sorter="drf" --help="false" --hostname_lookup="true"
--initialize_driver_logging="true" --log_auto_initialize="true"
--logbufsecs="0" --logging_level="INFO" --max_slave_ping_timeouts="5"
--quiet="false" --recovery_slave_removal_limit="100%"
--registry="replicated_log" --registry_fetch_timeout="1mins"
--registry_store_timeout="25secs" --registry_strict="true"
--root_submissions="true" --slave_ping_timeout="15secs"
--slave_reregister_timeout="10mins" --user_sorter="drf" --version="false"
--webui_dir="/usr/local/share/mesos/webui" --work_dir="/tmp/xt1N2F/master"
--zk_session_timeout="10secs"
I1202 15:18:00.041157 26346 master.cpp:414] Master only allowing authenticated
frameworks to register
I1202 15:18:00.041163 26346 master.cpp:419] Master only allowing authenticated
slaves to register
I1202 15:18:00.041168 26346 credentials.hpp:37] Loading credentials for
authentication from '/tmp/xt1N2F/credentials'
I1202 15:18:00.041410 26346 master.cpp:458] Using default 'crammd5'
authenticator
I1202 15:18:00.041524 26346 master.cpp:495] Authorization enabled
I1202 15:18:00.042917 26343 recover.cpp:475] Replica is in EMPTY status
I1202 15:18:00.043557 26343 master.cpp:1606] The newly elected leader is
[email protected]:33625 with id baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8
I1202 15:18:00.043577 26343 master.cpp:1619] Elected as the leading master!
I1202 15:18:00.043589 26343 master.cpp:1379] Recovering from registrar
I1202 15:18:00.043766 26343 registrar.cpp:309] Recovering registrar
I1202 15:18:00.044668 26344 replica.cpp:676] Replica in EMPTY status received a
broadcasted recover request from (21064)@127.0.1.1:33625
I1202 15:18:00.045027 26349 recover.cpp:195] Received a recover response from a
replica in EMPTY status
I1202 15:18:00.045497 26349 recover.cpp:566] Updating replica status to STARTING
I1202 15:18:00.055539 26349 leveldb.cpp:306] Persisting metadata (8 bytes) to
leveldb took 9.859161ms
I1202 15:18:00.055599 26349 replica.cpp:323] Persisted replica status to
STARTING
I1202 15:18:00.055958 26346 recover.cpp:475] Replica is in STARTING status
I1202 15:18:00.057106 26342 replica.cpp:676] Replica in STARTING status
received a broadcasted recover request from (21065)@127.0.1.1:33625
I1202 15:18:00.057462 26343 recover.cpp:195] Received a recover response from a
replica in STARTING status
I1202 15:18:00.057886 26347 recover.cpp:566] Updating replica status to VOTING
I1202 15:18:00.058706 26345 leveldb.cpp:306] Persisting metadata (8 bytes) to
leveldb took 634303ns
I1202 15:18:00.058724 26345 replica.cpp:323] Persisted replica status to VOTING
I1202 15:18:00.058821 26345 recover.cpp:580] Successfully joined the Paxos group
I1202 15:18:00.058980 26345 recover.cpp:464] Recover process terminated
I1202 15:18:00.059288 26348 log.cpp:661] Attempting to start the writer
I1202 15:18:00.060330 26342 replica.cpp:496] Replica received implicit promise
request from (21066)@127.0.1.1:33625 with proposal 1
I1202 15:18:00.061751 26342 leveldb.cpp:306] Persisting metadata (8 bytes) to
leveldb took 1.395961ms
I1202 15:18:00.061774 26342 replica.cpp:345] Persisted promised to 1
I1202 15:18:00.062237 26342 coordinator.cpp:240] Coordinator attempting to fill
missing positions
I1202 15:18:00.063148 26342 replica.cpp:391] Replica received explicit promise
request from (21067)@127.0.1.1:33625 for position 0 with proposal 2
I1202 15:18:00.064757 26342 leveldb.cpp:343] Persisting action (8 bytes) to
leveldb took 1.581382ms
I1202 15:18:00.064785 26342 replica.cpp:715] Persisted action at 0
I1202 15:18:00.065717 26342 replica.cpp:540] Replica received write request for
position 0 from (21068)@127.0.1.1:33625
I1202 15:18:00.065758 26342 leveldb.cpp:438] Reading position from leveldb took
21294ns
I1202 15:18:00.066664 26342 leveldb.cpp:343] Persisting action (14 bytes) to
leveldb took 875354ns
I1202 15:18:00.066699 26342 replica.cpp:715] Persisted action at 0
I1202 15:18:00.067416 26349 replica.cpp:694] Replica received learned notice
for position 0 from @0.0.0.0:0
I1202 15:18:00.068152 26349 leveldb.cpp:343] Persisting action (16 bytes) to
leveldb took 682342ns
I1202 15:18:00.068188 26349 replica.cpp:715] Persisted action at 0
I1202 15:18:00.068208 26349 replica.cpp:700] Replica learned NOP action at
position 0
I1202 15:18:00.068622 26345 log.cpp:677] Writer started with ending position 0
I1202 15:18:00.069576 26345 leveldb.cpp:438] Reading position from leveldb took
79910ns
I1202 15:18:00.070322 26349 registrar.cpp:342] Successfully fetched the
registry (0B) in 26528us
I1202 15:18:00.070417 26349 registrar.cpp:441] Applied 1 operations in 27033ns;
attempting to update the 'registry'
I1202 15:18:00.071035 26349 log.cpp:685] Attempting to append 187 bytes to the
log
I1202 15:18:00.071144 26347 coordinator.cpp:350] Coordinator attempting to
write APPEND action at position 1
I1202 15:18:00.071885 26347 replica.cpp:540] Replica received write request for
position 1 from (21069)@127.0.1.1:33625
I1202 15:18:00.072844 26347 leveldb.cpp:343] Persisting action (206 bytes) to
leveldb took 929311ns
I1202 15:18:00.072862 26347 replica.cpp:715] Persisted action at 1
I1202 15:18:00.073323 26344 replica.cpp:694] Replica received learned notice
for position 1 from @0.0.0.0:0
I1202 15:18:00.073979 26344 leveldb.cpp:343] Persisting action (208 bytes) to
leveldb took 637468ns
I1202 15:18:00.073995 26344 replica.cpp:715] Persisted action at 1
I1202 15:18:00.074007 26344 replica.cpp:700] Replica learned APPEND action at
position 1
I1202 15:18:00.075078 26344 registrar.cpp:486] Successfully updated the
'registry' in 4.587008ms
I1202 15:18:00.075166 26344 registrar.cpp:372] Successfully recovered registrar
I1202 15:18:00.075309 26344 log.cpp:704] Attempting to truncate the log to 1
I1202 15:18:00.075595 26344 master.cpp:1416] Recovered 0 slaves from the
Registry (148B) ; allowing 10mins for slaves to re-register
I1202 15:18:00.075649 26344 coordinator.cpp:350] Coordinator attempting to
write TRUNCATE action at position 2
I1202 15:18:00.076445 26344 replica.cpp:540] Replica received write request for
position 2 from (21070)@127.0.1.1:33625
I1202 15:18:00.077129 26344 leveldb.cpp:343] Persisting action (16 bytes) to
leveldb took 660682ns
I1202 15:18:00.077177 26344 replica.cpp:715] Persisted action at 2
I1202 15:18:00.077822 26344 replica.cpp:694] Replica received learned notice
for position 2 from @0.0.0.0:0
I1202 15:18:00.078547 26344 leveldb.cpp:343] Persisting action (18 bytes) to
leveldb took 527711ns
I1202 15:18:00.078614 26344 leveldb.cpp:401] Deleting ~1 keys from leveldb took
21673ns
I1202 15:18:00.078631 26344 replica.cpp:715] Persisted action at 2
I1202 15:18:00.078650 26344 replica.cpp:700] Replica learned TRUNCATE action at
position 2
I1202 15:18:00.087874 26328 containerizer.cpp:142] Using isolation:
cgroups/cpu,cgroups/mem,filesystem/posix
I1202 15:18:00.891749 26328 linux_launcher.cpp:103] Using
/sys/fs/cgroup/freezer as the freezer hierarchy for the Linux launcher
I1202 15:18:00.897735 26328 systemd.cpp:210] Started systemd slice
`mesos_executors.slice`
I1202 15:18:00.917435 26343 slave.cpp:191] Slave started on 655)@127.0.1.1:33625
I1202 15:18:00.917466 26343 slave.cpp:192] Flags at startup:
--appc_store_dir="/tmp/mesos/store/appc" --authenticatee="crammd5"
--cgroups_cpu_enable_pids_and_tids_count="false" --cgroups_enable_cfs="false"
--cgroups_hierarchy="/sys/fs/cgroup" --cgroups_limit_swap="false"
--cgroups_root="mesos_test_d5b6bb71-dab3-4457-ae03-84a046ed9c62"
--container_disk_watch_interval="15secs" --containerizers="mesos"
--credential="/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/credential"
--default_role="*" --disk_watch_interval="1mins" --docker="docker"
--docker_auth_server="auth.docker.io" --docker_auth_server_port="443"
--docker_kill_orphans="true"
--docker_local_archives_dir="/tmp/mesos/images/docker" --docker_puller="local"
--docker_puller_timeout="60" --docker_registry="registry-1.docker.io"
--docker_registry_port="443" --docker_remove_delay="6hrs"
--docker_socket="/var/run/docker.sock" --docker_stop_timeout="0ns"
--docker_store_dir="/tmp/mesos/store/docker"
--enforce_container_disk_quota="false" --executor_registration_timeout="1mins"
--executor_shutdown_grace_period="5secs"
--fetcher_cache_dir="/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/fetch"
--fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks"
--gc_disk_headroom="0.1" --hadoop_home="" --help="false"
--hostname_lookup="true" --image_provisioner_backend="copy"
--initialize_driver_logging="true" --isolation="cgroups/cpu,cgroups/mem"
--launcher_dir="/home/alexander/Documents/workspace/mesos/build/src"
--logbufsecs="0" --logging_level="INFO"
--oversubscribed_resources_interval="15secs" --perf_duration="10secs"
--perf_interval="1mins" --qos_correction_interval_min="0ns" --quiet="false"
--recover="reconnect" --recovery_timeout="15mins"
--registration_backoff_factor="10ms"
--resources="cpus:2;mem:1024;disk:1024;ports:[31000-32000]"
--revocable_cpu_low_priority="true" --sandbox_directory="/mnt/mesos/sandbox"
--slave_subsystems="memory,cpuacct" --strict="false" --switch_user="true"
--systemd_runtime_directory="/run/systemd/system" --version="false"
--work_dir="/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx"
I1202 15:18:00.917687 26343 slave.cpp:212] Moving slave process into its own
cgroup for subsystem: memory
I1202 15:18:00.919559 26328 sched.cpp:166] Version: 0.26.0
I1202 15:18:00.921807 26344 sched.cpp:264] New master detected at
[email protected]:33625
I1202 15:18:00.921869 26344 sched.cpp:320] Authenticating with master
[email protected]:33625
I1202 15:18:00.921880 26344 sched.cpp:327] Using default CRAM-MD5 authenticatee
I1202 15:18:00.922087 26344 authenticatee.cpp:123] Creating new client SASL
connection
I1202 15:18:00.922412 26348 master.cpp:5150] Authenticating
[email protected]:33625
I1202 15:18:00.922798 26348 authenticator.cpp:100] Creating new server SASL
connection
I1202 15:18:00.922977 26344 authenticatee.cpp:214] Received SASL authentication
mechanisms: CRAM-MD5
I1202 15:18:00.922999 26344 authenticatee.cpp:240] Attempting to authenticate
with mechanism 'CRAM-MD5'
I1202 15:18:00.923074 26344 authenticator.cpp:205] Received SASL authentication
start
I1202 15:18:00.923105 26344 authenticator.cpp:327] Authentication requires more
steps
I1202 15:18:00.923151 26344 authenticatee.cpp:260] Received SASL authentication
step
I1202 15:18:00.923216 26344 authenticator.cpp:233] Received SASL authentication
step
I1202 15:18:00.923282 26344 authenticator.cpp:319] Authentication success
I1202 15:18:00.923379 26344 authenticatee.cpp:300] Authentication success
I1202 15:18:00.923432 26344 master.cpp:5180] Successfully authenticated
principal 'test-principal' at
[email protected]:33625
I1202 15:18:00.923672 26344 sched.cpp:409] Successfully authenticated with
master [email protected]:33625
I1202 15:18:00.923964 26349 master.cpp:2176] Received SUBSCRIBE call for
framework 'default' at
[email protected]:33625
I1202 15:18:00.924010 26349 master.cpp:1645] Authorizing framework principal
'test-principal' to receive offers for role '*'
I1202 15:18:00.924242 26349 master.cpp:2247] Subscribing framework default with
checkpointing enabled and capabilities [ ]
I1202 15:18:00.924561 26344 hierarchical.cpp:195] Added framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:00.924584 26349 sched.cpp:643] Framework registered with
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:01.209614 26343 slave.cpp:212] Moving slave process into its own
cgroup for subsystem: cpuacct
I1202 15:18:01.409137 26343 credentials.hpp:85] Loading credential for
authentication from '/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/credential'
I1202 15:18:01.409307 26343 slave.cpp:322] Slave using credential for:
test-principal
I1202 15:18:01.409860 26343 slave.cpp:392] Slave resources: cpus(*):2;
mem(*):1024; disk(*):1024; ports(*):[31000-32000]
I1202 15:18:01.409906 26343 slave.cpp:400] Slave attributes: [ ]
I1202 15:18:01.409927 26343 slave.cpp:405] Slave hostname: debian-vm.localdomain
I1202 15:18:01.409932 26343 slave.cpp:410] Slave checkpoint: true
I1202 15:18:01.410773 26346 state.cpp:54] Recovering state from
'/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/meta'
I1202 15:18:01.411038 26346 status_update_manager.cpp:202] Recovering status
update manager
I1202 15:18:01.411185 26346 containerizer.cpp:384] Recovering containerizer
I1202 15:18:01.473423 26349 slave.cpp:4230] Finished recovery
I1202 15:18:01.474261 26344 slave.cpp:729] New master detected at
[email protected]:33625
I1202 15:18:01.474325 26342 status_update_manager.cpp:176] Pausing sending
status updates
I1202 15:18:01.474383 26344 slave.cpp:792] Authenticating with master
[email protected]:33625
I1202 15:18:01.474417 26344 slave.cpp:797] Using default CRAM-MD5 authenticatee
I1202 15:18:01.474566 26344 slave.cpp:765] Detecting new master
I1202 15:18:01.474706 26345 authenticatee.cpp:123] Creating new client SASL
connection
I1202 15:18:01.475159 26345 master.cpp:5150] Authenticating
slave(655)@127.0.1.1:33625
I1202 15:18:01.475553 26345 authenticator.cpp:100] Creating new server SASL
connection
I1202 15:18:01.475754 26342 authenticatee.cpp:214] Received SASL authentication
mechanisms: CRAM-MD5
I1202 15:18:01.475793 26342 authenticatee.cpp:240] Attempting to authenticate
with mechanism 'CRAM-MD5'
I1202 15:18:01.475867 26342 authenticator.cpp:205] Received SASL authentication
start
I1202 15:18:01.475903 26342 authenticator.cpp:327] Authentication requires more
steps
I1202 15:18:01.475989 26342 authenticatee.cpp:260] Received SASL authentication
step
I1202 15:18:01.476095 26342 authenticator.cpp:233] Received SASL authentication
step
I1202 15:18:01.476172 26342 authenticator.cpp:319] Authentication success
I1202 15:18:01.476294 26343 authenticatee.cpp:300] Authentication success
I1202 15:18:01.476307 26349 master.cpp:5180] Successfully authenticated
principal 'test-principal' at slave(655)@127.0.1.1:33625
I1202 15:18:01.476681 26345 slave.cpp:860] Successfully authenticated with
master [email protected]:33625
I1202 15:18:01.476958 26343 master.cpp:3859] Registering slave at
slave(655)@127.0.1.1:33625 (debian-vm.localdomain) with id
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0
I1202 15:18:01.477411 26345 registrar.cpp:441] Applied 1 operations in 62621ns;
attempting to update the 'registry'
I1202 15:18:01.478070 26345 log.cpp:685] Attempting to append 365 bytes to the
log
I1202 15:18:01.478272 26345 coordinator.cpp:350] Coordinator attempting to
write APPEND action at position 3
I1202 15:18:01.479032 26345 replica.cpp:540] Replica received write request for
position 3 from (21089)@127.0.1.1:33625
I1202 15:18:01.479346 26344 master.cpp:3847] Ignoring register slave message
from slave(655)@127.0.1.1:33625 (debian-vm.localdomain) as admission is already
in progress
I1202 15:18:01.488145 26345 leveldb.cpp:343] Persisting action (384 bytes) to
leveldb took 8.718277ms
I1202 15:18:01.488211 26345 replica.cpp:715] Persisted action at 3
I1202 15:18:01.489114 26345 replica.cpp:694] Replica received learned notice
for position 3 from @0.0.0.0:0
I1202 15:18:01.489850 26345 leveldb.cpp:343] Persisting action (386 bytes) to
leveldb took 620665ns
I1202 15:18:01.489914 26345 replica.cpp:715] Persisted action at 3
I1202 15:18:01.489971 26345 replica.cpp:700] Replica learned APPEND action at
position 3
I1202 15:18:01.491174 26347 registrar.cpp:486] Successfully updated the
'registry' in 13.647104ms
I1202 15:18:01.491349 26345 log.cpp:704] Attempting to truncate the log to 3
I1202 15:18:01.491489 26345 coordinator.cpp:350] Coordinator attempting to
write TRUNCATE action at position 4
I1202 15:18:01.491860 26347 master.cpp:3927] Registered slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 at slave(655)@127.0.1.1:33625
(debian-vm.localdomain) with cpus(*):2; mem(*):1024; disk(*):1024;
ports(*):[31000-32000]
I1202 15:18:01.492015 26345 hierarchical.cpp:344] Added slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 (debian-vm.localdomain) with cpus(*):2;
mem(*):1024; disk(*):1024; ports(*):[31000-32000] (allocated: )
I1202 15:18:01.492398 26347 replica.cpp:540] Replica received write request for
position 4 from (21090)@127.0.1.1:33625
I1202 15:18:01.492027 26348 slave.cpp:904] Registered with master
[email protected]:33625; given slave ID baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0
I1202 15:18:01.492795 26345 master.cpp:4979] Sending 1 offers to framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 (default) at
[email protected]:33625
I1202 15:18:01.492897 26345 status_update_manager.cpp:183] Resuming sending
status updates
I1202 15:18:01.493070 26348 slave.cpp:963] Forwarding total oversubscribed
resources
I1202 15:18:01.493188 26345 master.cpp:4269] Received update of slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 at slave(655)@127.0.1.1:33625
(debian-vm.localdomain) with total oversubscribed resources
I1202 15:18:01.493386 26348 hierarchical.cpp:400] Slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 (debian-vm.localdomain) updated with
oversubscribed resources (total: cpus(*):2; mem(*):1024; disk(*):1024;
ports(*):[31000-32000], allocated: cpus(*):2; mem(*):1024; disk(*):1024;
ports(*):[31000-32000])
I1202 15:18:01.494815 26344 master.cpp:2915] Processing ACCEPT call for offers:
[ baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-O0 ] on slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 at slave(655)@127.0.1.1:33625
(debian-vm.localdomain) for framework baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
(default) at [email protected]:33625
I1202 15:18:01.494904 26344 master.cpp:2711] Authorizing framework principal
'test-principal' to launch task b2102462-a9c1-45bf-94f2-9a59abb36e73 as user
'root'
I1202 15:18:01.495087 26347 leveldb.cpp:343] Persisting action (16 bytes) to
leveldb took 2.635152ms
I1202 15:18:01.495126 26347 replica.cpp:715] Persisted action at 4
I1202 15:18:01.495736 26342 replica.cpp:694] Replica received learned notice
for position 4 from @0.0.0.0:0
I1202 15:18:01.496106 26347 master.hpp:176] Adding task
b2102462-a9c1-45bf-94f2-9a59abb36e73 with resources cpus(*):2; mem(*):1024;
disk(*):1024; ports(*):[31000-32000] on slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 (debian-vm.localdomain)
I1202 15:18:01.496330 26347 master.cpp:3245] Launching task
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 (default) at
[email protected]:33625 with resources
cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] on slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 at slave(655)@127.0.1.1:33625
(debian-vm.localdomain)
I1202 15:18:01.496820 26344 slave.cpp:1294] Got assigned task
b2102462-a9c1-45bf-94f2-9a59abb36e73 for framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:01.497508 26344 slave.cpp:1410] Launching task
b2102462-a9c1-45bf-94f2-9a59abb36e73 for framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:01.498034 26344 paths.cpp:436] Trying to chown
'/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/slaves/baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0/frameworks/baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000/executors/b2102462-a9c1-45bf-94f2-9a59abb36e73/runs/ceb8eefc-8de6-461c-add8-2b22666a1617'
to user 'root'
I1202 15:18:01.497663 26342 leveldb.cpp:343] Persisting action (18 bytes) to
leveldb took 1.863299ms
I1202 15:18:01.505702 26342 leveldb.cpp:401] Deleting ~2 keys from leveldb took
86618ns
I1202 15:18:01.505772 26342 replica.cpp:715] Persisted action at 4
I1202 15:18:01.505803 26342 replica.cpp:700] Replica learned TRUNCATE action at
position 4
I1202 15:18:01.508184 26344 slave.cpp:4999] Launching executor
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 with resources cpus(*):0.1; mem(*):32
in work directory
'/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/slaves/baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0/frameworks/baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000/executors/b2102462-a9c1-45bf-94f2-9a59abb36e73/runs/ceb8eefc-8de6-461c-add8-2b22666a1617'
I1202 15:18:01.508643 26347 containerizer.cpp:618] Starting container
'ceb8eefc-8de6-461c-add8-2b22666a1617' for executor
'b2102462-a9c1-45bf-94f2-9a59abb36e73' of framework
'baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000'
I1202 15:18:01.508885 26344 slave.cpp:1628] Queuing task
'b2102462-a9c1-45bf-94f2-9a59abb36e73' for executor
'b2102462-a9c1-45bf-94f2-9a59abb36e73' of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:01.575203 26349 cpushare.cpp:392] Updated 'cpu.shares' to 2150
(cpus 2.1) for container ceb8eefc-8de6-461c-add8-2b22666a1617
I1202 15:18:01.639154 26347 mem.cpp:605] Started listening for OOM events for
container ceb8eefc-8de6-461c-add8-2b22666a1617
2015-12-02
15:18:01,650:26328(0x7f9841de9700):ZOO_ERROR@handle_socket_error_msg@1697:
Socket [127.0.0.1:52030] zk retcode=-4, errno=111(Connection refused): server
refused to accept the client
I1202 15:18:01.656420 26347 mem.cpp:725] Started listening on low memory
pressure events for container ceb8eefc-8de6-461c-add8-2b22666a1617
I1202 15:18:01.678865 26347 mem.cpp:725] Started listening on medium memory
pressure events for container ceb8eefc-8de6-461c-add8-2b22666a1617
I1202 15:18:01.713045 26347 mem.cpp:725] Started listening on critical memory
pressure events for container ceb8eefc-8de6-461c-add8-2b22666a1617
I1202 15:18:01.729801 26347 mem.cpp:356] Updated 'memory.soft_limit_in_bytes'
to 1056MB for container ceb8eefc-8de6-461c-add8-2b22666a1617
I1202 15:18:01.766522 26347 mem.cpp:391] Updated 'memory.limit_in_bytes' to
1056MB for container ceb8eefc-8de6-461c-add8-2b22666a1617
2015-12-02
15:18:01,786:26328(0x7f983adf7700):ZOO_ERROR@handle_socket_error_msg@1697:
Socket [127.0.0.1:56378] zk retcode=-4, errno=111(Connection refused): server
refused to accept the client
I1202 15:18:01.811028 26345 linux_launcher.cpp:365] Cloning child process with
flags =
I1202 15:18:01.850016 26345 linux_launcher.cpp:422] Assigned child process
'14143' to 'mesos_executors.slice'
I1202 15:18:01.850262 26345 containerizer.cpp:851] Checkpointing executor's
forked pid 14143 to
'/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/meta/slaves/baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0/frameworks/baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000/executors/b2102462-a9c1-45bf-94f2-9a59abb36e73/runs/ceb8eefc-8de6-461c-add8-2b22666a1617/pids/forked.pid'
I1202 15:18:01.944136 14157 exec.cpp:136] Version: 0.26.0
I1202 15:18:01.946939 26343 slave.cpp:2405] Got registration for executor
'b2102462-a9c1-45bf-94f2-9a59abb36e73' of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 from executor(1)@127.0.1.1:57954
I1202 15:18:01.948669 14177 exec.cpp:210] Executor registered on slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0
Registered executor on debian-vm.localdomain
I1202 15:18:01.967314 26347 mem.cpp:356] Updated 'memory.soft_limit_in_bytes'
to 1056MB for container ceb8eefc-8de6-461c-add8-2b22666a1617
I1202 15:18:01.971539 26344 cpushare.cpp:392] Updated 'cpu.shares' to 2150
(cpus 2.1) for container ceb8eefc-8de6-461c-add8-2b22666a1617
I1202 15:18:01.985469 26344 slave.cpp:1793] Sending queued task
'b2102462-a9c1-45bf-94f2-9a59abb36e73' to executor
'b2102462-a9c1-45bf-94f2-9a59abb36e73' of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 at executor(1)@127.0.1.1:57954
Starting task b2102462-a9c1-45bf-94f2-9a59abb36e73
Forked command at 14180
sh -c 'sleep 1000'
I1202 15:18:02.001322 26347 slave.cpp:2762] Handling status update TASK_RUNNING
(UUID: 444a54f5-32d6-49e5-84c6-c2729395428e) for task
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 from executor(1)@127.0.1.1:57954
I1202 15:18:02.001744 26347 status_update_manager.cpp:322] Received status
update TASK_RUNNING (UUID: 444a54f5-32d6-49e5-84c6-c2729395428e) for task
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:02.002167 26347 status_update_manager.cpp:826] Checkpointing UPDATE
for status update TASK_RUNNING (UUID: 444a54f5-32d6-49e5-84c6-c2729395428e) for
task b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:02.013846 26349 slave.cpp:3087] Forwarding the update TASK_RUNNING
(UUID: 444a54f5-32d6-49e5-84c6-c2729395428e) for task
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 to [email protected]:33625
I1202 15:18:02.014194 26349 slave.cpp:3011] Sending acknowledgement for status
update TASK_RUNNING (UUID: 444a54f5-32d6-49e5-84c6-c2729395428e) for task
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 to executor(1)@127.0.1.1:57954
I1202 15:18:02.014359 26347 master.cpp:4414] Status update TASK_RUNNING (UUID:
444a54f5-32d6-49e5-84c6-c2729395428e) for task
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 from slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 at slave(655)@127.0.1.1:33625
(debian-vm.localdomain)
I1202 15:18:02.014411 26347 master.cpp:4462] Forwarding status update
TASK_RUNNING (UUID: 444a54f5-32d6-49e5-84c6-c2729395428e) for task
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:02.014533 26347 master.cpp:6066] Updating the state of task
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 (latest state: TASK_RUNNING, status
update state: TASK_RUNNING)
I1202 15:18:02.015163 26347 master.cpp:3571] Processing ACKNOWLEDGE call
444a54f5-32d6-49e5-84c6-c2729395428e for task
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 (default) at
[email protected]:33625 on slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0
I1202 15:18:02.015419 26347 status_update_manager.cpp:394] Received status
update acknowledgement (UUID: 444a54f5-32d6-49e5-84c6-c2729395428e) for task
b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:02.015508 26347 status_update_manager.cpp:826] Checkpointing ACK
for status update TASK_RUNNING (UUID: 444a54f5-32d6-49e5-84c6-c2729395428e) for
task b2102462-a9c1-45bf-94f2-9a59abb36e73 of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:02.016258 26328 slave.cpp:601] Slave terminating
I1202 15:18:02.016489 26345 master.cpp:1083] Slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 at slave(655)@127.0.1.1:33625
(debian-vm.localdomain) disconnected
I1202 15:18:02.016530 26345 master.cpp:2531] Disconnecting slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 at slave(655)@127.0.1.1:33625
(debian-vm.localdomain)
I1202 15:18:02.017040 26345 master.cpp:2550] Deactivating slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 at slave(655)@127.0.1.1:33625
(debian-vm.localdomain)
I1202 15:18:02.017151 26344 hierarchical.cpp:429] Slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 deactivated
I1202 15:18:02.089155 14174 exec.cpp:383] Executor asked to shutdown
Shutting down
Sending SIGTERM to process tree at pid 14180
Killing the following process trees:
[
-+- 14180 sh -c sleep 1000
\--- 14181 sleep 1000
]
Command terminated with signal Terminated (pid: 14180)
I1202 15:18:03.298529 26328 containerizer.cpp:142] Using isolation:
cgroups/cpu,cgroups/mem,filesystem/posix
I1202 15:18:04.043941 26328 linux_launcher.cpp:103] Using
/sys/fs/cgroup/freezer as the freezer hierarchy for the Linux launcher
I1202 15:18:04.050012 26328 systemd.cpp:210] Started systemd slice
`mesos_executors.slice`
I1202 15:18:04.072232 26344 slave.cpp:191] Slave started on 656)@127.0.1.1:33625
I1202 15:18:04.072262 26344 slave.cpp:192] Flags at startup:
--appc_store_dir="/tmp/mesos/store/appc" --authenticatee="crammd5"
--cgroups_cpu_enable_pids_and_tids_count="false" --cgroups_enable_cfs="false"
--cgroups_hierarchy="/sys/fs/cgroup" --cgroups_limit_swap="false"
--cgroups_root="mesos_test_d5b6bb71-dab3-4457-ae03-84a046ed9c62"
--container_disk_watch_interval="15secs" --containerizers="mesos"
--credential="/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/credential"
--default_role="*" --disk_watch_interval="1mins" --docker="docker"
--docker_auth_server="auth.docker.io" --docker_auth_server_port="443"
--docker_kill_orphans="true"
--docker_local_archives_dir="/tmp/mesos/images/docker" --docker_puller="local"
--docker_puller_timeout="60" --docker_registry="registry-1.docker.io"
--docker_registry_port="443" --docker_remove_delay="6hrs"
--docker_socket="/var/run/docker.sock" --docker_stop_timeout="0ns"
--docker_store_dir="/tmp/mesos/store/docker"
--enforce_container_disk_quota="false" --executor_registration_timeout="1mins"
--executor_shutdown_grace_period="5secs"
--fetcher_cache_dir="/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/fetch"
--fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks"
--gc_disk_headroom="0.1" --hadoop_home="" --help="false"
--hostname_lookup="true" --image_provisioner_backend="copy"
--initialize_driver_logging="true" --isolation="cgroups/cpu,cgroups/mem"
--launcher_dir="/home/alexander/Documents/workspace/mesos/build/src"
--logbufsecs="0" --logging_level="INFO"
--oversubscribed_resources_interval="15secs" --perf_duration="10secs"
--perf_interval="1mins" --qos_correction_interval_min="0ns" --quiet="false"
--recover="reconnect" --recovery_timeout="15mins"
--registration_backoff_factor="10ms"
--resources="cpus:2;mem:1024;disk:1024;ports:[31000-32000]"
--revocable_cpu_low_priority="true" --sandbox_directory="/mnt/mesos/sandbox"
--slave_subsystems="memory,cpuacct" --strict="false" --switch_user="true"
--systemd_runtime_directory="/run/systemd/system" --version="false"
--work_dir="/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx"
I1202 15:18:04.072510 26344 slave.cpp:212] Moving slave process into its own
cgroup for subsystem: memory
I1202 15:18:04.334131 26344 slave.cpp:212] Moving slave process into its own
cgroup for subsystem: cpuacct
I1202 15:18:04.516194 26344 credentials.hpp:85] Loading credential for
authentication from '/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/credential'
I1202 15:18:04.516338 26344 slave.cpp:322] Slave using credential for:
test-principal
I1202 15:18:04.516819 26344 slave.cpp:392] Slave resources: cpus(*):2;
mem(*):1024; disk(*):1024; ports(*):[31000-32000]
I1202 15:18:04.516865 26344 slave.cpp:400] Slave attributes: [ ]
I1202 15:18:04.516873 26344 slave.cpp:405] Slave hostname: debian-vm.localdomain
I1202 15:18:04.516878 26344 slave.cpp:410] Slave checkpoint: true
I1202 15:18:04.517696 26346 state.cpp:54] Recovering state from
'/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/meta'
I1202 15:18:04.517777 26346 state.cpp:681] No checkpointed resources found at
'/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/meta/resources/resources.info'
I1202 15:18:04.517849 26346 state.cpp:85] Slave host rebooted
I1202 15:18:04.518209 26346 status_update_manager.cpp:202] Recovering status
update manager
I1202 15:18:04.518307 26346 containerizer.cpp:384] Recovering containerizer
I1202 15:18:04.592492 26345 containerizer.cpp:522] Removing orphan container
ceb8eefc-8de6-461c-add8-2b22666a1617
I1202 15:18:04.651180 26345 slave.cpp:4230] Finished recovery
I1202 15:18:04.651376 26349 cgroups.cpp:2429] Freezing cgroup
/sys/fs/cgroup/freezer/mesos_test_d5b6bb71-dab3-4457-ae03-84a046ed9c62/ceb8eefc-8de6-461c-add8-2b22666a1617
I1202 15:18:04.651582 26345 slave.cpp:4263] Garbage collecting old slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0
I1202 15:18:04.651935 26345 gc.cpp:56] Scheduling
'/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/slaves/baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0'
for gc 6.99999245687407days in the future
I1202 15:18:04.652065 26345 gc.cpp:56] Scheduling
'/tmp/SlaveRecoveryTest_0_Reboot_qT6DBx/meta/slaves/baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0'
for gc 6.99999245635556days in the future
I1202 15:18:04.652354 26345 slave.cpp:729] New master detected at
[email protected]:33625
I1202 15:18:04.652434 26345 slave.cpp:792] Authenticating with master
[email protected]:33625
I1202 15:18:04.652451 26345 slave.cpp:797] Using default CRAM-MD5 authenticatee
I1202 15:18:04.652549 26345 slave.cpp:765] Detecting new master
I1202 15:18:04.652704 26345 status_update_manager.cpp:176] Pausing sending
status updates
I1202 15:18:04.652803 26345 authenticatee.cpp:123] Creating new client SASL
connection
I1202 15:18:04.653151 26345 master.cpp:5150] Authenticating
slave(656)@127.0.1.1:33625
I1202 15:18:04.653491 26345 authenticator.cpp:100] Creating new server SASL
connection
I1202 15:18:04.654045 26345 authenticatee.cpp:214] Received SASL authentication
mechanisms: CRAM-MD5
I1202 15:18:04.654069 26345 authenticatee.cpp:240] Attempting to authenticate
with mechanism 'CRAM-MD5'
I1202 15:18:04.654127 26345 authenticator.cpp:205] Received SASL authentication
start
I1202 15:18:04.654168 26345 authenticator.cpp:327] Authentication requires more
steps
I1202 15:18:04.654232 26345 authenticatee.cpp:260] Received SASL authentication
step
I1202 15:18:04.654295 26345 authenticator.cpp:233] Received SASL authentication
step
I1202 15:18:04.654358 26345 authenticator.cpp:319] Authentication success
I1202 15:18:04.654491 26345 authenticatee.cpp:300] Authentication success
I1202 15:18:04.654752 26344 slave.cpp:860] Successfully authenticated with
master [email protected]:33625
I1202 15:18:04.654968 26345 master.cpp:5180] Successfully authenticated
principal 'test-principal' at slave(656)@127.0.1.1:33625
I1202 15:18:04.655432 26345 master.cpp:3859] Registering slave at
slave(656)@127.0.1.1:33625 (debian-vm.localdomain) with id
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S1
I1202 15:18:04.656077 26343 registrar.cpp:441] Applied 1 operations in 76820ns;
attempting to update the 'registry'
I1202 15:18:04.657047 26343 log.cpp:685] Attempting to append 540 bytes to the
log
I1202 15:18:04.657225 26343 coordinator.cpp:350] Coordinator attempting to
write APPEND action at position 5
I1202 15:18:04.658035 26343 replica.cpp:540] Replica received write request for
position 5 from (21123)@127.0.1.1:33625
I1202 15:18:04.665920 26343 leveldb.cpp:343] Persisting action (559 bytes) to
leveldb took 7.814853ms
I1202 15:18:04.665997 26343 replica.cpp:715] Persisted action at 5
I1202 15:18:04.666776 26343 replica.cpp:694] Replica received learned notice
for position 5 from @0.0.0.0:0
I1202 15:18:04.667973 26343 leveldb.cpp:343] Persisting action (561 bytes) to
leveldb took 1.08753ms
I1202 15:18:04.668018 26343 replica.cpp:715] Persisted action at 5
I1202 15:18:04.668038 26343 replica.cpp:700] Replica learned APPEND action at
position 5
I1202 15:18:04.672534 26346 registrar.cpp:486] Successfully updated the
'registry' in 16.38784ms
I1202 15:18:04.672734 26343 log.cpp:704] Attempting to truncate the log to 5
I1202 15:18:04.672901 26342 master.cpp:3847] Ignoring register slave message
from slave(656)@127.0.1.1:33625 (debian-vm.localdomain) as admission is already
in progress
I1202 15:18:04.672914 26343 coordinator.cpp:350] Coordinator attempting to
write TRUNCATE action at position 6
I1202 15:18:04.673462 26342 master.cpp:3927] Registered slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S1 at slave(656)@127.0.1.1:33625
(debian-vm.localdomain) with cpus(*):2; mem(*):1024; disk(*):1024;
ports(*):[31000-32000]
I1202 15:18:04.673705 26343 hierarchical.cpp:344] Added slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S1 (debian-vm.localdomain) with cpus(*):2;
mem(*):1024; disk(*):1024; ports(*):[31000-32000] (allocated: )
I1202 15:18:04.673727 26346 replica.cpp:540] Replica received write request for
position 6 from (21124)@127.0.1.1:33625
I1202 15:18:04.674177 26342 slave.cpp:904] Registered with master
[email protected]:33625; given slave ID baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S1
I1202 15:18:04.674335 26347 status_update_manager.cpp:183] Resuming sending
status updates
I1202 15:18:04.674424 26348 master.cpp:4979] Sending 1 offers to framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 (default) at
[email protected]:33625
I1202 15:18:04.674509 26342 slave.cpp:963] Forwarding total oversubscribed
resources
I1202 15:18:04.674677 26348 master.cpp:4269] Received update of slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S1 at slave(656)@127.0.1.1:33625
(debian-vm.localdomain) with total oversubscribed resources
I1202 15:18:04.674923 26343 hierarchical.cpp:400] Slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S1 (debian-vm.localdomain) updated with
oversubscribed resources (total: cpus(*):2; mem(*):1024; disk(*):1024;
ports(*):[31000-32000], allocated: cpus(*):2; mem(*):1024; disk(*):1024;
ports(*):[31000-32000])
I1202 15:18:04.675171 26346 leveldb.cpp:343] Persisting action (16 bytes) to
leveldb took 1.412868ms
I1202 15:18:04.675211 26346 replica.cpp:715] Persisted action at 6
I1202 15:18:04.675493 26328 sched.cpp:1805] Asked to stop the driver
I1202 15:18:04.675585 26328 master.cpp:922] Master terminating
I1202 15:18:04.675717 26346 sched.cpp:1043] Stopping framework
'baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000'
I1202 15:18:04.675988 26346 hierarchical.cpp:373] Removed slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S1
W1202 15:18:04.676062 26328 master.cpp:6118] Removing task
b2102462-a9c1-45bf-94f2-9a59abb36e73 with resources cpus(*):2; mem(*):1024;
disk(*):1024; ports(*):[31000-32000] of framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000 on slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0 at slave(655)@127.0.1.1:33625
(debian-vm.localdomain) in non-terminal state TASK_RUNNING
I1202 15:18:04.676136 26346 hierarchical.cpp:373] Removed slave
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-S0
I1202 15:18:04.676923 26347 hierarchical.cpp:230] Removed framework
baeb70c6-c960-4d0d-9dc7-48b9a54ef8a8-0000
I1202 15:18:04.677057 26346 slave.cpp:3215] [email protected]:33625 exited
W1202 15:18:04.677093 26346 slave.cpp:3218] Master disconnected! Waiting for a
new master to be elected
I1202 15:18:04.678817 26343 replica.cpp:694] Replica received learned notice
for position 6 from @0.0.0.0:0
I1202 15:18:04.679985 26343 leveldb.cpp:343] Persisting action (18 bytes) to
leveldb took 1.113234ms
I1202 15:18:04.680058 26343 leveldb.cpp:401] Deleting ~2 keys from leveldb took
25679ns
I1202 15:18:04.680094 26343 replica.cpp:715] Persisted action at 6
I1202 15:18:04.680116 26343 replica.cpp:700] Replica learned TRUNCATE action at
position 6
I1202 15:18:04.681684 26348 slave.cpp:601] Slave terminating
I1202 15:18:04.721125 26349 cgroups.cpp:1411] Successfully froze cgroup
/sys/fs/cgroup/freezer/mesos_test_d5b6bb71-dab3-4457-ae03-84a046ed9c62/ceb8eefc-8de6-461c-add8-2b22666a1617
after 69.67808ms
I1202 15:18:04.778825 26347 cgroups.cpp:2447] Thawing cgroup
/sys/fs/cgroup/freezer/mesos_test_d5b6bb71-dab3-4457-ae03-84a046ed9c62/ceb8eefc-8de6-461c-add8-2b22666a1617
I1202 15:18:04.843261 26349 cgroups.cpp:1440] Successfullly thawed cgroup
/sys/fs/cgroup/freezer/mesos_test_d5b6bb71-dab3-4457-ae03-84a046ed9c62/ceb8eefc-8de6-461c-add8-2b22666a1617
after 64.342016ms
I1202 15:18:04.854326 26345 cgroups.cpp:2429] Freezing cgroup
/sys/fs/cgroup/freezer/mesos_test_d5b6bb71-dab3-4457-ae03-84a046ed9c62/ceb8eefc-8de6-461c-add8-2b22666a1617
../../src/tests/mesos.cpp:781: Failure
(cgroups::destroy(hierarchy, cgroup)).failure(): Failed to kill tasks in nested
cgroups: Collect failed: Invalid freezer cgroup:
'mesos_test_d5b6bb71-dab3-4457-ae03-84a046ed9c62/ceb8eefc-8de6-461c-add8-2b22666a1617'
is not a valid cgroup
*** Aborted at 1449065884 (unix time) try "date -d @1449065884" if you are
using GNU date ***
PC: @ 0x14b07ae testing::UnitTest::AddTestPartResult()
*** SIGSEGV (@0x0) received by PID 26328 (TID 0x7f9891edb7c0) from PID 0; stack
trace: ***
@ 0x7f9879fc366c os::Linux::chained_handler()
@ 0x7f9879fc7a0a JVM_handle_linux_signal
@ 0x7f988b7f88d0 (unknown)
@ 0x14b07ae testing::UnitTest::AddTestPartResult()
@ 0x14a51e7 testing::internal::AssertHelper::operator=()
@ 0xf564d1 mesos::internal::tests::ContainerizerTest<>::TearDown()
@ 0x14ce2d0
testing::internal::HandleSehExceptionsInMethodIfSupported<>()
@ 0x14c9248
testing::internal::HandleExceptionsInMethodIfSupported<>()
@ 0x14aa5d0 testing::Test::Run()
@ 0x14aad15 testing::TestInfo::Run()
@ 0x14ab350 testing::TestCase::Run()
@ 0x14b1c9f testing::internal::UnitTestImpl::RunAllTests()
@ 0x14cef5f
testing::internal::HandleSehExceptionsInMethodIfSupported<>()
@ 0x14c9d9e
testing::internal::HandleExceptionsInMethodIfSupported<>()
@ 0x14b09cf testing::UnitTest::Run()
@ 0xd63e02 RUN_ALL_TESTS()
@ 0xd639e0 main
@ 0x7f988b461b45 (unknown)
{noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)