[
https://issues.apache.org/jira/browse/MESOS-4739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anand Mazumdar updated MESOS-4739:
----------------------------------
Description:
Showed up on ASF CI:
https://builds.apache.org/job/Mesos/COMPILER=gcc,CONFIGURATION=--verbose,ENVIRONMENT=GLOG_v=1%20MESOS_VERBOSE=1,OS=ubuntu:14.04,label_exp=(docker%7C%7CHadoop)&&(!ubuntu-us1)/1704/consoleFull
{code}
[ RUN ] SlaveRecoveryTest/0.ReconnectHTTPExecutor
I0223 04:54:28.547051 786 leveldb.cpp:174] Opened db in 124.456584ms
I0223 04:54:28.597709 786 leveldb.cpp:181] Compacted db in 50.603402ms
I0223 04:54:28.597779 786 leveldb.cpp:196] Created db iterator in 22429ns
I0223 04:54:28.597797 786 leveldb.cpp:202] Seeked to beginning of db in 2279ns
I0223 04:54:28.597810 786 leveldb.cpp:271] Iterated through 0 keys in the db
in 265ns
I0223 04:54:28.597859 786 replica.cpp:779] Replica recovered with log
positions 0 -> 0 with 1 holes and 0 unlearned
I0223 04:54:28.598731 807 recover.cpp:447] Starting replica recovery
I0223 04:54:28.599493 807 recover.cpp:473] Replica is in EMPTY status
I0223 04:54:28.601400 815 replica.cpp:673] Replica in EMPTY status received a
broadcasted recover request from (9593)@172.17.0.2:44225
I0223 04:54:28.601776 818 recover.cpp:193] Received a recover response from a
replica in EMPTY status
I0223 04:54:28.602247 809 recover.cpp:564] Updating replica status to STARTING
I0223 04:54:28.603353 811 master.cpp:376] Master
81a295fc-fe1b-4ff8-9291-cd54f5c6f303 (5847d87ad902) started on 172.17.0.2:44225
I0223 04:54:28.603376 811 master.cpp:378] Flags at startup: --acls=""
--allocation_interval="1secs" --allocator="HierarchicalDRF"
--authenticate="true" --authenticate_http="true" --authenticate_slaves="true"
--authenticators="crammd5" --authorizers="local"
--credentials="/tmp/f6d1qA/credentials" --framework_sorter="drf" --help="false"
--hostname_lookup="true" --http_authenticators="basic"
--initialize_driver_logging="true" --log_auto_initialize="true"
--logbufsecs="0" --logging_level="INFO" --max_completed_frameworks="50"
--max_completed_tasks_per_framework="1000" --max_slave_ping_timeouts="5"
--quiet="false" --recovery_slave_removal_limit="100%"
--registry="replicated_log" --registry_fetch_timeout="1mins"
--registry_store_timeout="100secs" --registry_strict="true"
--root_submissions="true" --slave_ping_timeout="15secs"
--slave_reregister_timeout="10mins" --user_sorter="drf" --version="false"
--webui_dir="/mesos/mesos-0.28.0/_inst/share/mesos/webui"
--work_dir="/tmp/f6d1qA/master" --zk_session_timeout="10secs"
I0223 04:54:28.603906 811 master.cpp:423] Master only allowing authenticated
frameworks to register
I0223 04:54:28.603920 811 master.cpp:428] Master only allowing authenticated
slaves to register
I0223 04:54:28.603930 811 credentials.hpp:35] Loading credentials for
authentication from '/tmp/f6d1qA/credentials'
I0223 04:54:28.604317 811 master.cpp:468] Using default 'crammd5'
authenticator
I0223 04:54:28.604506 811 master.cpp:537] Using default 'basic' HTTP
authenticator
I0223 04:54:28.604635 811 master.cpp:571] Authorization enabled
I0223 04:54:28.604918 808 whitelist_watcher.cpp:77] No whitelist given
I0223 04:54:28.605023 819 hierarchical.cpp:144] Initialized hierarchical
allocator process
I0223 04:54:28.608273 812 master.cpp:1712] The newly elected leader is
master@172.17.0.2:44225 with id 81a295fc-fe1b-4ff8-9291-cd54f5c6f303
I0223 04:54:28.608314 812 master.cpp:1725] Elected as the leading master!
I0223 04:54:28.608333 812 master.cpp:1470] Recovering from registrar
I0223 04:54:28.608610 812 registrar.cpp:307] Recovering registrar
I0223 04:54:28.631079 817 leveldb.cpp:304] Persisting metadata (8 bytes) to
leveldb took 28.524027ms
I0223 04:54:28.631156 817 replica.cpp:320] Persisted replica status to
STARTING
I0223 04:54:28.631431 810 recover.cpp:473] Replica is in STARTING status
I0223 04:54:28.632550 819 replica.cpp:673] Replica in STARTING status
received a broadcasted recover request from (9595)@172.17.0.2:44225
I0223 04:54:28.632968 816 recover.cpp:193] Received a recover response from a
replica in STARTING status
I0223 04:54:28.633414 807 recover.cpp:564] Updating replica status to VOTING
I0223 04:54:28.660815 818 leveldb.cpp:304] Persisting metadata (8 bytes) to
leveldb took 27.23603ms
I0223 04:54:28.660898 818 replica.cpp:320] Persisted replica status to VOTING
I0223 04:54:28.661111 809 recover.cpp:578] Successfully joined the Paxos group
I0223 04:54:28.661368 813 recover.cpp:462] Recover process terminated
I0223 04:54:28.663571 816 log.cpp:659] Attempting to start the writer
I0223 04:54:28.665334 817 replica.cpp:493] Replica received implicit promise
request from (9596)@172.17.0.2:44225 with proposal 1
I0223 04:54:28.685936 817 leveldb.cpp:304] Persisting metadata (8 bytes) to
leveldb took 20.539463ms
I0223 04:54:28.686019 817 replica.cpp:342] Persisted promised to 1
I0223 04:54:28.687299 815 coordinator.cpp:238] Coordinator attempting to fill
missing positions
I0223 04:54:28.689152 807 replica.cpp:388] Replica received explicit promise
request from (9597)@172.17.0.2:44225 for position 0 with proposal 2
I0223 04:54:28.711225 807 leveldb.cpp:341] Persisting action (8 bytes) to
leveldb took 22.011554ms
I0223 04:54:28.711299 807 replica.cpp:712] Persisted action at 0
I0223 04:54:28.712512 808 replica.cpp:537] Replica received write request for
position 0 from (9598)@172.17.0.2:44225
I0223 04:54:28.712585 808 leveldb.cpp:436] Reading position from leveldb took
37473ns
I0223 04:54:28.736302 808 leveldb.cpp:341] Persisting action (14 bytes) to
leveldb took 23.671572ms
I0223 04:54:28.736348 808 replica.cpp:712] Persisted action at 0
I0223 04:54:28.737027 805 replica.cpp:691] Replica received learned notice
for position 0 from @0.0.0.0:0
I0223 04:54:28.761451 805 leveldb.cpp:341] Persisting action (16 bytes) to
leveldb took 24.382179ms
I0223 04:54:28.761497 805 replica.cpp:712] Persisted action at 0
I0223 04:54:28.761533 805 replica.cpp:697] Replica learned NOP action at
position 0
I0223 04:54:28.762406 812 log.cpp:675] Writer started with ending position 0
I0223 04:54:28.763619 811 leveldb.cpp:436] Reading position from leveldb took
32929ns
I0223 04:54:28.764853 806 registrar.cpp:340] Successfully fetched the
registry (0B) in 156.179712ms
I0223 04:54:28.764991 806 registrar.cpp:439] Applied 1 operations in 37207ns;
attempting to update the 'registry'
I0223 04:54:28.766080 809 log.cpp:683] Attempting to append 170 bytes to the
log
I0223 04:54:28.766222 807 coordinator.cpp:348] Coordinator attempting to
write APPEND action at position 1
I0223 04:54:28.766942 817 replica.cpp:537] Replica received write request for
position 1 from (9599)@172.17.0.2:44225
I0223 04:54:28.796957 817 leveldb.cpp:341] Persisting action (189 bytes) to
leveldb took 29.967721ms
I0223 04:54:28.797009 817 replica.cpp:712] Persisted action at 1
I0223 04:54:28.797785 816 replica.cpp:691] Replica received learned notice
for position 1 from @0.0.0.0:0
I0223 04:54:28.838986 816 leveldb.cpp:341] Persisting action (191 bytes) to
leveldb took 41.156446ms
I0223 04:54:28.839037 816 replica.cpp:712] Persisted action at 1
I0223 04:54:28.839076 816 replica.cpp:697] Replica learned APPEND action at
position 1
I0223 04:54:28.840293 814 registrar.cpp:484] Successfully updated the
'registry' in 75.217152ms
I0223 04:54:28.840472 814 registrar.cpp:370] Successfully recovered registrar
I0223 04:54:28.840530 807 log.cpp:702] Attempting to truncate the log to 1
I0223 04:54:28.840692 820 coordinator.cpp:348] Coordinator attempting to
write TRUNCATE action at position 2
I0223 04:54:28.840988 805 master.cpp:1522] Recovered 0 slaves from the
Registry (131B) ; allowing 10mins for slaves to re-register
I0223 04:54:28.841042 815 hierarchical.cpp:171] Skipping recovery of
hierarchical allocator: nothing to recover
I0223 04:54:28.841539 807 replica.cpp:537] Replica received write request for
position 2 from (9600)@172.17.0.2:44225
I0223 04:54:28.877802 807 leveldb.cpp:341] Persisting action (16 bytes) to
leveldb took 36.230135ms
I0223 04:54:28.877836 807 replica.cpp:712] Persisted action at 2
I0223 04:54:28.878690 819 replica.cpp:691] Replica received learned notice
for position 2 from @0.0.0.0:0
I0223 04:54:28.919756 819 leveldb.cpp:341] Persisting action (18 bytes) to
leveldb took 40.9514ms
I0223 04:54:28.919894 819 leveldb.cpp:399] Deleting ~1 keys from leveldb took
63505ns
I0223 04:54:28.919924 819 replica.cpp:712] Persisted action at 2
I0223 04:54:28.919960 819 replica.cpp:697] Replica learned TRUNCATE action at
position 2
I0223 04:54:28.923907 786 containerizer.cpp:149] Using isolation:
posix/cpu,posix/mem,filesystem/posix
W0223 04:54:28.924551 786 backend.cpp:48] Failed to create 'bind' backend:
BindBackend requires root privileges
I0223 04:54:28.927949 814 slave.cpp:193] Slave started on 172.17.0.2:44225
I0223 04:54:28.927978 814 slave.cpp:194] Flags at startup:
--appc_simple_discovery_uri_prefix="http://"
--appc_store_dir="/tmp/mesos/store/appc" --authenticatee="crammd5"
--cgroups_cpu_enable_pids_and_tids_count="false" --cgroups_enable_cfs="false"
--cgroups_hierarchy="/sys/fs/cgroup" --cgroups_limit_swap="false"
--cgroups_root="mesos" --container_disk_watch_interval="15secs"
--containerizers="mesos"
--credential="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/credential"
--default_role="*" --disk_watch_interval="1mins" --docker="docker"
--docker_auth_server="https://auth.docker.io" --docker_kill_orphans="true"
--docker_puller_timeout="60" --docker_registry="https://registry-1.docker.io"
--docker_remove_delay="6hrs" --docker_socket="/var/run/docker.sock"
--docker_stop_timeout="0ns" --docker_store_dir="/tmp/mesos/store/docker"
--enforce_container_disk_quota="false" --executor_registration_timeout="1mins"
--executor_shutdown_grace_period="5secs"
--fetcher_cache_dir="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/fetch"
--fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks"
--gc_disk_headroom="0.1" --hadoop_home="" --help="false"
--hostname_lookup="true" --image_provisioner_backend="copy"
--initialize_driver_logging="true" --isolation="posix/cpu,posix/mem"
--launcher_dir="/mesos/mesos-0.28.0/_build/src" --logbufsecs="0"
--logging_level="INFO" --oversubscribed_resources_interval="15secs"
--perf_duration="10secs" --perf_interval="1mins"
--qos_correction_interval_min="0ns" --quiet="false" --recover="reconnect"
--recovery_timeout="15mins" --registration_backoff_factor="10ms"
--resources="cpus:2;mem:1024;disk:1024;ports:[31000-32000]"
--revocable_cpu_low_priority="true" --sandbox_directory="/mnt/mesos/sandbox"
--strict="true" --switch_user="true" --systemd_enable_support="true"
--systemd_runtime_directory="/run/systemd/system" --version="false"
--work_dir="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU"
I0223 04:54:28.928400 814 credentials.hpp:83] Loading credential for
authentication from
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/credential'
I0223 04:54:28.928591 814 slave.cpp:324] Slave using credential for:
test-principal
I0223 04:54:28.928771 814 resources.cpp:576] Parsing resources as JSON
failed: cpus:2;mem:1024;disk:1024;ports:[31000-32000]
Trying semicolon-delimited string format instead
I0223 04:54:28.929157 814 slave.cpp:464] Slave resources: cpus(*):2;
mem(*):1024; disk(*):1024; ports(*):[31000-32000]
I0223 04:54:28.929220 814 slave.cpp:472] Slave attributes: [ ]
I0223 04:54:28.929239 814 slave.cpp:477] Slave hostname: 5847d87ad902
I0223 04:54:28.929936 786 sched.cpp:222] Version: 0.28.0
I0223 04:54:28.930371 817 state.cpp:58] Recovering state from
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta'
I0223 04:54:28.930495 819 sched.cpp:326] New master detected at
master@172.17.0.2:44225
I0223 04:54:28.930583 819 sched.cpp:382] Authenticating with master
master@172.17.0.2:44225
I0223 04:54:28.930600 819 sched.cpp:389] Using default CRAM-MD5 authenticatee
I0223 04:54:28.930764 816 status_update_manager.cpp:200] Recovering status
update manager
I0223 04:54:28.930806 807 authenticatee.cpp:121] Creating new client SASL
connection
I0223 04:54:28.930994 819 containerizer.cpp:407] Recovering containerizer
I0223 04:54:28.931043 809 master.cpp:5526] Authenticating
[email protected]:44225
I0223 04:54:28.931161 813 authenticator.cpp:413] Starting authentication
session for crammd5_authenticatee(657)@172.17.0.2:44225
I0223 04:54:28.931468 808 authenticator.cpp:98] Creating new server SASL
connection
I0223 04:54:28.931641 817 authenticatee.cpp:212] Received SASL authentication
mechanisms: CRAM-MD5
I0223 04:54:28.931665 817 authenticatee.cpp:238] Attempting to authenticate
with mechanism 'CRAM-MD5'
I0223 04:54:28.931816 809 authenticator.cpp:203] Received SASL authentication
start
I0223 04:54:28.931879 809 authenticator.cpp:325] Authentication requires more
steps
I0223 04:54:28.931993 809 authenticatee.cpp:258] Received SASL authentication
step
I0223 04:54:28.932368 819 authenticator.cpp:231] Received SASL authentication
step
I0223 04:54:28.932402 819 auxprop.cpp:107] Request to lookup properties for
user: 'test-principal' realm: '5847d87ad902' server FQDN: '5847d87ad902'
SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false
SASL_AUXPROP_AUTHZID: false
I0223 04:54:28.932435 819 auxprop.cpp:179] Looking up auxiliary property
'*userPassword'
I0223 04:54:28.932478 819 auxprop.cpp:179] Looking up auxiliary property
'*cmusaslsecretCRAM-MD5'
I0223 04:54:28.932500 819 auxprop.cpp:107] Request to lookup properties for
user: 'test-principal' realm: '5847d87ad902' server FQDN: '5847d87ad902'
SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false
SASL_AUXPROP_AUTHZID: true
I0223 04:54:28.932514 819 auxprop.cpp:129] Skipping auxiliary property
'*userPassword' since SASL_AUXPROP_AUTHZID == true
I0223 04:54:28.932517 812 provisioner.cpp:245] Provisioner recovery complete
I0223 04:54:28.932526 819 auxprop.cpp:129] Skipping auxiliary property
'*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
I0223 04:54:28.932543 819 authenticator.cpp:317] Authentication success
I0223 04:54:28.932616 813 authenticatee.cpp:298] Authentication success
I0223 04:54:28.932663 805 master.cpp:5556] Successfully authenticated
principal 'test-principal' at
[email protected]:44225
I0223 04:54:28.932965 820 sched.cpp:471] Successfully authenticated with
master master@172.17.0.2:44225
I0223 04:54:28.932994 820 sched.cpp:776] Sending SUBSCRIBE call to
master@172.17.0.2:44225
I0223 04:54:28.933004 816 authenticator.cpp:431] Authentication session
cleanup for crammd5_authenticatee(657)@172.17.0.2:44225
I0223 04:54:28.933028 814 slave.cpp:4565] Finished recovery
I0223 04:54:28.933128 820 sched.cpp:809] Will retry registration in
313.903761ms if necessary
I0223 04:54:28.933233 818 master.cpp:2280] Received SUBSCRIBE call for
framework 'default' at
[email protected]:44225
I0223 04:54:28.933284 818 master.cpp:1751] Authorizing framework principal
'test-principal' to receive offers for role '*'
I0223 04:54:28.933480 814 slave.cpp:4737] Querying resource estimator for
oversubscribable resources
I0223 04:54:28.933559 806 master.cpp:2351] Subscribing framework default with
checkpointing enabled and capabilities [ ]
I0223 04:54:28.933697 810 status_update_manager.cpp:174] Pausing sending
status updates
I0223 04:54:28.933701 805 slave.cpp:796] New master detected at
master@172.17.0.2:44225
I0223 04:54:28.933771 805 slave.cpp:859] Authenticating with master
master@172.17.0.2:44225
I0223 04:54:28.933790 805 slave.cpp:864] Using default CRAM-MD5 authenticatee
I0223 04:54:28.933914 805 slave.cpp:832] Detecting new master
I0223 04:54:28.933933 818 authenticatee.cpp:121] Creating new client SASL
connection
I0223 04:54:28.933984 820 hierarchical.cpp:265] Added framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:28.934046 805 slave.cpp:4751] Received oversubscribable resources
from the resource estimator
I0223 04:54:28.934058 820 hierarchical.cpp:1434] No resources available to
allocate!
I0223 04:54:28.934094 820 hierarchical.cpp:1529] No inverse offers to send
out!
I0223 04:54:28.934118 819 sched.cpp:703] Framework registered with
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:28.934123 820 hierarchical.cpp:1127] Performed allocation for 0
slaves in 115269ns
I0223 04:54:28.934130 814 master.cpp:5526] Authenticating
[email protected]:44225
I0223 04:54:28.934165 819 sched.cpp:717] Scheduler::registered took 22856ns
I0223 04:54:28.934262 819 authenticator.cpp:413] Starting authentication
session for crammd5_authenticatee(658)@172.17.0.2:44225
I0223 04:54:28.934466 805 authenticator.cpp:98] Creating new server SASL
connection
I0223 04:54:28.934640 816 authenticatee.cpp:212] Received SASL authentication
mechanisms: CRAM-MD5
I0223 04:54:28.934672 816 authenticatee.cpp:238] Attempting to authenticate
with mechanism 'CRAM-MD5'
I0223 04:54:28.934762 814 authenticator.cpp:203] Received SASL authentication
start
I0223 04:54:28.934816 814 authenticator.cpp:325] Authentication requires more
steps
I0223 04:54:28.934908 816 authenticatee.cpp:258] Received SASL authentication
step
I0223 04:54:28.935014 811 authenticator.cpp:231] Received SASL authentication
step
I0223 04:54:28.935047 811 auxprop.cpp:107] Request to lookup properties for
user: 'test-principal' realm: '5847d87ad902' server FQDN: '5847d87ad902'
SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false
SASL_AUXPROP_AUTHZID: false
I0223 04:54:28.935060 811 auxprop.cpp:179] Looking up auxiliary property
'*userPassword'
I0223 04:54:28.935083 811 auxprop.cpp:179] Looking up auxiliary property
'*cmusaslsecretCRAM-MD5'
I0223 04:54:28.935102 811 auxprop.cpp:107] Request to lookup properties for
user: 'test-principal' realm: '5847d87ad902' server FQDN: '5847d87ad902'
SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false
SASL_AUXPROP_AUTHZID: true
I0223 04:54:28.935111 811 auxprop.cpp:129] Skipping auxiliary property
'*userPassword' since SASL_AUXPROP_AUTHZID == true
I0223 04:54:28.935117 811 auxprop.cpp:129] Skipping auxiliary property
'*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
I0223 04:54:28.935128 811 authenticator.cpp:317] Authentication success
I0223 04:54:28.935279 806 authenticatee.cpp:298] Authentication success
I0223 04:54:28.935315 818 master.cpp:5556] Successfully authenticated
principal 'test-principal' at [email protected]:44225
I0223 04:54:28.935340 819 authenticator.cpp:431] Authentication session
cleanup for crammd5_authenticatee(658)@172.17.0.2:44225
I0223 04:54:28.935611 817 slave.cpp:927] Successfully authenticated with
master master@172.17.0.2:44225
I0223 04:54:28.935791 817 slave.cpp:1321] Will retry registration in
15.895885ms if necessary
I0223 04:54:28.935920 808 master.cpp:4240] Registering slave at
[email protected]:44225 (5847d87ad902) with id
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0
I0223 04:54:28.936285 819 registrar.cpp:439] Applied 1 operations in 53091ns;
attempting to update the 'registry'
I0223 04:54:28.936992 818 log.cpp:683] Attempting to append 339 bytes to the
log
I0223 04:54:28.937113 805 coordinator.cpp:348] Coordinator attempting to
write APPEND action at position 3
I0223 04:54:28.937728 816 replica.cpp:537] Replica received write request for
position 3 from (9614)@172.17.0.2:44225
I0223 04:54:28.952417 807 slave.cpp:1321] Will retry registration in
35.446972ms if necessary
I0223 04:54:28.952584 814 master.cpp:4228] Ignoring register slave message
from [email protected]:44225 (5847d87ad902) as admission is already in progress
I0223 04:54:28.981781 816 leveldb.cpp:341] Persisting action (358 bytes) to
leveldb took 43.997431ms
I0223 04:54:28.981883 816 replica.cpp:712] Persisted action at 3
I0223 04:54:28.982956 816 replica.cpp:691] Replica received learned notice
for position 3 from @0.0.0.0:0
I0223 04:54:28.989023 818 slave.cpp:1321] Will retry registration in
52.677739ms if necessary
I0223 04:54:28.989296 818 master.cpp:4228] Ignoring register slave message
from [email protected]:44225 (5847d87ad902) as admission is already in progress
I0223 04:54:29.032562 816 leveldb.cpp:341] Persisting action (360 bytes) to
leveldb took 49.562387ms
I0223 04:54:29.032641 816 replica.cpp:712] Persisted action at 3
I0223 04:54:29.032675 816 replica.cpp:697] Replica learned APPEND action at
position 3
I0223 04:54:29.034543 807 registrar.cpp:484] Successfully updated the
'registry' in 98.188032ms
I0223 04:54:29.034868 818 log.cpp:702] Attempting to truncate the log to 3
I0223 04:54:29.035054 819 coordinator.cpp:348] Coordinator attempting to
write TRUNCATE action at position 4
I0223 04:54:29.035475 806 slave.cpp:3482] Received ping from
slave-observer(278)@172.17.0.2:44225
I0223 04:54:29.035616 820 master.cpp:4308] Registered slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
(5847d87ad902) with cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000]
I0223 04:54:29.035703 809 slave.cpp:971] Registered with master
master@172.17.0.2:44225; given slave ID 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0
I0223 04:54:29.035728 809 fetcher.cpp:81] Clearing fetcher cache
I0223 04:54:29.035836 814 hierarchical.cpp:473] Added slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 (5847d87ad902) with cpus(*):2;
mem(*):1024; disk(*):1024; ports(*):[31000-32000] (allocated: )
I0223 04:54:29.035892 815 status_update_manager.cpp:181] Resuming sending
status updates
I0223 04:54:29.035941 819 replica.cpp:537] Replica received write request for
position 4 from (9615)@172.17.0.2:44225
I0223 04:54:29.036110 809 slave.cpp:994] Checkpointing SlaveInfo to
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/slave.info'
I0223 04:54:29.036485 809 slave.cpp:1030] Forwarding total oversubscribed
resources
I0223 04:54:29.036645 813 master.cpp:4649] Received update of slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
(5847d87ad902) with total oversubscribed resources
I0223 04:54:29.036782 814 hierarchical.cpp:1529] No inverse offers to send
out!
I0223 04:54:29.036818 814 hierarchical.cpp:1147] Performed allocation for
slave 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 in 935779ns
I0223 04:54:29.037040 813 master.cpp:5355] Sending 1 offers to framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (default) at
[email protected]:44225
I0223 04:54:29.037104 814 hierarchical.cpp:531] Slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 (5847d87ad902) updated with
oversubscribed resources (total: cpus(*):2; mem(*):1024; disk(*):1024;
ports(*):[31000-32000], allocated: cpus(*):2; mem(*):1024; disk(*):1024;
ports(*):[31000-32000])
I0223 04:54:29.037240 814 hierarchical.cpp:1434] No resources available to
allocate!
I0223 04:54:29.037282 814 hierarchical.cpp:1529] No inverse offers to send
out!
I0223 04:54:29.037307 814 hierarchical.cpp:1147] Performed allocation for
slave 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 in 160900ns
I0223 04:54:29.037569 809 sched.cpp:873] Scheduler::resourceOffers took
195666ns
I0223 04:54:29.039176 807 master.cpp:3138] Processing ACCEPT call for offers:
[ 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-O0 ] on slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
(5847d87ad902) for framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
(default) at [email protected]:44225
I0223 04:54:29.039221 807 master.cpp:2825] Authorizing framework principal
'test-principal' to launch task 1 as user 'mesos'
W0223 04:54:29.040493 807 validation.cpp:404] Executor http for task 1 uses
less CPUs (None) than the minimum required (0.01). Please update your executor,
as this will be mandatory in future releases.
W0223 04:54:29.040529 807 validation.cpp:416] Executor http for task 1 uses
less memory (None) than the minimum required (32MB). Please update your
executor, as this will be mandatory in future releases.
I0223 04:54:29.040904 807 master.hpp:176] Adding task 1 with resources
cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] on slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 (5847d87ad902)
I0223 04:54:29.041144 807 master.cpp:3623] Launching task 1 of framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (default) at
[email protected]:44225 with resources
cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] on slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225 (5847d87ad902)
I0223 04:54:29.041574 809 slave.cpp:1361] Got assigned task 1 for framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:29.041755 809 slave.cpp:5287] Checkpointing FrameworkInfo to
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/framework.info'
I0223 04:54:29.042193 809 slave.cpp:5298] Checkpointing framework pid
'[email protected]:44225' to
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/framework.pid'
I0223 04:54:29.043066 809 slave.cpp:1480] Launching task 1 for framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:29.043884 809 paths.cpp:474] Trying to chown
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc'
to user 'mesos'
I0223 04:54:29.055244 809 slave.cpp:5739] Checkpointing ExecutorInfo to
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/executor.info'
I0223 04:54:29.055868 809 slave.cpp:5367] Launching executor http of
framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 with resources in work
directory
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc'
I0223 04:54:29.056241 820 containerizer.cpp:666] Starting container
'a43cbeed-f390-46aa-b7a1-67770a3dd4fc' for executor 'http' of framework
'81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000'
I0223 04:54:29.056298 809 slave.cpp:5762] Checkpointing TaskInfo to
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/tasks/1/task.info'
I0223 04:54:29.056675 809 slave.cpp:1698] Queuing task '1' for executor
'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:29.056779 809 slave.cpp:749] Successfully attached file
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc'
I0223 04:54:29.061602 815 launcher.cpp:147] Forked child with pid '6997' for
container 'a43cbeed-f390-46aa-b7a1-67770a3dd4fc'
I0223 04:54:29.061995 815 containerizer.cpp:1104] Checkpointing executor's
forked pid 6997 to
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/pids/forked.pid'
I0223 04:54:29.083487 819 leveldb.cpp:341] Persisting action (16 bytes) to
leveldb took 47.478443ms
I0223 04:54:29.083585 819 replica.cpp:712] Persisted action at 4
I0223 04:54:29.084709 810 replica.cpp:691] Replica received learned notice
for position 4 from @0.0.0.0:0
I0223 04:54:29.122370 810 leveldb.cpp:341] Persisting action (18 bytes) to
leveldb took 37.598534ms
I0223 04:54:29.122570 810 leveldb.cpp:399] Deleting ~2 keys from leveldb took
89419ns
I0223 04:54:29.122599 810 replica.cpp:712] Persisted action at 4
I0223 04:54:29.122635 810 replica.cpp:697] Replica learned TRUNCATE action at
position 4
I0223 04:54:29.606348 806 hierarchical.cpp:1434] No resources available to
allocate!
I0223 04:54:29.606449 806 hierarchical.cpp:1529] No inverse offers to send
out!
I0223 04:54:29.606494 806 hierarchical.cpp:1127] Performed allocation for 1
slaves in 411327ns
I0223 04:54:30.607574 810 hierarchical.cpp:1434] No resources available to
allocate!
I0223 04:54:30.607647 810 hierarchical.cpp:1529] No inverse offers to send
out!
I0223 04:54:30.607676 810 hierarchical.cpp:1127] Performed allocation for 1
slaves in 309149ns
WARNING: Logging before InitGoogleLogging() is written to STDERR
I0223 04:54:31.163267 7011 process.cpp:991] libprocess is initialized on
172.17.0.2:47892 for 16 cpus
I0223 04:54:31.164299 7036 logging.cpp:193] Logging to STDERR
I0223 04:54:31.164322 7036 executor.cpp:172] Version: 0.28.0
I0223 04:54:31.166379 7045 executor.cpp:316] Connected with the agent
I0223 04:54:31.167551 7035 executor.cpp:247] Sending SUBSCRIBE call to
http://172.17.0.2:44225/slave/api/v1/executor
I0223 04:54:31.176286 808 process.cpp:3141] Handling HTTP event for process
'slave' with path: '/slave/api/v1/executor'
I0223 04:54:31.176759 814 http.cpp:190] HTTP POST for /slave/api/v1/executor
from 172.17.0.2:42711
I0223 04:54:31.177016 814 slave.cpp:2476] Received Subscribe request for HTTP
executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:31.177106 814 slave.cpp:2539] Creating a marker file for HTTP
based executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
(via HTTP) at path
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/http.marker'
I0223 04:54:31.179092 813 slave.cpp:1863] Sending queued task '1' to executor
'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via HTTP)
I0223 04:54:31.181721 7036 executor.cpp:588] Enqueuing event SUBSCRIBED
received from http://172.17.0.2:44225/slave/api/v1/executor
I0223 04:54:31.182701 7035 executor.cpp:588] Enqueuing event LAUNCH received
from http://172.17.0.2:44225/slave/api/v1/executor
Received a SUBSCRIBED event
Starting task 1
Finishing task 1
I0223 04:54:31.185159 7040 executor.cpp:247] Sending UPDATE call to
http://172.17.0.2:44225/slave/api/v1/executor
I0223 04:54:31.185442 7040 executor.cpp:247] Sending UPDATE call to
http://172.17.0.2:44225/slave/api/v1/executor
I0223 04:54:31.187504 811 slave.cpp:668] Slave terminating
I0223 04:54:31.187763 819 master.cpp:1174] Slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
(5847d87ad902) disconnected
I0223 04:54:31.187794 819 master.cpp:2635] Disconnecting slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225 (5847d87ad902)
I0223 04:54:31.187850 819 master.cpp:2654] Deactivating slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225 (5847d87ad902)
I0223 04:54:31.187969 809 hierarchical.cpp:560] Slave
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 deactivated
I0223 04:54:31.188148 7037 executor.cpp:586] Enqueuing locally injected event
ERROR
Received an ERROR event
I0223 04:54:31.189012 7044 executor.cpp:586] Enqueuing locally injected event
ERROR
Received an ERROR event
E0223 04:54:31.189673 7044 executor.cpp:553] End-Of-File received from agent.
The agent closed the event stream
I0223 04:54:31.189709 7044 executor.cpp:357] Disconnected from agent:
End-Of-File received from agent. The agent closed the event stream
I0223 04:54:31.190441 7044 executor.cpp:411] Will retry connecting with the
agent again in 1.548772469secs
I0223 04:54:31.191680 786 containerizer.cpp:149] Using isolation:
posix/cpu,posix/mem,filesystem/posix
W0223 04:54:31.192128 786 backend.cpp:48] Failed to create 'bind' backend:
BindBackend requires root privileges
I0223 04:54:31.192425 7033 executor.cpp:316] Connected with the agent
I0223 04:54:31.193727 7034 executor.cpp:247] Sending SUBSCRIBE call to
http://172.17.0.2:44225/slave/api/v1/executor
I0223 04:54:31.194473 821 process.cpp:2409] Returning '404 Not Found' for
'/slave/api/v1/executor'
W0223 04:54:31.195523 7041 executor.cpp:509] Received '404 Not Found' () for
SUBSCRIBE
I0223 04:54:31.195714 817 slave.cpp:193] Slave started on 172.17.0.2:44225
I0223 04:54:31.195780 817 slave.cpp:194] Flags at startup:
--appc_simple_discovery_uri_prefix="http://"
--appc_store_dir="/tmp/mesos/store/appc" --authenticatee="crammd5"
--cgroups_cpu_enable_pids_and_tids_count="false" --cgroups_enable_cfs="false"
--cgroups_hierarchy="/sys/fs/cgroup" --cgroups_limit_swap="false"
--cgroups_root="mesos" --container_disk_watch_interval="15secs"
--containerizers="mesos"
--credential="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/credential"
--default_role="*" --disk_watch_interval="1mins" --docker="docker"
--docker_auth_server="https://auth.docker.io" --docker_kill_orphans="true"
--docker_puller_timeout="60" --docker_registry="https://registry-1.docker.io"
--docker_remove_delay="6hrs" --docker_socket="/var/run/docker.sock"
--docker_stop_timeout="0ns" --docker_store_dir="/tmp/mesos/store/docker"
--enforce_container_disk_quota="false" --executor_registration_timeout="1mins"
--executor_shutdown_grace_period="5secs"
--fetcher_cache_dir="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/fetch"
--fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks"
--gc_disk_headroom="0.1" --hadoop_home="" --help="false"
--hostname_lookup="true" --image_provisioner_backend="copy"
--initialize_driver_logging="true" --isolation="posix/cpu,posix/mem"
--launcher_dir="/mesos/mesos-0.28.0/_build/src" --logbufsecs="0"
--logging_level="INFO" --oversubscribed_resources_interval="15secs"
--perf_duration="10secs" --perf_interval="1mins"
--qos_correction_interval_min="0ns" --quiet="false" --recover="reconnect"
--recovery_timeout="15mins" --registration_backoff_factor="10ms"
--resources="cpus:2;mem:1024;disk:1024;ports:[31000-32000]"
--revocable_cpu_low_priority="true" --sandbox_directory="/mnt/mesos/sandbox"
--strict="true" --switch_user="true" --systemd_enable_support="true"
--systemd_runtime_directory="/run/systemd/system" --version="false"
--work_dir="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU"
I0223 04:54:31.196280 817 credentials.hpp:83] Loading credential for
authentication from
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/credential'
I0223 04:54:31.196491 817 slave.cpp:324] Slave using credential for:
test-principal
I0223 04:54:31.196655 817 resources.cpp:576] Parsing resources as JSON
failed: cpus:2;mem:1024;disk:1024;ports:[31000-32000]
Trying semicolon-delimited string format instead
I0223 04:54:31.197144 817 slave.cpp:464] Slave resources: cpus(*):2;
mem(*):1024; disk(*):1024; ports(*):[31000-32000]
I0223 04:54:31.197226 817 slave.cpp:472] Slave attributes: [ ]
I0223 04:54:31.197242 817 slave.cpp:477] Slave hostname: 5847d87ad902
I0223 04:54:31.198366 816 state.cpp:58] Recovering state from
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta'
I0223 04:54:31.198439 816 state.cpp:698] No checkpointed resources found at
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/resources/resources.info'
W0223 04:54:31.200115 816 state.cpp:607] Failed to find status updates file
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/tasks/1/task.updates'
I0223 04:54:31.202293 819 fetcher.cpp:81] Clearing fetcher cache
I0223 04:54:31.202379 819 slave.cpp:4653] Recovering framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:31.202507 819 slave.cpp:5476] Recovering executor 'http' of
framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:31.203364 810 status_update_manager.cpp:200] Recovering status
update manager
I0223 04:54:31.203483 810 status_update_manager.cpp:208] Recovering executor
'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
W0223 04:54:31.203572 810 status_update_manager.cpp:247] No updates found for
task 1 of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:31.203832 819 slave.cpp:749] Successfully attached file
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc'
I0223 04:54:31.204326 820 containerizer.cpp:407] Recovering containerizer
I0223 04:54:31.204448 820 containerizer.cpp:462] Recovering container
'a43cbeed-f390-46aa-b7a1-67770a3dd4fc' for executor 'http' of framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:31.206951 820 provisioner.cpp:245] Provisioner recovery complete
I0223 04:54:31.208343 806 slave.cpp:4512] Waiting for executor 'http' of
framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via HTTP) to subscribe
I0223 04:54:31.608290 815 hierarchical.cpp:1434] No resources available to
allocate!
I0223 04:54:31.608355 815 hierarchical.cpp:1529] No inverse offers to send
out!
I0223 04:54:31.608387 815 hierarchical.cpp:1127] Performed allocation for 1
slaves in 252147ns
I0223 04:54:32.169790 7044 executor.cpp:247] Sending SUBSCRIBE call to
http://172.17.0.2:44225/slave/api/v1/executor
I0223 04:54:32.172236 807 process.cpp:3141] Handling HTTP event for process
'slave' with path: '/slave/api/v1/executor'
I0223 04:54:32.173068 807 http.cpp:190] HTTP POST for /slave/api/v1/executor
from 172.17.0.2:42713
I0223 04:54:32.173709 807 slave.cpp:2476] Received Subscribe request for HTTP
executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via
HTTP)
I0223 04:54:32.173815 807 slave.cpp:2539] Creating a marker file for HTTP
based executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
(via HTTP) at path
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/http.marker'
I0223 04:54:32.194341 7044 executor.cpp:247] Sending SUBSCRIBE call to
http://172.17.0.2:44225/slave/api/v1/executor
I0223 04:54:32.196682 807 slave.cpp:3002] Handling status update TASK_RUNNING
(UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.196950 807 slave.cpp:3002] Handling status update
TASK_FINISHED (UUID: e7010e72-616f-4612-a218-47b26031630c) for task 1 of
framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.198343 811 process.cpp:3141] Handling HTTP event for process
'slave' with path: '/slave/api/v1/executor'
I0223 04:54:32.198863 808 status_update_manager.cpp:320] Received status
update TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of
framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.198905 808 status_update_manager.cpp:497] Creating
StatusUpdate stream for task 1 of framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.198945 811 http.cpp:190] HTTP POST for /slave/api/v1/executor
from 172.17.0.2:42713
I0223 04:54:32.199236 811 slave.cpp:2476] Received Subscribe request for HTTP
executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via
HTTP)
W0223 04:54:32.199256 811 slave.cpp:2518] Closing already existing HTTP
connection from executor 'http' of framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via HTTP)
I0223 04:54:32.199362 811 slave.cpp:2539] Creating a marker file for HTTP
based executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
(via HTTP) at path
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/http.marker'
I0223 04:54:32.199448 808 status_update_manager.cpp:824] Checkpointing UPDATE
for status update TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for
task 1 of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.199654 811 slave.cpp:3002] Handling status update TASK_RUNNING
(UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.199822 811 slave.cpp:3002] Handling status update
TASK_FINISHED (UUID: e7010e72-616f-4612-a218-47b26031630c) for task 1 of
framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.200212 7040 executor.cpp:588] Enqueuing event SUBSCRIBED
received from http://172.17.0.2:44225/slave/api/v1/executor
I0223 04:54:32.200510 811 slave.cpp:5677] Terminating task 1
F0223 04:54:32.200620 806 process.cpp:1114] Check failed: items.size() > 0
E0223 04:54:32.201045 7040 executor.cpp:553] End-Of-File received from agent.
The agent closed the event stream
I0223 04:54:32.201124 7040 executor.cpp:357] Disconnected from agent:
End-Of-File received from agent. The agent closed the event stream
Received a SUBSCRIBED event
I0223 04:54:32.201858 7040 executor.cpp:411] Will retry connecting with the
agent again in 587.356137ms
E0223 04:54:32.203120 7040 executor.cpp:543] Failed to decode the stream of
events: Pipe::Reader failure: failed to decode body
I0223 04:54:32.203341 7031 executor.cpp:316] Connected with the agent
I0223 04:54:32.204586 7040 executor.cpp:247] Sending SUBSCRIBE call to
http://172.17.0.2:44225/slave/api/v1/executor
*** Check failure stack trace: ***
I0223 04:54:32.205591 820 process.cpp:3141] Handling HTTP event for process
'slave' with path: '/slave/api/v1/executor'
I0223 04:54:32.242341 819 http.cpp:190] HTTP POST for /slave/api/v1/executor
from 172.17.0.2:42715
I0223 04:54:32.242702 819 slave.cpp:2476] Received Subscribe request for HTTP
executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via
HTTP)
W0223 04:54:32.242722 819 slave.cpp:2518] Closing already existing HTTP
connection from executor 'http' of framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via HTTP)
I0223 04:54:32.242769 819 slave.cpp:2539] Creating a marker file for HTTP
based executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
(via HTTP) at path
'/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/http.marker'
@ 0x2b5689fe3a4e google::LogMessage::Fail()
@ 0x2b5689fe399a google::LogMessage::SendToLog()
I0223 04:54:32.264030 819 slave.cpp:3002] Handling status update TASK_RUNNING
(UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.264382 819 slave.cpp:3002] Handling status update
TASK_FINISHED (UUID: e7010e72-616f-4612-a218-47b26031630c) for task 1 of
framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
@ 0x2b5689fe339c google::LogMessage::Flush()
I0223 04:54:32.268173 7045 executor.cpp:588] Enqueuing event SUBSCRIBED
received from http://172.17.0.2:44225/slave/api/v1/executor
Received a SUBSCRIBED event
@ 0x2b5689fe62b0 google::LogMessageFatal::~LogMessageFatal()
@ 0x2b5689f3b512 process::HttpProxy::waited()
@ 0x2b5689f68821
_ZZN7process8dispatchINS_9HttpProxyERKNS_6FutureINS_4http8ResponseEEES5_EEvRKNS_3PIDIT_EEMS9_FvT0_ET1_ENKUlPNS_11ProcessBaseEE_clESI_
@ 0x2b5689f84c72
_ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchINS0_9HttpProxyERKNS0_6FutureINS0_4http8ResponseEEES9_EEvRKNS0_3PIDIT_EEMSD_FvT0_ET1_EUlS2_E_E9_M_invokeERKSt9_Any_dataS2_
@ 0x2b5689f62b93 std::function<>::operator()()
@ 0x2b5689f48b97 process::ProcessBase::visit()
@ 0x2b5689f4ec3e process::DispatchEvent::visit()
@ 0x999aec process::ProcessBase::serve()
@ 0x2b5689f44d7a process::ProcessManager::resume()
@ 0x2b5689f41e32
_ZZN7process14ProcessManager12init_threadsEvENKUlRKSt11atomic_boolE_clES3_
@ 0x2b5689f4e42a
_ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS3_EEE6__callIvIEILm0EEEET_OSt5tupleIIDpT0_EESt12_Index_tupleIIXspT1_EEE
@ 0x2b5689f4e3da
_ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS3_EEEclIIEvEET0_DpOT_
I0223 04:54:32.322123 808 status_update_manager.cpp:374] Forwarding update
TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of
framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 to the slave
@ 0x2b5689f4e36c
_ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS4_EEEvEE9_M_invokeIIEEEvSt12_Index_tupleIIXspT_EEE
W0223 04:54:32.322547 807 slave.cpp:3346] Dropping status update TASK_RUNNING
(UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of framework
81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 sent by status update manager because
the slave is in RECOVERING state
I0223 04:54:32.322633 807 slave.cpp:3294] Status update manager successfully
handled status update TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639)
for task 1 of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.322684 808 status_update_manager.cpp:320] Received status
update TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of
framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
W0223 04:54:32.322749 808 status_update_manager.cpp:729] Ignoring duplicate
status update TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for
task 1 of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.322932 808 status_update_manager.cpp:320] Received status
update TASK_FINISHED (UUID: e7010e72-616f-4612-a218-47b26031630c) for task 1 of
framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.322988 808 status_update_manager.cpp:824] Checkpointing UPDATE
for status update TASK_FINISHED (UUID: e7010e72-616f-4612-a218-47b26031630c)
for task 1 of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
I0223 04:54:32.322988 807 slave.cpp:3294] Status update manager successfully
handled status update TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639)
for task 1 of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
{code}
This crash was also observed on a recent Arch Linux VM (VirtualBox), with
{{stress --cpu 4}} running concurrently. Reproduced with {{./src/mesos-tests
--gtest_filter="SlaveRecovery*" --gtest_repeat=100 --gtest_break_on_failure}};
it took about 20 iterations to trigger a crash.
{noformat}
[ RUN ] SlaveRecoveryTest/0.ReconnectHTTPExecutor
I0223 09:38:55.434953 11158 executor.cpp:172] Version: 0.28.0
Received a SUBSCRIBED event
Starting task 1
Finishing task 1
Received an ERROR event
Received an ERROR event
E0223 09:38:55.504820 11159 executor.cpp:553] End-Of-File received from agent.
The agent closed the event stream
Received an ERROR event
Received an ERROR event
Received an ERROR event
F0223 09:39:00.535778 22159 process.cpp:1114] Check failed: items.size() > 0
*** Check failure stack trace: ***
Received an ERROR event
Received an ERROR event
@ 0x7f4affd0e754 google::LogMessage::Fail()
Received an ERROR event
Received an ERROR event
Received an ERROR event
Received an ERROR event
@ 0x7f4affd0e6ad google::LogMessage::SendToLog()
@ 0x7f4affd0e0a3 google::LogMessage::Flush()
@ 0x7f4affd10f14 google::LogMessageFatal::~LogMessageFatal()
@ 0x7f4affc618d4 process::HttpProxy::waited()
@ 0x7f4affc8f57f
_ZZN7process8dispatchINS_9HttpProxyERKNS_6FutureINS_4http8ResponseEEES5_EEvRKNS_3PIDIT_EEMS9_FvT0_ET1_ENKUlPNS_11ProcessBaseEE_clESI_
@ 0x7f4affcac946
_ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchINS0_9HttpProxyERKNS0_6FutureINS0_4http8ResponseEEES9_EEvRKNS0_3PIDIT_EEMSD_FvT0_ET1_EUlS2_E_E9_M_invokeERKSt9_Any_dataOS2_
@ 0x7f4affc89961 std::function<>::operator()()
@ 0x7f4affc6ef02 process::ProcessBase::visit()
@ 0x7f4affc74e52 process::DispatchEvent::visit()
@ 0xa3afe8 process::ProcessBase::serve()
@ 0x7f4affc6b073 process::ProcessManager::resume()
@ 0x7f4affc6813b
_ZZN7process14ProcessManager12init_threadsEvENKUlRKSt6atomicIbEE_clES4_
@ 0x7f4affc745fa
_ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS4_EEE6__callIvJEJLm0EEEET_OSt5tupleIJDpT0_EESt12_Index_tupleIJXspT1_EEE
@ 0x7f4affc745a8
_ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS4_EEEclIJEvEET0_DpOT_
@ 0x7f4affc74556
_ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS5_EEEvEE9_M_invokeIJEEEvSt12_Index_tupleIJXspT_EEE
@ 0x7f4affc744bf
_ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS5_EEEvEEclEv
@ 0x7f4affc7445e
_ZNSt6thread5_ImplISt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS7_EEEvEEE6_M_runEv
@ 0x7f4afa6ddc40 execute_native_thread_routine
@ 0x7f4afadba424 start_thread
@ 0x7f4af9e50cbd __clone
@ (nil) (unknown)
Aborted (core dumped)
{noformat}
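The repro described above can be scripted roughly as follows. This is only a sketch: the {{MESOS_TESTS}}, {{REPEAT}} and {{RUN_REPRO}} variables and the two helper functions are hypothetical names introduced here, while the {{stress}} and {{mesos-tests}} invocations are the ones quoted in the description. By default the script only prints the test command; set {{RUN_REPRO=1}} to actually run it from the build directory.

```shell
#!/bin/sh
# Sketch of the repro from the description. MESOS_TESTS, REPEAT and RUN_REPRO
# are hypothetical knobs added for this example, not Mesos options.
MESOS_TESTS="${MESOS_TESTS:-./src/mesos-tests}"
REPEAT="${REPEAT:-100}"

# Print the test command line quoted in the description.
repro_command() {
  printf '%s --gtest_filter="SlaveRecovery*" --gtest_repeat=%s --gtest_break_on_failure\n' \
    "$MESOS_TESTS" "$REPEAT"
}

# Start CPU load in the background, run the tests, then stop the load.
# stress forks worker processes; if any outlive the parent they may need
# to be cleaned up separately.
run_repro() {
  stress --cpu 4 &
  stress_pid=$!
  "$MESOS_TESTS" --gtest_filter='SlaveRecovery*' \
    --gtest_repeat="$REPEAT" --gtest_break_on_failure
  status=$?
  kill "$stress_pid" 2>/dev/null
  return "$status"
}

# Only run the (slow, CPU-heavy) repro when explicitly requested.
if [ "${RUN_REPRO:-0}" = "1" ]; then
  run_repro
else
  repro_command
fi
```

Because the failure is timing dependent, the number of iterations needed will likely vary between machines; the roughly 20 iterations reported above is a single observation.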
> libprocess CHECK failure in SlaveRecoveryTest/0.ReconnectHTTPExecutor
> ---------------------------------------------------------------------
>
> Key: MESOS-4739
> URL: https://issues.apache.org/jira/browse/MESOS-4739
> Project: Mesos
> Issue Type: Bug
> Components: HTTP API, libprocess
> Reporter: Neil Conway
> Labels: flaky-test, libprocess, mesosphere
>
> Showed up on ASF CI:
> https://builds.apache.org/job/Mesos/COMPILER=gcc,CONFIGURATION=--verbose,ENVIRONMENT=GLOG_v=1%20MESOS_VERBOSE=1,OS=ubuntu:14.04,label_exp=(docker%7C%7CHadoop)&&(!ubuntu-us1)/1704/consoleFull
> {code}
> [ RUN ] SlaveRecoveryTest/0.ReconnectHTTPExecutor
> I0223 04:54:28.547051 786 leveldb.cpp:174] Opened db in 124.456584ms
> I0223 04:54:28.597709 786 leveldb.cpp:181] Compacted db in 50.603402ms
> I0223 04:54:28.597779 786 leveldb.cpp:196] Created db iterator in 22429ns
> I0223 04:54:28.597797 786 leveldb.cpp:202] Seeked to beginning of db in
> 2279ns
> I0223 04:54:28.597810 786 leveldb.cpp:271] Iterated through 0 keys in the
> db in 265ns
> I0223 04:54:28.597859 786 replica.cpp:779] Replica recovered with log
> positions 0 -> 0 with 1 holes and 0 unlearned
> I0223 04:54:28.598731 807 recover.cpp:447] Starting replica recovery
> I0223 04:54:28.599493 807 recover.cpp:473] Replica is in EMPTY status
> I0223 04:54:28.601400 815 replica.cpp:673] Replica in EMPTY status received
> a broadcasted recover request from (9593)@172.17.0.2:44225
> I0223 04:54:28.601776 818 recover.cpp:193] Received a recover response from
> a replica in EMPTY status
> I0223 04:54:28.602247 809 recover.cpp:564] Updating replica status to
> STARTING
> I0223 04:54:28.603353 811 master.cpp:376] Master
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303 (5847d87ad902) started on
> 172.17.0.2:44225
> I0223 04:54:28.603376 811 master.cpp:378] Flags at startup: --acls=""
> --allocation_interval="1secs" --allocator="HierarchicalDRF"
> --authenticate="true" --authenticate_http="true" --authenticate_slaves="true"
> --authenticators="crammd5" --authorizers="local"
> --credentials="/tmp/f6d1qA/credentials" --framework_sorter="drf"
> --help="false" --hostname_lookup="true" --http_authenticators="basic"
> --initialize_driver_logging="true" --log_auto_initialize="true"
> --logbufsecs="0" --logging_level="INFO" --max_completed_frameworks="50"
> --max_completed_tasks_per_framework="1000" --max_slave_ping_timeouts="5"
> --quiet="false" --recovery_slave_removal_limit="100%"
> --registry="replicated_log" --registry_fetch_timeout="1mins"
> --registry_store_timeout="100secs" --registry_strict="true"
> --root_submissions="true" --slave_ping_timeout="15secs"
> --slave_reregister_timeout="10mins" --user_sorter="drf" --version="false"
> --webui_dir="/mesos/mesos-0.28.0/_inst/share/mesos/webui"
> --work_dir="/tmp/f6d1qA/master" --zk_session_timeout="10secs"
> I0223 04:54:28.603906 811 master.cpp:423] Master only allowing
> authenticated frameworks to register
> I0223 04:54:28.603920 811 master.cpp:428] Master only allowing
> authenticated slaves to register
> I0223 04:54:28.603930 811 credentials.hpp:35] Loading credentials for
> authentication from '/tmp/f6d1qA/credentials'
> I0223 04:54:28.604317 811 master.cpp:468] Using default 'crammd5'
> authenticator
> I0223 04:54:28.604506 811 master.cpp:537] Using default 'basic' HTTP
> authenticator
> I0223 04:54:28.604635 811 master.cpp:571] Authorization enabled
> I0223 04:54:28.604918 808 whitelist_watcher.cpp:77] No whitelist given
> I0223 04:54:28.605023 819 hierarchical.cpp:144] Initialized hierarchical
> allocator process
> I0223 04:54:28.608273 812 master.cpp:1712] The newly elected leader is
> master@172.17.0.2:44225 with id 81a295fc-fe1b-4ff8-9291-cd54f5c6f303
> I0223 04:54:28.608314 812 master.cpp:1725] Elected as the leading master!
> I0223 04:54:28.608333 812 master.cpp:1470] Recovering from registrar
> I0223 04:54:28.608610 812 registrar.cpp:307] Recovering registrar
> I0223 04:54:28.631079 817 leveldb.cpp:304] Persisting metadata (8 bytes) to
> leveldb took 28.524027ms
> I0223 04:54:28.631156 817 replica.cpp:320] Persisted replica status to
> STARTING
> I0223 04:54:28.631431 810 recover.cpp:473] Replica is in STARTING status
> I0223 04:54:28.632550 819 replica.cpp:673] Replica in STARTING status
> received a broadcasted recover request from (9595)@172.17.0.2:44225
> I0223 04:54:28.632968 816 recover.cpp:193] Received a recover response from
> a replica in STARTING status
> I0223 04:54:28.633414 807 recover.cpp:564] Updating replica status to VOTING
> I0223 04:54:28.660815 818 leveldb.cpp:304] Persisting metadata (8 bytes) to
> leveldb took 27.23603ms
> I0223 04:54:28.660898 818 replica.cpp:320] Persisted replica status to
> VOTING
> I0223 04:54:28.661111 809 recover.cpp:578] Successfully joined the Paxos
> group
> I0223 04:54:28.661368 813 recover.cpp:462] Recover process terminated
> I0223 04:54:28.663571 816 log.cpp:659] Attempting to start the writer
> I0223 04:54:28.665334 817 replica.cpp:493] Replica received implicit
> promise request from (9596)@172.17.0.2:44225 with proposal 1
> I0223 04:54:28.685936 817 leveldb.cpp:304] Persisting metadata (8 bytes) to
> leveldb took 20.539463ms
> I0223 04:54:28.686019 817 replica.cpp:342] Persisted promised to 1
> I0223 04:54:28.687299 815 coordinator.cpp:238] Coordinator attempting to
> fill missing positions
> I0223 04:54:28.689152 807 replica.cpp:388] Replica received explicit
> promise request from (9597)@172.17.0.2:44225 for position 0 with proposal 2
> I0223 04:54:28.711225 807 leveldb.cpp:341] Persisting action (8 bytes) to
> leveldb took 22.011554ms
> I0223 04:54:28.711299 807 replica.cpp:712] Persisted action at 0
> I0223 04:54:28.712512 808 replica.cpp:537] Replica received write request
> for position 0 from (9598)@172.17.0.2:44225
> I0223 04:54:28.712585 808 leveldb.cpp:436] Reading position from leveldb
> took 37473ns
> I0223 04:54:28.736302 808 leveldb.cpp:341] Persisting action (14 bytes) to
> leveldb took 23.671572ms
> I0223 04:54:28.736348 808 replica.cpp:712] Persisted action at 0
> I0223 04:54:28.737027 805 replica.cpp:691] Replica received learned notice
> for position 0 from @0.0.0.0:0
> I0223 04:54:28.761451 805 leveldb.cpp:341] Persisting action (16 bytes) to
> leveldb took 24.382179ms
> I0223 04:54:28.761497 805 replica.cpp:712] Persisted action at 0
> I0223 04:54:28.761533 805 replica.cpp:697] Replica learned NOP action at
> position 0
> I0223 04:54:28.762406 812 log.cpp:675] Writer started with ending position 0
> I0223 04:54:28.763619 811 leveldb.cpp:436] Reading position from leveldb
> took 32929ns
> I0223 04:54:28.764853 806 registrar.cpp:340] Successfully fetched the
> registry (0B) in 156.179712ms
> I0223 04:54:28.764991 806 registrar.cpp:439] Applied 1 operations in
> 37207ns; attempting to update the 'registry'
> I0223 04:54:28.766080 809 log.cpp:683] Attempting to append 170 bytes to
> the log
> I0223 04:54:28.766222 807 coordinator.cpp:348] Coordinator attempting to
> write APPEND action at position 1
> I0223 04:54:28.766942 817 replica.cpp:537] Replica received write request
> for position 1 from (9599)@172.17.0.2:44225
> I0223 04:54:28.796957 817 leveldb.cpp:341] Persisting action (189 bytes) to
> leveldb took 29.967721ms
> I0223 04:54:28.797009 817 replica.cpp:712] Persisted action at 1
> I0223 04:54:28.797785 816 replica.cpp:691] Replica received learned notice
> for position 1 from @0.0.0.0:0
> I0223 04:54:28.838986 816 leveldb.cpp:341] Persisting action (191 bytes) to
> leveldb took 41.156446ms
> I0223 04:54:28.839037 816 replica.cpp:712] Persisted action at 1
> I0223 04:54:28.839076 816 replica.cpp:697] Replica learned APPEND action at
> position 1
> I0223 04:54:28.840293 814 registrar.cpp:484] Successfully updated the
> 'registry' in 75.217152ms
> I0223 04:54:28.840472 814 registrar.cpp:370] Successfully recovered
> registrar
> I0223 04:54:28.840530 807 log.cpp:702] Attempting to truncate the log to 1
> I0223 04:54:28.840692 820 coordinator.cpp:348] Coordinator attempting to
> write TRUNCATE action at position 2
> I0223 04:54:28.840988 805 master.cpp:1522] Recovered 0 slaves from the
> Registry (131B) ; allowing 10mins for slaves to re-register
> I0223 04:54:28.841042 815 hierarchical.cpp:171] Skipping recovery of
> hierarchical allocator: nothing to recover
> I0223 04:54:28.841539 807 replica.cpp:537] Replica received write request
> for position 2 from (9600)@172.17.0.2:44225
> I0223 04:54:28.877802 807 leveldb.cpp:341] Persisting action (16 bytes) to
> leveldb took 36.230135ms
> I0223 04:54:28.877836 807 replica.cpp:712] Persisted action at 2
> I0223 04:54:28.878690 819 replica.cpp:691] Replica received learned notice
> for position 2 from @0.0.0.0:0
> I0223 04:54:28.919756 819 leveldb.cpp:341] Persisting action (18 bytes) to
> leveldb took 40.9514ms
> I0223 04:54:28.919894 819 leveldb.cpp:399] Deleting ~1 keys from leveldb
> took 63505ns
> I0223 04:54:28.919924 819 replica.cpp:712] Persisted action at 2
> I0223 04:54:28.919960 819 replica.cpp:697] Replica learned TRUNCATE action
> at position 2
> I0223 04:54:28.923907 786 containerizer.cpp:149] Using isolation:
> posix/cpu,posix/mem,filesystem/posix
> W0223 04:54:28.924551 786 backend.cpp:48] Failed to create 'bind' backend:
> BindBackend requires root privileges
> I0223 04:54:28.927949 814 slave.cpp:193] Slave started on 172.17.0.2:44225
> I0223 04:54:28.927978 814 slave.cpp:194] Flags at startup:
> --appc_simple_discovery_uri_prefix="http://"
> --appc_store_dir="/tmp/mesos/store/appc" --authenticatee="crammd5"
> --cgroups_cpu_enable_pids_and_tids_count="false" --cgroups_enable_cfs="false"
> --cgroups_hierarchy="/sys/fs/cgroup" --cgroups_limit_swap="false"
> --cgroups_root="mesos" --container_disk_watch_interval="15secs"
> --containerizers="mesos"
> --credential="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/credential"
> --default_role="*" --disk_watch_interval="1mins" --docker="docker"
> --docker_auth_server="https://auth.docker.io" --docker_kill_orphans="true"
> --docker_puller_timeout="60" --docker_registry="https://registry-1.docker.io"
> --docker_remove_delay="6hrs" --docker_socket="/var/run/docker.sock"
> --docker_stop_timeout="0ns" --docker_store_dir="/tmp/mesos/store/docker"
> --enforce_container_disk_quota="false"
> --executor_registration_timeout="1mins"
> --executor_shutdown_grace_period="5secs"
> --fetcher_cache_dir="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/fetch"
> --fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks"
> --gc_disk_headroom="0.1" --hadoop_home="" --help="false"
> --hostname_lookup="true" --image_provisioner_backend="copy"
> --initialize_driver_logging="true" --isolation="posix/cpu,posix/mem"
> --launcher_dir="/mesos/mesos-0.28.0/_build/src" --logbufsecs="0"
> --logging_level="INFO" --oversubscribed_resources_interval="15secs"
> --perf_duration="10secs" --perf_interval="1mins"
> --qos_correction_interval_min="0ns" --quiet="false" --recover="reconnect"
> --recovery_timeout="15mins" --registration_backoff_factor="10ms"
> --resources="cpus:2;mem:1024;disk:1024;ports:[31000-32000]"
> --revocable_cpu_low_priority="true" --sandbox_directory="/mnt/mesos/sandbox"
> --strict="true" --switch_user="true" --systemd_enable_support="true"
> --systemd_runtime_directory="/run/systemd/system" --version="false"
> --work_dir="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU"
> I0223 04:54:28.928400 814 credentials.hpp:83] Loading credential for
> authentication from
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/credential'
> I0223 04:54:28.928591 814 slave.cpp:324] Slave using credential for:
> test-principal
> I0223 04:54:28.928771 814 resources.cpp:576] Parsing resources as JSON
> failed: cpus:2;mem:1024;disk:1024;ports:[31000-32000]
> Trying semicolon-delimited string format instead
> I0223 04:54:28.929157 814 slave.cpp:464] Slave resources: cpus(*):2;
> mem(*):1024; disk(*):1024; ports(*):[31000-32000]
> I0223 04:54:28.929220 814 slave.cpp:472] Slave attributes: [ ]
> I0223 04:54:28.929239 814 slave.cpp:477] Slave hostname: 5847d87ad902
> I0223 04:54:28.929936 786 sched.cpp:222] Version: 0.28.0
> I0223 04:54:28.930371 817 state.cpp:58] Recovering state from
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta'
> I0223 04:54:28.930495 819 sched.cpp:326] New master detected at
> master@172.17.0.2:44225
> I0223 04:54:28.930583 819 sched.cpp:382] Authenticating with master
> master@172.17.0.2:44225
> I0223 04:54:28.930600 819 sched.cpp:389] Using default CRAM-MD5
> authenticatee
> I0223 04:54:28.930764 816 status_update_manager.cpp:200] Recovering status
> update manager
> I0223 04:54:28.930806 807 authenticatee.cpp:121] Creating new client SASL
> connection
> I0223 04:54:28.930994 819 containerizer.cpp:407] Recovering containerizer
> I0223 04:54:28.931043 809 master.cpp:5526] Authenticating
> [email protected]:44225
> I0223 04:54:28.931161 813 authenticator.cpp:413] Starting authentication
> session for crammd5_authenticatee(657)@172.17.0.2:44225
> I0223 04:54:28.931468 808 authenticator.cpp:98] Creating new server SASL
> connection
> I0223 04:54:28.931641 817 authenticatee.cpp:212] Received SASL
> authentication mechanisms: CRAM-MD5
> I0223 04:54:28.931665 817 authenticatee.cpp:238] Attempting to authenticate
> with mechanism 'CRAM-MD5'
> I0223 04:54:28.931816 809 authenticator.cpp:203] Received SASL
> authentication start
> I0223 04:54:28.931879 809 authenticator.cpp:325] Authentication requires
> more steps
> I0223 04:54:28.931993 809 authenticatee.cpp:258] Received SASL
> authentication step
> I0223 04:54:28.932368 819 authenticator.cpp:231] Received SASL
> authentication step
> I0223 04:54:28.932402 819 auxprop.cpp:107] Request to lookup properties for
> user: 'test-principal' realm: '5847d87ad902' server FQDN: '5847d87ad902'
> SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false
> SASL_AUXPROP_AUTHZID: false
> I0223 04:54:28.932435 819 auxprop.cpp:179] Looking up auxiliary property
> '*userPassword'
> I0223 04:54:28.932478 819 auxprop.cpp:179] Looking up auxiliary property
> '*cmusaslsecretCRAM-MD5'
> I0223 04:54:28.932500 819 auxprop.cpp:107] Request to lookup properties for
> user: 'test-principal' realm: '5847d87ad902' server FQDN: '5847d87ad902'
> SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false
> SASL_AUXPROP_AUTHZID: true
> I0223 04:54:28.932514 819 auxprop.cpp:129] Skipping auxiliary property
> '*userPassword' since SASL_AUXPROP_AUTHZID == true
> I0223 04:54:28.932517 812 provisioner.cpp:245] Provisioner recovery complete
> I0223 04:54:28.932526 819 auxprop.cpp:129] Skipping auxiliary property
> '*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
> I0223 04:54:28.932543 819 authenticator.cpp:317] Authentication success
> I0223 04:54:28.932616 813 authenticatee.cpp:298] Authentication success
> I0223 04:54:28.932663 805 master.cpp:5556] Successfully authenticated
> principal 'test-principal' at
> [email protected]:44225
> I0223 04:54:28.932965 820 sched.cpp:471] Successfully authenticated with
> master master@172.17.0.2:44225
> I0223 04:54:28.932994 820 sched.cpp:776] Sending SUBSCRIBE call to
> master@172.17.0.2:44225
> I0223 04:54:28.933004 816 authenticator.cpp:431] Authentication session
> cleanup for crammd5_authenticatee(657)@172.17.0.2:44225
> I0223 04:54:28.933028 814 slave.cpp:4565] Finished recovery
> I0223 04:54:28.933128 820 sched.cpp:809] Will retry registration in
> 313.903761ms if necessary
> I0223 04:54:28.933233 818 master.cpp:2280] Received SUBSCRIBE call for
> framework 'default' at
> [email protected]:44225
> I0223 04:54:28.933284 818 master.cpp:1751] Authorizing framework principal
> 'test-principal' to receive offers for role '*'
> I0223 04:54:28.933480 814 slave.cpp:4737] Querying resource estimator for
> oversubscribable resources
> I0223 04:54:28.933559 806 master.cpp:2351] Subscribing framework default
> with checkpointing enabled and capabilities [ ]
> I0223 04:54:28.933697 810 status_update_manager.cpp:174] Pausing sending
> status updates
> I0223 04:54:28.933701 805 slave.cpp:796] New master detected at
> master@172.17.0.2:44225
> I0223 04:54:28.933771 805 slave.cpp:859] Authenticating with master
> master@172.17.0.2:44225
> I0223 04:54:28.933790 805 slave.cpp:864] Using default CRAM-MD5
> authenticatee
> I0223 04:54:28.933914 805 slave.cpp:832] Detecting new master
> I0223 04:54:28.933933 818 authenticatee.cpp:121] Creating new client SASL
> connection
> I0223 04:54:28.933984 820 hierarchical.cpp:265] Added framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:28.934046 805 slave.cpp:4751] Received oversubscribable
> resources from the resource estimator
> I0223 04:54:28.934058 820 hierarchical.cpp:1434] No resources available to
> allocate!
> I0223 04:54:28.934094 820 hierarchical.cpp:1529] No inverse offers to send
> out!
> I0223 04:54:28.934118 819 sched.cpp:703] Framework registered with
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:28.934123 820 hierarchical.cpp:1127] Performed allocation for 0
> slaves in 115269ns
> I0223 04:54:28.934130 814 master.cpp:5526] Authenticating
> [email protected]:44225
> I0223 04:54:28.934165 819 sched.cpp:717] Scheduler::registered took 22856ns
> I0223 04:54:28.934262 819 authenticator.cpp:413] Starting authentication
> session for crammd5_authenticatee(658)@172.17.0.2:44225
> I0223 04:54:28.934466 805 authenticator.cpp:98] Creating new server SASL
> connection
> I0223 04:54:28.934640 816 authenticatee.cpp:212] Received SASL
> authentication mechanisms: CRAM-MD5
> I0223 04:54:28.934672 816 authenticatee.cpp:238] Attempting to authenticate
> with mechanism 'CRAM-MD5'
> I0223 04:54:28.934762 814 authenticator.cpp:203] Received SASL
> authentication start
> I0223 04:54:28.934816 814 authenticator.cpp:325] Authentication requires
> more steps
> I0223 04:54:28.934908 816 authenticatee.cpp:258] Received SASL
> authentication step
> I0223 04:54:28.935014 811 authenticator.cpp:231] Received SASL
> authentication step
> I0223 04:54:28.935047 811 auxprop.cpp:107] Request to lookup properties for
> user: 'test-principal' realm: '5847d87ad902' server FQDN: '5847d87ad902'
> SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false
> SASL_AUXPROP_AUTHZID: false
> I0223 04:54:28.935060 811 auxprop.cpp:179] Looking up auxiliary property
> '*userPassword'
> I0223 04:54:28.935083 811 auxprop.cpp:179] Looking up auxiliary property
> '*cmusaslsecretCRAM-MD5'
> I0223 04:54:28.935102 811 auxprop.cpp:107] Request to lookup properties for
> user: 'test-principal' realm: '5847d87ad902' server FQDN: '5847d87ad902'
> SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false
> SASL_AUXPROP_AUTHZID: true
> I0223 04:54:28.935111 811 auxprop.cpp:129] Skipping auxiliary property
> '*userPassword' since SASL_AUXPROP_AUTHZID == true
> I0223 04:54:28.935117 811 auxprop.cpp:129] Skipping auxiliary property
> '*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
> I0223 04:54:28.935128 811 authenticator.cpp:317] Authentication success
> I0223 04:54:28.935279 806 authenticatee.cpp:298] Authentication success
> I0223 04:54:28.935315 818 master.cpp:5556] Successfully authenticated
> principal 'test-principal' at [email protected]:44225
> I0223 04:54:28.935340 819 authenticator.cpp:431] Authentication session
> cleanup for crammd5_authenticatee(658)@172.17.0.2:44225
> I0223 04:54:28.935611 817 slave.cpp:927] Successfully authenticated with
> master master@172.17.0.2:44225
> I0223 04:54:28.935791 817 slave.cpp:1321] Will retry registration in
> 15.895885ms if necessary
> I0223 04:54:28.935920 808 master.cpp:4240] Registering slave at
> [email protected]:44225 (5847d87ad902) with id
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0
> I0223 04:54:28.936285 819 registrar.cpp:439] Applied 1 operations in
> 53091ns; attempting to update the 'registry'
> I0223 04:54:28.936992 818 log.cpp:683] Attempting to append 339 bytes to
> the log
> I0223 04:54:28.937113 805 coordinator.cpp:348] Coordinator attempting to
> write APPEND action at position 3
> I0223 04:54:28.937728 816 replica.cpp:537] Replica received write request
> for position 3 from (9614)@172.17.0.2:44225
> I0223 04:54:28.952417 807 slave.cpp:1321] Will retry registration in
> 35.446972ms if necessary
> I0223 04:54:28.952584 814 master.cpp:4228] Ignoring register slave message
> from [email protected]:44225 (5847d87ad902) as admission is already in progress
> I0223 04:54:28.981781 816 leveldb.cpp:341] Persisting action (358 bytes) to
> leveldb took 43.997431ms
> I0223 04:54:28.981883 816 replica.cpp:712] Persisted action at 3
> I0223 04:54:28.982956 816 replica.cpp:691] Replica received learned notice
> for position 3 from @0.0.0.0:0
> I0223 04:54:28.989023 818 slave.cpp:1321] Will retry registration in
> 52.677739ms if necessary
> I0223 04:54:28.989296 818 master.cpp:4228] Ignoring register slave message
> from [email protected]:44225 (5847d87ad902) as admission is already in progress
> I0223 04:54:29.032562 816 leveldb.cpp:341] Persisting action (360 bytes) to
> leveldb took 49.562387ms
> I0223 04:54:29.032641 816 replica.cpp:712] Persisted action at 3
> I0223 04:54:29.032675 816 replica.cpp:697] Replica learned APPEND action at
> position 3
> I0223 04:54:29.034543 807 registrar.cpp:484] Successfully updated the
> 'registry' in 98.188032ms
> I0223 04:54:29.034868 818 log.cpp:702] Attempting to truncate the log to 3
> I0223 04:54:29.035054 819 coordinator.cpp:348] Coordinator attempting to
> write TRUNCATE action at position 4
> I0223 04:54:29.035475 806 slave.cpp:3482] Received ping from
> slave-observer(278)@172.17.0.2:44225
> I0223 04:54:29.035616 820 master.cpp:4308] Registered slave
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
> (5847d87ad902) with cpus(*):2; mem(*):1024; disk(*):1024;
> ports(*):[31000-32000]
> I0223 04:54:29.035703 809 slave.cpp:971] Registered with master
> master@172.17.0.2:44225; given slave ID
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0
> I0223 04:54:29.035728 809 fetcher.cpp:81] Clearing fetcher cache
> I0223 04:54:29.035836 814 hierarchical.cpp:473] Added slave
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 (5847d87ad902) with cpus(*):2;
> mem(*):1024; disk(*):1024; ports(*):[31000-32000] (allocated: )
> I0223 04:54:29.035892 815 status_update_manager.cpp:181] Resuming sending
> status updates
> I0223 04:54:29.035941 819 replica.cpp:537] Replica received write request
> for position 4 from (9615)@172.17.0.2:44225
> I0223 04:54:29.036110 809 slave.cpp:994] Checkpointing SlaveInfo to
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/slave.info'
> I0223 04:54:29.036485 809 slave.cpp:1030] Forwarding total oversubscribed
> resources
> I0223 04:54:29.036645 813 master.cpp:4649] Received update of slave
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
> (5847d87ad902) with total oversubscribed resources
> I0223 04:54:29.036782 814 hierarchical.cpp:1529] No inverse offers to send
> out!
> I0223 04:54:29.036818 814 hierarchical.cpp:1147] Performed allocation for
> slave 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 in 935779ns
> I0223 04:54:29.037040 813 master.cpp:5355] Sending 1 offers to framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (default) at
> [email protected]:44225
> I0223 04:54:29.037104 814 hierarchical.cpp:531] Slave
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 (5847d87ad902) updated with
> oversubscribed resources (total: cpus(*):2; mem(*):1024; disk(*):1024;
> ports(*):[31000-32000], allocated: cpus(*):2; mem(*):1024; disk(*):1024;
> ports(*):[31000-32000])
> I0223 04:54:29.037240 814 hierarchical.cpp:1434] No resources available to
> allocate!
> I0223 04:54:29.037282 814 hierarchical.cpp:1529] No inverse offers to send
> out!
> I0223 04:54:29.037307 814 hierarchical.cpp:1147] Performed allocation for
> slave 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 in 160900ns
> I0223 04:54:29.037569 809 sched.cpp:873] Scheduler::resourceOffers took
> 195666ns
> I0223 04:54:29.039176 807 master.cpp:3138] Processing ACCEPT call for
> offers: [ 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-O0 ] on slave
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
> (5847d87ad902) for framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> (default) at [email protected]:44225
> I0223 04:54:29.039221 807 master.cpp:2825] Authorizing framework principal
> 'test-principal' to launch task 1 as user 'mesos'
> W0223 04:54:29.040493 807 validation.cpp:404] Executor http for task 1 uses
> less CPUs (None) than the minimum required (0.01). Please update your
> executor, as this will be mandatory in future releases.
> W0223 04:54:29.040529 807 validation.cpp:416] Executor http for task 1 uses
> less memory (None) than the minimum required (32MB). Please update your
> executor, as this will be mandatory in future releases.
> I0223 04:54:29.040904 807 master.hpp:176] Adding task 1 with resources
> cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] on slave
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 (5847d87ad902)
> I0223 04:54:29.041144 807 master.cpp:3623] Launching task 1 of framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (default) at
> [email protected]:44225 with
> resources cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] on
> slave 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
> (5847d87ad902)
> I0223 04:54:29.041574 809 slave.cpp:1361] Got assigned task 1 for framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:29.041755 809 slave.cpp:5287] Checkpointing FrameworkInfo to
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/framework.info'
> I0223 04:54:29.042193 809 slave.cpp:5298] Checkpointing framework pid
> '[email protected]:44225' to
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/framework.pid'
> I0223 04:54:29.043066 809 slave.cpp:1480] Launching task 1 for framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:29.043884 809 paths.cpp:474] Trying to chown
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc'
> to user 'mesos'
> I0223 04:54:29.055244 809 slave.cpp:5739] Checkpointing ExecutorInfo to
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/executor.info'
> I0223 04:54:29.055868 809 slave.cpp:5367] Launching executor http of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 with resources in work
> directory
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc'
> I0223 04:54:29.056241 820 containerizer.cpp:666] Starting container
> 'a43cbeed-f390-46aa-b7a1-67770a3dd4fc' for executor 'http' of framework
> '81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000'
> I0223 04:54:29.056298 809 slave.cpp:5762] Checkpointing TaskInfo to
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/tasks/1/task.info'
> I0223 04:54:29.056675 809 slave.cpp:1698] Queuing task '1' for executor
> 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:29.056779 809 slave.cpp:749] Successfully attached file
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc'
> I0223 04:54:29.061602 815 launcher.cpp:147] Forked child with pid '6997'
> for container 'a43cbeed-f390-46aa-b7a1-67770a3dd4fc'
> I0223 04:54:29.061995 815 containerizer.cpp:1104] Checkpointing executor's
> forked pid 6997 to
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/pids/forked.pid'
> I0223 04:54:29.083487 819 leveldb.cpp:341] Persisting action (16 bytes) to
> leveldb took 47.478443ms
> I0223 04:54:29.083585 819 replica.cpp:712] Persisted action at 4
> I0223 04:54:29.084709 810 replica.cpp:691] Replica received learned notice
> for position 4 from @0.0.0.0:0
> I0223 04:54:29.122370 810 leveldb.cpp:341] Persisting action (18 bytes) to
> leveldb took 37.598534ms
> I0223 04:54:29.122570 810 leveldb.cpp:399] Deleting ~2 keys from leveldb
> took 89419ns
> I0223 04:54:29.122599 810 replica.cpp:712] Persisted action at 4
> I0223 04:54:29.122635 810 replica.cpp:697] Replica learned TRUNCATE action
> at position 4
> I0223 04:54:29.606348 806 hierarchical.cpp:1434] No resources available to
> allocate!
> I0223 04:54:29.606449 806 hierarchical.cpp:1529] No inverse offers to send
> out!
> I0223 04:54:29.606494 806 hierarchical.cpp:1127] Performed allocation for 1
> slaves in 411327ns
> I0223 04:54:30.607574 810 hierarchical.cpp:1434] No resources available to
> allocate!
> I0223 04:54:30.607647 810 hierarchical.cpp:1529] No inverse offers to send
> out!
> I0223 04:54:30.607676 810 hierarchical.cpp:1127] Performed allocation for 1
> slaves in 309149ns
> WARNING: Logging before InitGoogleLogging() is written to STDERR
> I0223 04:54:31.163267 7011 process.cpp:991] libprocess is initialized on
> 172.17.0.2:47892 for 16 cpus
> I0223 04:54:31.164299 7036 logging.cpp:193] Logging to STDERR
> I0223 04:54:31.164322 7036 executor.cpp:172] Version: 0.28.0
> I0223 04:54:31.166379 7045 executor.cpp:316] Connected with the agent
> I0223 04:54:31.167551 7035 executor.cpp:247] Sending SUBSCRIBE call to
> http://172.17.0.2:44225/slave/api/v1/executor
> I0223 04:54:31.176286 808 process.cpp:3141] Handling HTTP event for process
> 'slave' with path: '/slave/api/v1/executor'
> I0223 04:54:31.176759 814 http.cpp:190] HTTP POST for
> /slave/api/v1/executor from 172.17.0.2:42711
> I0223 04:54:31.177016 814 slave.cpp:2476] Received Subscribe request for
> HTTP executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:31.177106 814 slave.cpp:2539] Creating a marker file for HTTP
> based executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> (via HTTP) at path
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/http.marker'
> I0223 04:54:31.179092 813 slave.cpp:1863] Sending queued task '1' to
> executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via
> HTTP)
> I0223 04:54:31.181721 7036 executor.cpp:588] Enqueuing event SUBSCRIBED
> received from http://172.17.0.2:44225/slave/api/v1/executor
> I0223 04:54:31.182701 7035 executor.cpp:588] Enqueuing event LAUNCH received
> from http://172.17.0.2:44225/slave/api/v1/executor
> Received a SUBSCRIBED event
> Starting task 1
> Finishing task 1
> I0223 04:54:31.185159 7040 executor.cpp:247] Sending UPDATE call to
> http://172.17.0.2:44225/slave/api/v1/executor
> I0223 04:54:31.185442 7040 executor.cpp:247] Sending UPDATE call to
> http://172.17.0.2:44225/slave/api/v1/executor
> I0223 04:54:31.187504 811 slave.cpp:668] Slave terminating
> I0223 04:54:31.187763 819 master.cpp:1174] Slave
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
> (5847d87ad902) disconnected
> I0223 04:54:31.187794 819 master.cpp:2635] Disconnecting slave
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
> (5847d87ad902)
> I0223 04:54:31.187850 819 master.cpp:2654] Deactivating slave
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 at [email protected]:44225
> (5847d87ad902)
> I0223 04:54:31.187969 809 hierarchical.cpp:560] Slave
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0 deactivated
> I0223 04:54:31.188148 7037 executor.cpp:586] Enqueuing locally injected
> event ERROR
> Received an ERROR event
> I0223 04:54:31.189012 7044 executor.cpp:586] Enqueuing locally injected
> event ERROR
> Received an ERROR event
> E0223 04:54:31.189673 7044 executor.cpp:553] End-Of-File received from
> agent. The agent closed the event stream
> I0223 04:54:31.189709 7044 executor.cpp:357] Disconnected from agent:
> End-Of-File received from agent. The agent closed the event stream
> I0223 04:54:31.190441 7044 executor.cpp:411] Will retry connecting with the
> agent again in 1.548772469secs
> I0223 04:54:31.191680 786 containerizer.cpp:149] Using isolation:
> posix/cpu,posix/mem,filesystem/posix
> W0223 04:54:31.192128 786 backend.cpp:48] Failed to create 'bind' backend:
> BindBackend requires root privileges
> I0223 04:54:31.192425 7033 executor.cpp:316] Connected with the agent
> I0223 04:54:31.193727 7034 executor.cpp:247] Sending SUBSCRIBE call to
> http://172.17.0.2:44225/slave/api/v1/executor
> I0223 04:54:31.194473 821 process.cpp:2409] Returning '404 Not Found' for
> '/slave/api/v1/executor'
> W0223 04:54:31.195523 7041 executor.cpp:509] Received '404 Not Found' () for
> SUBSCRIBE
> I0223 04:54:31.195714 817 slave.cpp:193] Slave started on 172.17.0.2:44225
> I0223 04:54:31.195780 817 slave.cpp:194] Flags at startup:
> --appc_simple_discovery_uri_prefix="http://"
> --appc_store_dir="/tmp/mesos/store/appc" --authenticatee="crammd5"
> --cgroups_cpu_enable_pids_and_tids_count="false" --cgroups_enable_cfs="false"
> --cgroups_hierarchy="/sys/fs/cgroup" --cgroups_limit_swap="false"
> --cgroups_root="mesos" --container_disk_watch_interval="15secs"
> --containerizers="mesos"
> --credential="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/credential"
> --default_role="*" --disk_watch_interval="1mins" --docker="docker"
> --docker_auth_server="https://auth.docker.io" --docker_kill_orphans="true"
> --docker_puller_timeout="60" --docker_registry="https://registry-1.docker.io"
> --docker_remove_delay="6hrs" --docker_socket="/var/run/docker.sock"
> --docker_stop_timeout="0ns" --docker_store_dir="/tmp/mesos/store/docker"
> --enforce_container_disk_quota="false"
> --executor_registration_timeout="1mins"
> --executor_shutdown_grace_period="5secs"
> --fetcher_cache_dir="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/fetch"
> --fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks"
> --gc_disk_headroom="0.1" --hadoop_home="" --help="false"
> --hostname_lookup="true" --image_provisioner_backend="copy"
> --initialize_driver_logging="true" --isolation="posix/cpu,posix/mem"
> --launcher_dir="/mesos/mesos-0.28.0/_build/src" --logbufsecs="0"
> --logging_level="INFO" --oversubscribed_resources_interval="15secs"
> --perf_duration="10secs" --perf_interval="1mins"
> --qos_correction_interval_min="0ns" --quiet="false" --recover="reconnect"
> --recovery_timeout="15mins" --registration_backoff_factor="10ms"
> --resources="cpus:2;mem:1024;disk:1024;ports:[31000-32000]"
> --revocable_cpu_low_priority="true" --sandbox_directory="/mnt/mesos/sandbox"
> --strict="true" --switch_user="true" --systemd_enable_support="true"
> --systemd_runtime_directory="/run/systemd/system" --version="false"
> --work_dir="/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU"
> I0223 04:54:31.196280 817 credentials.hpp:83] Loading credential for
> authentication from
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/credential'
> I0223 04:54:31.196491 817 slave.cpp:324] Slave using credential for:
> test-principal
> I0223 04:54:31.196655 817 resources.cpp:576] Parsing resources as JSON
> failed: cpus:2;mem:1024;disk:1024;ports:[31000-32000]
> Trying semicolon-delimited string format instead
> I0223 04:54:31.197144 817 slave.cpp:464] Slave resources: cpus(*):2;
> mem(*):1024; disk(*):1024; ports(*):[31000-32000]
> I0223 04:54:31.197226 817 slave.cpp:472] Slave attributes: [ ]
> I0223 04:54:31.197242 817 slave.cpp:477] Slave hostname: 5847d87ad902
> I0223 04:54:31.198366 816 state.cpp:58] Recovering state from
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta'
> I0223 04:54:31.198439 816 state.cpp:698] No checkpointed resources found at
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/resources/resources.info'
> W0223 04:54:31.200115 816 state.cpp:607] Failed to find status updates file
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/tasks/1/task.updates'
> I0223 04:54:31.202293 819 fetcher.cpp:81] Clearing fetcher cache
> I0223 04:54:31.202379 819 slave.cpp:4653] Recovering framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:31.202507 819 slave.cpp:5476] Recovering executor 'http' of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:31.203364 810 status_update_manager.cpp:200] Recovering status
> update manager
> I0223 04:54:31.203483 810 status_update_manager.cpp:208] Recovering
> executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> W0223 04:54:31.203572 810 status_update_manager.cpp:247] No updates found
> for task 1 of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:31.203832 819 slave.cpp:749] Successfully attached file
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc'
> I0223 04:54:31.204326 820 containerizer.cpp:407] Recovering containerizer
> I0223 04:54:31.204448 820 containerizer.cpp:462] Recovering container
> 'a43cbeed-f390-46aa-b7a1-67770a3dd4fc' for executor 'http' of framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:31.206951 820 provisioner.cpp:245] Provisioner recovery complete
> I0223 04:54:31.208343 806 slave.cpp:4512] Waiting for executor 'http' of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via HTTP) to subscribe
> I0223 04:54:31.608290 815 hierarchical.cpp:1434] No resources available to
> allocate!
> I0223 04:54:31.608355 815 hierarchical.cpp:1529] No inverse offers to send
> out!
> I0223 04:54:31.608387 815 hierarchical.cpp:1127] Performed allocation for 1
> slaves in 252147ns
> I0223 04:54:32.169790 7044 executor.cpp:247] Sending SUBSCRIBE call to
> http://172.17.0.2:44225/slave/api/v1/executor
> I0223 04:54:32.172236 807 process.cpp:3141] Handling HTTP event for process
> 'slave' with path: '/slave/api/v1/executor'
> I0223 04:54:32.173068 807 http.cpp:190] HTTP POST for
> /slave/api/v1/executor from 172.17.0.2:42713
> I0223 04:54:32.173709 807 slave.cpp:2476] Received Subscribe request for
> HTTP executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> (via HTTP)
> I0223 04:54:32.173815 807 slave.cpp:2539] Creating a marker file for HTTP
> based executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> (via HTTP) at path
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/http.marker'
> I0223 04:54:32.194341 7044 executor.cpp:247] Sending SUBSCRIBE call to
> http://172.17.0.2:44225/slave/api/v1/executor
> I0223 04:54:32.196682 807 slave.cpp:3002] Handling status update
> TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.196950 807 slave.cpp:3002] Handling status update
> TASK_FINISHED (UUID: e7010e72-616f-4612-a218-47b26031630c) for task 1 of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.198343 811 process.cpp:3141] Handling HTTP event for process
> 'slave' with path: '/slave/api/v1/executor'
> I0223 04:54:32.198863 808 status_update_manager.cpp:320] Received status
> update TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1
> of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.198905 808 status_update_manager.cpp:497] Creating
> StatusUpdate stream for task 1 of framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.198945 811 http.cpp:190] HTTP POST for
> /slave/api/v1/executor from 172.17.0.2:42713
> I0223 04:54:32.199236 811 slave.cpp:2476] Received Subscribe request for
> HTTP executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> (via HTTP)
> W0223 04:54:32.199256 811 slave.cpp:2518] Closing already existing HTTP
> connection from executor 'http' of framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via HTTP)
> I0223 04:54:32.199362 811 slave.cpp:2539] Creating a marker file for HTTP
> based executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> (via HTTP) at path
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/http.marker'
> I0223 04:54:32.199448 808 status_update_manager.cpp:824] Checkpointing
> UPDATE for status update TASK_RUNNING (UUID:
> 848bc119-6693-4601-a8d4-4211db292639) for task 1 of framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.199654 811 slave.cpp:3002] Handling status update
> TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.199822 811 slave.cpp:3002] Handling status update
> TASK_FINISHED (UUID: e7010e72-616f-4612-a218-47b26031630c) for task 1 of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.200212 7040 executor.cpp:588] Enqueuing event SUBSCRIBED
> received from http://172.17.0.2:44225/slave/api/v1/executor
> I0223 04:54:32.200510 811 slave.cpp:5677] Terminating task 1
> F0223 04:54:32.200620 806 process.cpp:1114] Check failed: items.size() > 0
> E0223 04:54:32.201045 7040 executor.cpp:553] End-Of-File received from
> agent. The agent closed the event stream
> I0223 04:54:32.201124 7040 executor.cpp:357] Disconnected from agent:
> End-Of-File received from agent. The agent closed the event stream
> Received a SUBSCRIBED event
> I0223 04:54:32.201858 7040 executor.cpp:411] Will retry connecting with the
> agent again in 587.356137ms
> E0223 04:54:32.203120 7040 executor.cpp:543] Failed to decode the stream of
> events: Pipe::Reader failure: failed to decode body
> I0223 04:54:32.203341 7031 executor.cpp:316] Connected with the agent
> I0223 04:54:32.204586 7040 executor.cpp:247] Sending SUBSCRIBE call to
> http://172.17.0.2:44225/slave/api/v1/executor
> *** Check failure stack trace: ***
> I0223 04:54:32.205591 820 process.cpp:3141] Handling HTTP event for process
> 'slave' with path: '/slave/api/v1/executor'
> I0223 04:54:32.242341 819 http.cpp:190] HTTP POST for
> /slave/api/v1/executor from 172.17.0.2:42715
> I0223 04:54:32.242702 819 slave.cpp:2476] Received Subscribe request for
> HTTP executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> (via HTTP)
> W0223 04:54:32.242722 819 slave.cpp:2518] Closing already existing HTTP
> connection from executor 'http' of framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 (via HTTP)
> I0223 04:54:32.242769 819 slave.cpp:2539] Creating a marker file for HTTP
> based executor 'http' of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> (via HTTP) at path
> '/tmp/SlaveRecoveryTest_0_ReconnectHTTPExecutor_vAcZBU/meta/slaves/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-S0/frameworks/81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000/executors/http/runs/a43cbeed-f390-46aa-b7a1-67770a3dd4fc/http.marker'
> @ 0x2b5689fe3a4e google::LogMessage::Fail()
> @ 0x2b5689fe399a google::LogMessage::SendToLog()
> I0223 04:54:32.264030 819 slave.cpp:3002] Handling status update
> TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.264382 819 slave.cpp:3002] Handling status update
> TASK_FINISHED (UUID: e7010e72-616f-4612-a218-47b26031630c) for task 1 of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> @ 0x2b5689fe339c google::LogMessage::Flush()
> I0223 04:54:32.268173 7045 executor.cpp:588] Enqueuing event SUBSCRIBED
> received from http://172.17.0.2:44225/slave/api/v1/executor
> Received a SUBSCRIBED event
> @ 0x2b5689fe62b0 google::LogMessageFatal::~LogMessageFatal()
> @ 0x2b5689f3b512 process::HttpProxy::waited()
> @ 0x2b5689f68821
> _ZZN7process8dispatchINS_9HttpProxyERKNS_6FutureINS_4http8ResponseEEES5_EEvRKNS_3PIDIT_EEMS9_FvT0_ET1_ENKUlPNS_11ProcessBaseEE_clESI_
> @ 0x2b5689f84c72
> _ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchINS0_9HttpProxyERKNS0_6FutureINS0_4http8ResponseEEES9_EEvRKNS0_3PIDIT_EEMSD_FvT0_ET1_EUlS2_E_E9_M_invokeERKSt9_Any_dataS2_
> @ 0x2b5689f62b93 std::function<>::operator()()
> @ 0x2b5689f48b97 process::ProcessBase::visit()
> @ 0x2b5689f4ec3e process::DispatchEvent::visit()
> @ 0x999aec process::ProcessBase::serve()
> @ 0x2b5689f44d7a process::ProcessManager::resume()
> @ 0x2b5689f41e32
> _ZZN7process14ProcessManager12init_threadsEvENKUlRKSt11atomic_boolE_clES3_
> @ 0x2b5689f4e42a
> _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS3_EEE6__callIvIEILm0EEEET_OSt5tupleIIDpT0_EESt12_Index_tupleIIXspT1_EEE
> @ 0x2b5689f4e3da
> _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS3_EEEclIIEvEET0_DpOT_
> I0223 04:54:32.322123 808 status_update_manager.cpp:374] Forwarding update
> TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 to the slave
> @ 0x2b5689f4e36c
> _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS4_EEEvEE9_M_invokeIIEEEvSt12_Index_tupleIIXspT_EEE
> W0223 04:54:32.322547 807 slave.cpp:3346] Dropping status update
> TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1 of
> framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000 sent by status update
> manager because the slave is in RECOVERING state
> I0223 04:54:32.322633 807 slave.cpp:3294] Status update manager
> successfully handled status update TASK_RUNNING (UUID:
> 848bc119-6693-4601-a8d4-4211db292639) for task 1 of framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.322684 808 status_update_manager.cpp:320] Received status
> update TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for task 1
> of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> W0223 04:54:32.322749 808 status_update_manager.cpp:729] Ignoring duplicate
> status update TASK_RUNNING (UUID: 848bc119-6693-4601-a8d4-4211db292639) for
> task 1 of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.322932 808 status_update_manager.cpp:320] Received status
> update TASK_FINISHED (UUID: e7010e72-616f-4612-a218-47b26031630c) for task 1
> of framework 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.322988 808 status_update_manager.cpp:824] Checkpointing
> UPDATE for status update TASK_FINISHED (UUID:
> e7010e72-616f-4612-a218-47b26031630c) for task 1 of framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> I0223 04:54:32.322988 807 slave.cpp:3294] Status update manager
> successfully handled status update TASK_RUNNING (UUID:
> 848bc119-6693-4601-a8d4-4211db292639) for task 1 of framework
> 81a295fc-fe1b-4ff8-9291-cd54f5c6f303-0000
> {code}
> This crash was also observed on a recent Arch Linux VM (VirtualBox) while
> {{stress --cpu 4}} was running concurrently. Reproduced with {{./src/mesos-tests
> --gtest_filter="SlaveRecovery*" --gtest_repeat=100
> --gtest_break_on_failure}}; it took about 20 iterations to trigger a crash.
> {noformat}
> [ RUN ] SlaveRecoveryTest/0.ReconnectHTTPExecutor
> I0223 09:38:55.434953 11158 executor.cpp:172] Version: 0.28.0
> Received a SUBSCRIBED event
> Starting task 1
> Finishing task 1
> Received an ERROR event
> Received an ERROR event
> E0223 09:38:55.504820 11159 executor.cpp:553] End-Of-File received from
> agent. The agent closed the event stream
> Received an ERROR event
> Received an ERROR event
> Received an ERROR event
> F0223 09:39:00.535778 22159 process.cpp:1114] Check failed: items.size() > 0
> *** Check failure stack trace: ***
> Received an ERROR event
> Received an ERROR event
> @ 0x7f4affd0e754 google::LogMessage::Fail()
> Received an ERROR event
> Received an ERROR event
> Received an ERROR event
> Received an ERROR event
> @ 0x7f4affd0e6ad google::LogMessage::SendToLog()
> @ 0x7f4affd0e0a3 google::LogMessage::Flush()
> @ 0x7f4affd10f14 google::LogMessageFatal::~LogMessageFatal()
> @ 0x7f4affc618d4 process::HttpProxy::waited()
> @ 0x7f4affc8f57f
> _ZZN7process8dispatchINS_9HttpProxyERKNS_6FutureINS_4http8ResponseEEES5_EEvRKNS_3PIDIT_EEMS9_FvT0_ET1_ENKUlPNS_11ProcessBaseEE_clESI_
> @ 0x7f4affcac946
> _ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchINS0_9HttpProxyERKNS0_6FutureINS0_4http8ResponseEEES9_EEvRKNS0_3PIDIT_EEMSD_FvT0_ET1_EUlS2_E_E9_M_invokeERKSt9_Any_dataOS2_
> @ 0x7f4affc89961 std::function<>::operator()()
> @ 0x7f4affc6ef02 process::ProcessBase::visit()
> @ 0x7f4affc74e52 process::DispatchEvent::visit()
> @ 0xa3afe8 process::ProcessBase::serve()
> @ 0x7f4affc6b073 process::ProcessManager::resume()
> @ 0x7f4affc6813b
> _ZZN7process14ProcessManager12init_threadsEvENKUlRKSt6atomicIbEE_clES4_
> @ 0x7f4affc745fa
> _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS4_EEE6__callIvJEJLm0EEEET_OSt5tupleIJDpT0_EESt12_Index_tupleIJXspT1_EEE
> @ 0x7f4affc745a8
> _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS4_EEEclIJEvEET0_DpOT_
> @ 0x7f4affc74556
> _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS5_EEEvEE9_M_invokeIJEEEvSt12_Index_tupleIJXspT_EEE
> @ 0x7f4affc744bf
> _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS5_EEEvEEclEv
> @ 0x7f4affc7445e
> _ZNSt6thread5_ImplISt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt6atomicIbEE_St17reference_wrapperIS7_EEEvEEE6_M_runEv
> @ 0x7f4afa6ddc40 execute_native_thread_routine
> @ 0x7f4afadba424 start_thread
> @ 0x7f4af9e50cbd __clone
> @ (nil) (unknown)
> Aborted (core dumped)
> {noformat}