Benjamin Bannier created MESOS-7111:
---------------------------------------
Summary:
HttpFaultToleranceTest.SchedulerFailoverFrameworkToExecutorMessage segfaults
Key: MESOS-7111
URL: https://issues.apache.org/jira/browse/MESOS-7111
Project: Mesos
Issue Type: Bug
Components: test
Affects Versions: 1.2.0
Environment: ubuntu-16
Reporter: Benjamin Bannier
We observed a segfault in
{{HttpFaultToleranceTest.SchedulerFailoverFrameworkToExecutorMessage}} in
internal CI on an ubuntu16 machine. Note that ubuntu16 uses gcc-6.
{code}
[ RUN ] HttpFaultToleranceTest.SchedulerFailoverFrameworkToExecutorMessage
I0210 02:47:31.260174 19578 cluster.cpp:160] Creating default 'local' authorizer
I0210 02:47:31.261225 19597 master.cpp:383] Master
d8129420-2a04-48e7-9b28-6b0a0af73168 (ip-10-150-111-24.ec2.internal) started on
10.150.111.24:33608
I0210 02:47:31.261281 19597 master.cpp:385] Flags at startup: --acls=""
--agent_ping_timeout="15secs" --agent_reregister_timeout="10mins"
--allocation_interval="1secs" --allocator="HierarchicalDRF"
--authenticate_agents="true" --authenticate_frameworks="false"
--authenticate_http_frameworks="true" --authenticate_http_readonly="true"
--authenticate_http_readwrite="true" --authenticators="crammd5"
--authorizers="local" --credentials="/tmp/fBrqHi/credentials"
--framework_sorter="drf" --help="false" --hostname_lookup="true"
--http_authenticators="basic" --http_framework_authenticators="basic"
--initialize_driver_logging="true" --log_auto_initialize="true"
--logbufsecs="0" --logging_level="INFO" --max_agent_ping_timeouts="5"
--max_completed_frameworks="50" --max_completed_tasks_per_framework="1000"
--max_unreachable_tasks_per_framework="1000" --quiet="false"
--recovery_agent_removal_limit="100%" --registry="in_memory"
--registry_fetch_timeout="1mins" --registry_gc_interval="15mins"
--registry_max_agent_age="2weeks" --registry_max_agent_count="102400"
--registry_store_timeout="100secs" --registry_strict="false"
--root_submissions="true" --user_sorter="drf" --version="false"
--webui_dir="/usr/local/share/mesos/webui" --work_dir="/tmp/fBrqHi/master"
--zk_session_timeout="10secs"
I0210 02:47:31.261404 19597 master.cpp:437] Master allowing unauthenticated
frameworks to register
I0210 02:47:31.261411 19597 master.cpp:449] Master only allowing authenticated
agents to register
I0210 02:47:31.261415 19597 master.cpp:462] Master only allowing authenticated
HTTP frameworks to register
I0210 02:47:31.261420 19597 credentials.hpp:37] Loading credentials for
authentication from '/tmp/fBrqHi/credentials'
I0210 02:47:31.261488 19597 master.cpp:507] Using default 'crammd5'
authenticator
I0210 02:47:31.261530 19597 http.cpp:919] Using default 'basic' HTTP
authenticator for realm 'mesos-master-readonly'
I0210 02:47:31.261591 19597 http.cpp:919] Using default 'basic' HTTP
authenticator for realm 'mesos-master-readwrite'
I0210 02:47:31.261631 19597 http.cpp:919] Using default 'basic' HTTP
authenticator for realm 'mesos-master-scheduler'
I0210 02:47:31.261698 19597 master.cpp:587] Authorization enabled
I0210 02:47:31.261754 19601 whitelist_watcher.cpp:77] No whitelist given
I0210 02:47:31.261754 19602 hierarchical.cpp:161] Initialized hierarchical
allocator process
I0210 02:47:31.262462 19597 master.cpp:2124] Elected as the leading master!
I0210 02:47:31.262482 19597 master.cpp:1646] Recovering from registrar
I0210 02:47:31.262545 19603 registrar.cpp:329] Recovering registrar
I0210 02:47:31.262774 19602 registrar.cpp:362] Successfully fetched the
registry (0B) in 201984ns
I0210 02:47:31.262809 19602 registrar.cpp:461] Applied 1 operations in 2963ns;
attempting to update the registry
I0210 02:47:31.263062 19599 registrar.cpp:506] Successfully updated the
registry in 214016ns
I0210 02:47:31.263119 19599 registrar.cpp:392] Successfully recovered registrar
I0210 02:47:31.263267 19597 master.cpp:1762] Recovered 0 agents from the
registry (172B); allowing 10mins for agents to re-register
I0210 02:47:31.263295 19598 hierarchical.cpp:188] Skipping recovery of
hierarchical allocator: nothing to recover
I0210 02:47:31.264645 19578 cluster.cpp:446] Creating default 'local' authorizer
I0210 02:47:31.265029 19598 slave.cpp:211] Mesos agent started on
(105)@10.150.111.24:33608
I0210 02:47:31.265187 19578 scheduler.cpp:184] Version: 1.3.0
I0210 02:47:31.265043 19598 slave.cpp:212] Flags at startup: --acls=""
--appc_simple_discovery_uri_prefix="http://"
--appc_store_dir="/tmp/mesos/store/appc" --authenticate_http_readonly="true"
--authenticate_http_readwrite="true" --authenticatee="crammd5"
--authentication_backoff_factor="1secs" --authorizer="local"
--cgroups_cpu_enable_pids_and_tids_count="false" --cgroups_enable_cfs="false"
--cgroups_hierarchy="/sys/fs/cgroup" --cgroups_limit_swap="false"
--cgroups_root="mesos" --container_disk_watch_interval="15secs"
--containerizers="mesos"
--credential="/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W/credential"
--default_role="*" --disk_watch_interval="1mins" --docker="docker"
--docker_kill_orphans="true" --docker_registry="https://registry-1.docker.io"
--docker_remove_delay="6hrs" --docker_socket="/var/run/docker.sock"
--docker_stop_timeout="0ns" --docker_store_dir="/tmp/mesos/store/docker"
--docker_volume_checkpoint_dir="/var/run/mesos/isolators/docker/volume"
--enforce_container_disk_quota="false" --executor_registration_timeout="1mins"
--executor_shutdown_grace_period="5secs"
--fetcher_cache_dir="/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W/fetch"
--fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks"
--gc_disk_headroom="0.1" --hadoop_home="" --help="false"
--hostname_lookup="true" --http_authenticators="basic"
--http_command_executor="false"
--http_credentials="/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W/http_credentials"
--http_heartbeat_interval="30secs" --initialize_driver_logging="true"
--isolation="posix/cpu,posix/mem" --launcher="linux"
--launcher_dir="/home/ubuntu/workspace/mesos/Mesos_CI-build/FLAG/Plain/label/mesos-ec2-ubuntu-16.04/mesos/build/src"
--logbufsecs="0" --logging_level="INFO"
--max_completed_executors_per_framework="150"
--oversubscribed_resources_interval="15secs" --perf_duration="10secs"
--perf_interval="1mins" --qos_correction_interval_min="0ns" --quiet="false"
--recover="reconnect" --recovery_timeout="15mins"
--registration_backoff_factor="10ms"
--resources="cpus:2;gpus:0;mem:1024;disk:1024;ports:[31000-32000]"
--revocable_cpu_low_priority="true"
--runtime_dir="/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W"
--sandbox_directory="/mnt/mesos/sandbox" --strict="true" --switch_user="true"
--systemd_enable_support="true"
--systemd_runtime_directory="/run/systemd/system" --version="false"
--work_dir="/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC"
I0210 02:47:31.265250 19598 credentials.hpp:86] Loading credential for
authentication from
'/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W/credential'
I0210 02:47:31.265290 19598 slave.cpp:354] Agent using credential for:
test-principal
I0210 02:47:31.265303 19598 credentials.hpp:37] Loading credentials for
authentication from
'/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_I1eg7W/http_credentials'
I0210 02:47:31.265349 19598 http.cpp:919] Using default 'basic' HTTP
authenticator for realm 'mesos-agent-readonly'
I0210 02:47:31.265388 19598 http.cpp:919] Using default 'basic' HTTP
authenticator for realm 'mesos-agent-readwrite'
I0210 02:47:31.265653 19599 scheduler.cpp:470] New master detected at
[email protected]:33608
I0210 02:47:31.265671 19599 scheduler.cpp:479] Waiting for 0ns before
initiating a re-(connection) attempt with the master
I0210 02:47:31.266012 19598 slave.cpp:541] Agent resources: cpus(*):2;
mem(*):1024; disk(*):1024; ports(*):[31000-32000]
I0210 02:47:31.266049 19598 slave.cpp:549] Agent attributes: [ ]
I0210 02:47:31.266055 19598 slave.cpp:554] Agent hostname:
ip-10-150-111-24.ec2.internal
I0210 02:47:31.266111 19602 status_update_manager.cpp:177] Pausing sending
status updates
I0210 02:47:31.266312 19604 state.cpp:62] Recovering state from
'/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC/meta'
I0210 02:47:31.266428 19597 status_update_manager.cpp:203] Recovering status
update manager
I0210 02:47:31.266559 19600 slave.cpp:5555] Finished recovery
I0210 02:47:31.266736 19600 slave.cpp:5729] Querying resource estimator for
oversubscribable resources
I0210 02:47:31.266881 19600 status_update_manager.cpp:177] Pausing sending
status updates
I0210 02:47:31.266877 19598 slave.cpp:931] New master detected at
[email protected]:33608
I0210 02:47:31.266935 19598 slave.cpp:966] Detecting new master
I0210 02:47:31.266990 19598 slave.cpp:5743] Received oversubscribable resources
{} from the resource estimator
I0210 02:47:31.267329 19603 scheduler.cpp:361] Connected with the master at
http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.267763 19601 scheduler.cpp:243] Sending SUBSCRIBE call to
http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.268118 19597 process.cpp:3704] Handling HTTP event for process
'master' with path: '/master/api/v1/scheduler'
I0210 02:47:31.268470 19600 http.cpp:420] HTTP POST for
/master/api/v1/scheduler from 10.150.111.24:58632
I0210 02:47:31.268532 19600 master.cpp:2423] Received subscription request for
HTTP framework 'default'
I0210 02:47:31.268556 19600 master.cpp:2160] Authorizing framework principal
'test-principal' to receive offers for role '*'
I0210 02:47:31.268748 19599 master.cpp:2537] Subscribing framework 'default'
with checkpointing disabled and capabilities [ ]
I0210 02:47:31.268887 19597 master.hpp:2119] Sending heartbeat to
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.268909 19604 hierarchical.cpp:286] Added framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.268954 19604 hierarchical.cpp:1810] No allocations performed
I0210 02:47:31.268977 19604 hierarchical.cpp:1900] No inverse offers to send
out!
I0210 02:47:31.268992 19604 hierarchical.cpp:1394] Performed allocation for 0
agents in 45649ns
I0210 02:47:31.269172 19598 scheduler.cpp:676] Enqueuing event SUBSCRIBED
received from http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.269356 19598 scheduler.cpp:676] Enqueuing event HEARTBEAT
received from http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.273422 19602 slave.cpp:993] Authenticating with master
[email protected]:33608
I0210 02:47:31.273448 19602 slave.cpp:1004] Using default CRAM-MD5 authenticatee
I0210 02:47:31.273509 19599 authenticatee.cpp:121] Creating new client SASL
connection
I0210 02:47:31.273859 19599 master.cpp:7173] Authenticating
slave(105)@10.150.111.24:33608
I0210 02:47:31.273927 19598 authenticator.cpp:414] Starting authentication
session for crammd5-authenticatee(241)@10.150.111.24:33608
I0210 02:47:31.274020 19601 authenticator.cpp:98] Creating new server SASL
connection
I0210 02:47:31.274341 19601 authenticatee.cpp:213] Received SASL authentication
mechanisms: CRAM-MD5
I0210 02:47:31.274369 19601 authenticatee.cpp:239] Attempting to authenticate
with mechanism 'CRAM-MD5'
I0210 02:47:31.274416 19601 authenticator.cpp:204] Received SASL authentication
start
I0210 02:47:31.274457 19601 authenticator.cpp:326] Authentication requires more
steps
I0210 02:47:31.274494 19601 authenticatee.cpp:259] Received SASL authentication
step
I0210 02:47:31.274559 19598 authenticator.cpp:232] Received SASL authentication
step
I0210 02:47:31.274586 19598 auxprop.cpp:109] Request to lookup properties for
user: 'test-principal' realm: 'ip-10-150-111-24.ec2.internal' server FQDN:
'ip-10-150-111-24.ec2.internal' SASL_AUXPROP_VERIFY_AGAINST_HASH: false
SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: false
I0210 02:47:31.274600 19598 auxprop.cpp:181] Looking up auxiliary property
'*userPassword'
I0210 02:47:31.274615 19598 auxprop.cpp:181] Looking up auxiliary property
'*cmusaslsecretCRAM-MD5'
I0210 02:47:31.274629 19598 auxprop.cpp:109] Request to lookup properties for
user: 'test-principal' realm: 'ip-10-150-111-24.ec2.internal' server FQDN:
'ip-10-150-111-24.ec2.internal' SASL_AUXPROP_VERIFY_AGAINST_HASH: false
SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: true
I0210 02:47:31.274641 19598 auxprop.cpp:131] Skipping auxiliary property
'*userPassword' since SASL_AUXPROP_AUTHZID == true
I0210 02:47:31.274682 19598 auxprop.cpp:131] Skipping auxiliary property
'*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
I0210 02:47:31.274701 19598 authenticator.cpp:318] Authentication success
I0210 02:47:31.274760 19603 authenticatee.cpp:299] Authentication success
I0210 02:47:31.274780 19601 authenticator.cpp:432] Authentication session
cleanup for crammd5-authenticatee(241)@10.150.111.24:33608
I0210 02:47:31.274793 19598 master.cpp:7203] Successfully authenticated
principal 'test-principal' at slave(105)@10.150.111.24:33608
I0210 02:47:31.274907 19603 slave.cpp:1088] Successfully authenticated with
master [email protected]:33608
I0210 02:47:31.274969 19603 slave.cpp:1515] Will retry registration in
1.825425ms if necessary
I0210 02:47:31.275038 19602 master.cpp:5388] Registering agent at
slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal) with id
d8129420-2a04-48e7-9b28-6b0a0af73168-S0
I0210 02:47:31.275151 19602 registrar.cpp:461] Applied 1 operations in 12479ns;
attempting to update the registry
I0210 02:47:31.275413 19600 registrar.cpp:506] Successfully updated the
registry in 237056ns
I0210 02:47:31.275595 19601 slave.cpp:4343] Received ping from
slave-observer(98)@10.150.111.24:33608
I0210 02:47:31.275621 19600 master.cpp:5462] Registered agent
d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608
(ip-10-150-111-24.ec2.internal) with cpus(*):2; mem(*):1024; disk(*):1024;
ports(*):[31000-32000]
I0210 02:47:31.275703 19597 hierarchical.cpp:516] Added agent
d8129420-2a04-48e7-9b28-6b0a0af73168-S0 (ip-10-150-111-24.ec2.internal) with
cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] (allocated: {})
I0210 02:47:31.275722 19601 slave.cpp:1134] Registered with master
[email protected]:33608; given agent ID
d8129420-2a04-48e7-9b28-6b0a0af73168-S0
I0210 02:47:31.275737 19601 fetcher.cpp:90] Clearing fetcher cache
I0210 02:47:31.275805 19599 status_update_manager.cpp:184] Resuming sending
status updates
I0210 02:47:31.275915 19601 slave.cpp:1162] Checkpointing SlaveInfo to
'/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC/meta/slaves/d8129420-2a04-48e7-9b28-6b0a0af73168-S0/slave.info'
I0210 02:47:31.275993 19597 hierarchical.cpp:1900] No inverse offers to send
out!
I0210 02:47:31.276023 19597 hierarchical.cpp:1394] Performed allocation for 1
agents in 263025ns
I0210 02:47:31.276077 19601 slave.cpp:1200] Forwarding total oversubscribed
resources {}
I0210 02:47:31.276083 19602 master.cpp:7002] Sending 1 offers to framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
I0210 02:47:31.276250 19602 master.cpp:6019] Received update of agent
d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608
(ip-10-150-111-24.ec2.internal) with total oversubscribed resources {}
I0210 02:47:31.276347 19599 hierarchical.cpp:589] Agent
d8129420-2a04-48e7-9b28-6b0a0af73168-S0 (ip-10-150-111-24.ec2.internal) updated
with oversubscribed resources {} (total: cpus(*):2; mem(*):1024; disk(*):1024;
ports(*):[31000-32000], allocated: cpus(*)(allocated: *):2; mem(*)(allocated:
*):1024; disk(*)(allocated: *):1024; ports(*)(allocated: *):[31000-32000])
I0210 02:47:31.276420 19599 hierarchical.cpp:1810] No allocations performed
I0210 02:47:31.276445 19599 hierarchical.cpp:1900] No inverse offers to send
out!
I0210 02:47:31.276461 19599 hierarchical.cpp:1394] Performed allocation for 1
agents in 56740ns
I0210 02:47:31.276593 19598 scheduler.cpp:676] Enqueuing event OFFERS received
from http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.277225 19601 scheduler.cpp:243] Sending ACCEPT call to
http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.277544 19597 process.cpp:3704] Handling HTTP event for process
'master' with path: '/master/api/v1/scheduler'
I0210 02:47:31.277899 19598 http.cpp:420] HTTP POST for
/master/api/v1/scheduler from 10.150.111.24:58630
I0210 02:47:31.278055 19598 master.cpp:3776] Processing ACCEPT call for offers:
[ d8129420-2a04-48e7-9b28-6b0a0af73168-O0 ] on agent
d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608
(ip-10-150-111-24.ec2.internal) for framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
I0210 02:47:31.278100 19598 master.cpp:3352] Authorizing framework principal
'test-principal' to launch task 512412f2-4982-435f-a963-8dc60a3fb03e
W0210 02:47:31.278590 19598 validation.cpp:1064] Executor 'default' for task
'512412f2-4982-435f-a963-8dc60a3fb03e' uses less CPUs (None) than the minimum
required (0.01). Please update your executor, as this will be mandatory in
future releases.
W0210 02:47:31.278615 19598 validation.cpp:1076] Executor 'default' for task
'512412f2-4982-435f-a963-8dc60a3fb03e' uses less memory (None) than the minimum
required (32MB). Please update your executor, as this will be mandatory in
future releases.
I0210 02:47:31.278728 19598 master.cpp:9063] Adding task
512412f2-4982-435f-a963-8dc60a3fb03e with resources cpus(*)(allocated: *):2;
mem(*)(allocated: *):1024; disk(*)(allocated: *):1024; ports(*)(allocated:
*):[31000-32000] on agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at
slave(105)@10.150.111.24:33608 (ip-10-150-111-24.ec2.internal)
I0210 02:47:31.278810 19598 master.cpp:4426] Launching task
512412f2-4982-435f-a963-8dc60a3fb03e of framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default) with resources
cpus(*)(allocated: *):2; mem(*)(allocated: *):1024; disk(*)(allocated: *):1024;
ports(*)(allocated: *):[31000-32000] on agent
d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608
(ip-10-150-111-24.ec2.internal)
I0210 02:47:31.278962 19599 slave.cpp:1625] Got assigned task
'512412f2-4982-435f-a963-8dc60a3fb03e' for framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.279145 19599 slave.cpp:1785] Launching task
'512412f2-4982-435f-a963-8dc60a3fb03e' for framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.279255 19603 hierarchical.cpp:807] Updated allocation of
framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 on agent
d8129420-2a04-48e7-9b28-6b0a0af73168-S0 from cpus(*)(allocated: *):2;
mem(*)(allocated: *):1024; disk(*)(allocated: *):1024; ports(*)(allocated:
*):[31000-32000] to cpus(*)(allocated: *):2; mem(*)(allocated: *):1024;
disk(*)(allocated: *):1024; ports(*)(allocated: *):[31000-32000]
I0210 02:47:31.279412 19599 paths.cpp:547] Trying to chown
'/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC/slaves/d8129420-2a04-48e7-9b28-6b0a0af73168-S0/frameworks/d8129420-2a04-48e7-9b28-6b0a0af73168-0000/executors/default/runs/f287deb2-0a21-4be0-ad54-613d7a35087d'
to user 'root'
I0210 02:47:31.279538 19599 slave.cpp:6477] Launching executor 'default' of
framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 with resources {} in work
directory
'/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC/slaves/d8129420-2a04-48e7-9b28-6b0a0af73168-S0/frameworks/d8129420-2a04-48e7-9b28-6b0a0af73168-0000/executors/default/runs/f287deb2-0a21-4be0-ad54-613d7a35087d'
I0210 02:47:31.279744 19599 slave.cpp:2115] Queued task
'512412f2-4982-435f-a963-8dc60a3fb03e' for executor 'default' of framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.279788 19599 slave.cpp:884] Successfully attached file
'/tmp/HttpFaultToleranceTest_SchedulerFailoverFrameworkToExecutorMessage_g77kUC/slaves/d8129420-2a04-48e7-9b28-6b0a0af73168-S0/frameworks/d8129420-2a04-48e7-9b28-6b0a0af73168-0000/executors/default/runs/f287deb2-0a21-4be0-ad54-613d7a35087d'
I0210 02:47:31.280185 19604 executor.cpp:192] Version: 1.3.0
I0210 02:47:31.280642 19597 executor.cpp:401] Connected with the agent
I0210 02:47:31.280867 19603 executor.cpp:302] Sending SUBSCRIBE call to
http://10.150.111.24:33608/slave(105)/api/v1/executor
I0210 02:47:31.281189 19597 process.cpp:3704] Handling HTTP event for process
'slave(105)' with path: '/slave(105)/api/v1/executor'
I0210 02:47:31.281433 19597 http.cpp:307] HTTP POST for
/slave(105)/api/v1/executor from 10.150.111.24:58634
I0210 02:47:31.281486 19597 slave.cpp:3166] Received Subscribe request for HTTP
executor 'default' of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.281761 19597 slave.cpp:2328] Sending queued task
'512412f2-4982-435f-a963-8dc60a3fb03e' to executor 'default' of framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (via HTTP)
I0210 02:47:31.282199 19602 executor.cpp:716] Enqueuing event SUBSCRIBED
received from http://10.150.111.24:33608/slave(105)/api/v1/executor
I0210 02:47:31.282369 19602 executor.cpp:716] Enqueuing event LAUNCH received
from http://10.150.111.24:33608/slave(105)/api/v1/executor
I0210 02:47:31.282842 19578 scheduler.cpp:184] Version: 1.3.0
I0210 02:47:31.283253 19604 scheduler.cpp:470] New master detected at
[email protected]:33608
I0210 02:47:31.283275 19604 scheduler.cpp:479] Waiting for 0ns before
initiating a re-(connection) attempt with the master
I0210 02:47:31.284811 19602 scheduler.cpp:361] Connected with the master at
http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.285277 19599 scheduler.cpp:243] Sending SUBSCRIBE call to
http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.285604 19604 process.cpp:3704] Handling HTTP event for process
'master' with path: '/master/api/v1/scheduler'
I0210 02:47:31.285926 19602 http.cpp:420] HTTP POST for
/master/api/v1/scheduler from 10.150.111.24:58640
I0210 02:47:31.285991 19602 master.cpp:2423] Received subscription request for
HTTP framework 'default'
I0210 02:47:31.286016 19602 master.cpp:2160] Authorizing framework principal
'test-principal' to receive offers for role '*'
I0210 02:47:31.286171 19598 master.cpp:2537] Subscribing framework 'default'
with checkpointing disabled and capabilities [ ]
I0210 02:47:31.286193 19598 master.cpp:2583] Updating info for framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.286348 19603 master.hpp:2119] Sending heartbeat to
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.286393 19601 slave.cpp:2824] Updating framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 pid to @0.0.0.0:0
I0210 02:47:31.286487 19603 status_update_manager.cpp:184] Resuming sending
status updates
I0210 02:47:31.286502 19600 scheduler.cpp:676] Enqueuing event ERROR received
from http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.286542 19603 master.cpp:1237] Ignoring disconnection for
framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default) as it has already
reconnected
I0210 02:47:31.286800 19603 scheduler.cpp:676] Enqueuing event SUBSCRIBED
received from http://10.150.111.24:33608/master/api/v1/scheduler
E0210 02:47:31.286847 19602 scheduler.cpp:649] End-Of-File received from
master. The master closed the event stream
I0210 02:47:31.287020 19601 scheduler.cpp:444] Re-detecting master
I0210 02:47:31.287117 19603 scheduler.cpp:676] Enqueuing event HEARTBEAT
received from http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.287179 19601 scheduler.cpp:395] Ignoring disconnection attempt
from stale connection
I0210 02:47:31.287207 19601 scheduler.cpp:395] Ignoring disconnection attempt
from stale connection
I0210 02:47:31.287338 19598 scheduler.cpp:470] New master detected at
[email protected]:33608
I0210 02:47:31.287356 19598 scheduler.cpp:479] Waiting for 0ns before
initiating a re-(connection) attempt with the master
I0210 02:47:31.287582 19598 scheduler.cpp:243] Sending MESSAGE call to
http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.288050 19600 process.cpp:3704] Handling HTTP event for process
'master' with path: '/master/api/v1/scheduler'
I0210 02:47:31.288568 19604 http.cpp:420] HTTP POST for
/master/api/v1/scheduler from 10.150.111.24:58638
I0210 02:47:31.288591 19601 scheduler.cpp:361] Connected with the master at
http://10.150.111.24:33608/master/api/v1/scheduler
I0210 02:47:31.288615 19604 master.cpp:5271] Processing MESSAGE call from
framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default) to agent
d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608
(ip-10-150-111-24.ec2.internal)
I0210 02:47:31.289027 19603 executor.cpp:716] Enqueuing event MESSAGE received
from http://10.150.111.24:33608/slave(105)/api/v1/executor
I0210 02:47:31.289861 19597 master.cpp:1381] Framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default) disconnected
I0210 02:47:31.289906 19597 master.cpp:3081] Deactivating framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
I0210 02:47:31.289947 19597 master.cpp:3058] Disconnecting framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
I0210 02:47:31.289978 19601 hierarchical.cpp:415] Deactivated framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.290050 19597 master.cpp:1396] Giving framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default) 0ns to failover
I0210 02:47:31.291277 19598 master.cpp:6842] Framework failover timeout,
removing framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
I0210 02:47:31.291296 19598 master.cpp:7727] Removing framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (default)
I0210 02:47:31.291358 19598 master.cpp:8312] Updating the state of task
512412f2-4982-435f-a963-8dc60a3fb03e of framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (latest state: TASK_KILLED, status
update state: TASK_KILLED)
I0210 02:47:31.291368 19599 slave.cpp:2655] Asked to shut down framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000 by [email protected]:33608
I0210 02:47:31.291404 19599 slave.cpp:2680] Shutting down framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.291416 19599 slave.cpp:5080] Shutting down executor 'default' of
framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 (via HTTP)
I0210 02:47:31.291527 19598 master.cpp:8406] Removing task
512412f2-4982-435f-a963-8dc60a3fb03e with resources cpus(*)(allocated: *):2;
mem(*)(allocated: *):1024; disk(*)(allocated: *):1024; ports(*)(allocated:
*):[31000-32000] of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 on
agent d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608
(ip-10-150-111-24.ec2.internal)
I0210 02:47:31.291599 19598 master.cpp:8435] Removing executor 'default' with
resources {} of framework d8129420-2a04-48e7-9b28-6b0a0af73168-0000 on agent
d8129420-2a04-48e7-9b28-6b0a0af73168-S0 at slave(105)@10.150.111.24:33608
(ip-10-150-111-24.ec2.internal)
I0210 02:47:31.291600 19603 hierarchical.cpp:1064] Recovered cpus(*)(allocated:
*):2; mem(*)(allocated: *):1024; disk(*)(allocated: *):1024;
ports(*)(allocated: *):[31000-32000] (total: cpus(*):2; mem(*):1024;
disk(*):1024; ports(*):[31000-32000], allocated: {}) on agent
d8129420-2a04-48e7-9b28-6b0a0af73168-S0 from framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.291684 19603 hierarchical.cpp:362] Removed framework
d8129420-2a04-48e7-9b28-6b0a0af73168-0000
I0210 02:47:31.292042 19603 executor.cpp:716] Enqueuing event SHUTDOWN received
from http://10.150.111.24:33608/slave(105)/api/v1/executor
*** Aborted at 1486694851 (unix time) try "date -d @1486694851" if you are
using GNU date ***
PC: @ 0x55e75dde42b2
testing::internal::UntypedFunctionMockerBase::SetOwnerAndName()
*** SIGBUS (@0x0) received by PID 19578 (TID 0x7fd07ac1f700) from PID 0; stack
trace: ***
@ 0x7fd082e30390 (unknown)
@ 0x55e75dde42b2
testing::internal::UntypedFunctionMockerBase::SetOwnerAndName()
@ 0x55e75d11ebe6 mesos::internal::tests::executor::TestMesos<>::events()
@ 0x55e75d10f34f std::_Function_handler<>::_M_invoke()
@ 0x7fd084a840b2 process::AsyncExecutorProcess::execute<>()
@ 0x7fd084a9595a
_ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchI7NothingNS0_20AsyncExecutorProcessERKSt8functionIFvRKSt5queueIN5mesos2v18executor5EventESt5dequeISC_SaISC_EEEEESG_PvSK_SG_SN_EENS0_6FutureIT_EERKNS0_3PIDIT0_EEMSS_FSP_T1_T2_T3_ET4_T5_T6_EUlS2_E_E9_M_invokeERKSt9_Any_dataOS2_
@ 0x7fd0857b340c process::ProcessBase::visit()
@ 0x7fd0857c5c33 process::ProcessManager::resume()
@ 0x7fd0857d06e7
_ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUt_vEEE6_M_runEv
@ 0x7fd083613c80 (unknown)
@ 0x7fd082e266ba start_thread
@ 0x7fd082b5c82d (unknown)
{code}
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)