Benjamin Bannier created MESOS-7784:
---------------------------------------

             Summary: 
MasterTestPrePostReservationRefinement.CreateAndDestroyVolumesV1 is flaky
                 Key: MESOS-7784
                 URL: https://issues.apache.org/jira/browse/MESOS-7784
             Project: Mesos
          Issue Type: Bug
          Components: test
    Affects Versions: 1.4.0
         Environment: ASF CI, cmake,clang,--verbose --enable-libevent 
--enable-ssl,GLOG_v=1 
MESOS_VERBOSE=1,ubuntu:14.04,(ubuntu)&&(!ubuntu-us1)&&(!ubuntu-eu2)
            Reporter: Benjamin Bannier


Saw {{bool/MasterTestPrePostReservationRefinement.CreateAndDestroyVolumesV1/1}} 
fail in ASF CI today.

{noformat}
[ RUN      ] 
bool/MasterTestPrePostReservationRefinement.CreateAndDestroyVolumesV1/1
I0711 22:59:15.174615   608 cluster.cpp:162] Creating default 'local' authorizer
I0711 22:59:15.176837  8488 master.cpp:442] Master 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7 (6e892dc05c61) started on 172.17.0.4:32791
I0711 22:59:15.177094  8488 master.cpp:444] Flags at startup: --acls="" 
--agent_ping_timeout="15secs" --agent_reregister_timeout="10mins" 
--allocation_interval="1secs" --allocator="HierarchicalDRF" 
--authenticate_agents="true" --authenticate_frameworks="true" 
--authenticate_http_frameworks="true" --authenticate_http_readonly="true" 
--authenticate_http_readwrite="true" --authenticators="crammd5" 
--authorizers="local" --credentials="/tmp/fTUJFH/credentials" 
--filter_gpu_resources="true" --framework_sorter="drf" --help="false" 
--hostname_lookup="true" --http_authenticators="basic" 
--http_framework_authenticators="basic" --initialize_driver_logging="true" 
--log_auto_initialize="true" --logbufsecs="0" --logging_level="INFO" 
--max_agent_ping_timeouts="5" --max_completed_frameworks="50" 
--max_completed_tasks_per_framework="1000" 
--max_unreachable_tasks_per_framework="1000" --port="5050" --quiet="false" 
--recovery_agent_removal_limit="100%" --registry="in_memory" 
--registry_fetch_timeout="1mins" --registry_gc_interval="15mins" 
--registry_max_agent_age="2weeks" --registry_max_agent_count="102400" 
--registry_store_timeout="100secs" --registry_strict="false" 
--root_submissions="true" --user_sorter="drf" --version="false" 
--webui_dir="/usr/local/share/mesos/webui" --work_dir="/tmp/fTUJFH/master" 
--zk_session_timeout="10secs"
I0711 22:59:15.177460  8488 master.cpp:494] Master only allowing authenticated 
frameworks to register
I0711 22:59:15.177477  8488 master.cpp:508] Master only allowing authenticated 
agents to register
I0711 22:59:15.177487  8488 master.cpp:521] Master only allowing authenticated 
HTTP frameworks to register
I0711 22:59:15.177497  8488 credentials.hpp:37] Loading credentials for 
authentication from '/tmp/fTUJFH/credentials'
I0711 22:59:15.177793  8488 master.cpp:566] Using default 'crammd5' 
authenticator
I0711 22:59:15.177947  8488 http.cpp:1009] Creating default 'basic' HTTP 
authenticator for realm 'mesos-master-readonly'
I0711 22:59:15.178114  8488 http.cpp:1009] Creating default 'basic' HTTP 
authenticator for realm 'mesos-master-readwrite'
I0711 22:59:15.178267  8488 http.cpp:1009] Creating default 'basic' HTTP 
authenticator for realm 'mesos-master-scheduler'
I0711 22:59:15.178421  8488 master.cpp:646] Authorization enabled
I0711 22:59:15.178761  8482 hierarchical.cpp:171] Initialized hierarchical 
allocator process
I0711 22:59:15.178794  8484 whitelist_watcher.cpp:77] No whitelist given
I0711 22:59:15.182126  8486 master.cpp:2163] Elected as the leading master!
I0711 22:59:15.182142  8486 master.cpp:1702] Recovering from registrar
I0711 22:59:15.182231  8484 registrar.cpp:345] Recovering registrar
I0711 22:59:15.182955  8484 registrar.cpp:389] Successfully fetched the 
registry (0B) in 500992ns
I0711 22:59:15.183069  8484 registrar.cpp:493] Applied 1 operations in 42023ns; 
attempting to update the registry
I0711 22:59:15.183899  8484 registrar.cpp:550] Successfully updated the 
registry in 573952ns
I0711 22:59:15.184029  8484 registrar.cpp:422] Successfully recovered registrar
I0711 22:59:15.184540  8486 hierarchical.cpp:209] Skipping recovery of 
hierarchical allocator: nothing to recover
I0711 22:59:15.184530  8496 master.cpp:1801] Recovered 0 agents from the 
registry (129B); allowing 10mins for agents to re-register
I0711 22:59:15.192324   608 containerizer.cpp:230] Using isolation: 
posix/cpu,posix/mem,filesystem/posix,network/cni,environment_secret
W0711 22:59:15.193044   608 backend.cpp:76] Failed to create 'aufs' backend: 
AufsBackend requires root privileges
W0711 22:59:15.193667   608 backend.cpp:76] Failed to create 'bind' backend: 
BindBackend requires root privileges
I0711 22:59:15.193964   608 provisioner.cpp:255] Using default backend 'copy'
I0711 22:59:15.196156   608 cluster.cpp:448] Creating default 'local' authorizer
I0711 22:59:15.198931  8500 slave.cpp:250] Mesos agent started on 
(747)@172.17.0.4:32791
I0711 22:59:15.199259  8500 slave.cpp:251] Flags at startup: --acls="" 
--appc_simple_discovery_uri_prefix="http://"; 
--appc_store_dir="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/store/appc"
 --authenticate_http_executors="true" --authenticate_http_readonly="true" 
--authenticate_http_readwrite="true" --authenticatee="crammd5" 
--authentication_backoff_factor="1secs" --authorizer="local" 
--cgroups_cpu_enable_pids_and_tids_count="false" --cgroups_enable_cfs="false" 
--cgroups_hierarchy="/sys/fs/cgroup" --cgroups_limit_swap="false" 
--cgroups_root="mesos" --container_disk_watch_interval="15secs" 
--containerizers="mesos" 
--credential="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/credential"
 --default_role="*" --disk_watch_interval="1mins" --docker="docker" 
--docker_kill_orphans="true" --docker_registry="https://registry-1.docker.io"; 
--docker_remove_delay="6hrs" --docker_socket="/var/run/docker.sock" 
--docker_stop_timeout="0ns" 
--docker_store_dir="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/store/docker"
 --docker_volume_checkpoint_dir="/var/run/mesos/isolators/docker/volume" 
--enforce_container_disk_quota="false" --executor_registration_timeout="1mins" 
--executor_reregistration_timeout="2secs" 
--executor_secret_key="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/executor_secret_key"
 --executor_shutdown_grace_period="5secs" 
--fetcher_cache_dir="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/fetch"
 --fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks" 
--gc_disk_headroom="0.1" --hadoop_home="" --help="false" 
--hostname_lookup="true" --http_command_executor="false" 
--http_credentials="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/http_credentials"
 --http_heartbeat_interval="30secs" --initialize_driver_logging="true" 
--isolation="posix/cpu,posix/mem" --launcher="posix" 
--launcher_dir="/mesos/build/src" --logbufsecs="0" --logging_level="INFO" 
--max_completed_executors_per_framework="150" 
--oversubscribed_resources_interval="15secs" --perf_duration="10secs" 
--perf_interval="1mins" --port="5051" --qos_correction_interval_min="0ns" 
--quiet="false" --recover="reconnect" --recovery_timeout="15mins" 
--registration_backoff_factor="10ms" --resources="disk(role1):1024" 
--revocable_cpu_low_priority="true" 
--runtime_dir="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y"
 --sandbox_directory="/mnt/mesos/sandbox" --strict="true" --switch_user="true" 
--systemd_enable_support="true" 
--systemd_runtime_directory="/run/systemd/system" --version="false" 
--work_dir="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_0yLLp8"
I0711 22:59:15.200019  8500 credentials.hpp:86] Loading credential for 
authentication from 
'/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/credential'
I0711 22:59:15.200402  8500 slave.cpp:283] Agent using credential for: 
test-principal
I0711 22:59:15.200629  8500 credentials.hpp:37] Loading credentials for 
authentication from 
'/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/http_credentials'
I0711 22:59:15.201143  8500 http.cpp:1009] Creating default 'basic' HTTP 
authenticator for realm 'mesos-agent-executor'
I0711 22:59:15.201472  8500 http.cpp:1030] Creating default 'jwt' HTTP 
authenticator for realm 'mesos-agent-executor'
I0711 22:59:15.201917  8500 http.cpp:1009] Creating default 'basic' HTTP 
authenticator for realm 'mesos-agent-readonly'
I0711 22:59:15.202338  8500 http.cpp:1030] Creating default 'jwt' HTTP 
authenticator for realm 'mesos-agent-readonly'
I0711 22:59:15.202867  8500 http.cpp:1009] Creating default 'basic' HTTP 
authenticator for realm 'mesos-agent-readwrite'
I0711 22:59:15.203336  8500 http.cpp:1030] Creating default 'jwt' HTTP 
authenticator for realm 'mesos-agent-readwrite'
I0711 22:59:15.205016  8500 slave.cpp:565] Agent resources: 
[{"name":"disk","reservations":[{"role":"role1","type":"STATIC"}],"scalar":{"value":1024.0},"type":"SCALAR"},{"name":"cpus","scalar":{"value":24.0},"type":"SCALAR"},{"name":"mem","scalar":{"value":95614.0},"type":"SCALAR"},{"name":"ports","ranges":{"range":[{"begin":31000,"end":32000}]},"type":"RANGES"}]
I0711 22:59:15.205534  8500 slave.cpp:573] Agent attributes: [  ]
I0711 22:59:15.205739  8500 slave.cpp:582] Agent hostname: 6e892dc05c61
I0711 22:59:15.206575  8485 status_update_manager.cpp:177] Pausing sending 
status updates
I0711 22:59:15.208063  8494 state.cpp:64] Recovering state from 
'/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_0yLLp8/meta'
I0711 22:59:15.208487  8494 status_update_manager.cpp:203] Recovering status 
update manager
I0711 22:59:15.208730  8489 containerizer.cpp:582] Recovering containerizer
I0711 22:59:15.210507  8487 provisioner.cpp:416] Provisioner recovery complete
I0711 22:59:15.210856  8484 slave.cpp:6207] Finished recovery
I0711 22:59:15.211546  8484 slave.cpp:6389] Querying resource estimator for 
oversubscribable resources
I0711 22:59:15.211982  8481 slave.cpp:6403] Received oversubscribable resources 
{} from the resource estimator
I0711 22:59:15.212260  8491 slave.cpp:971] New master detected at 
[email protected]:32791
I0711 22:59:15.212343  8500 status_update_manager.cpp:177] Pausing sending 
status updates
I0711 22:59:15.212570  8491 slave.cpp:1006] Detecting new master
I0711 22:59:15.223120  8500 slave.cpp:1033] Authenticating with master 
[email protected]:32791
I0711 22:59:15.223232  8500 slave.cpp:1044] Using default CRAM-MD5 authenticatee
I0711 22:59:15.223508  8500 authenticatee.cpp:121] Creating new client SASL 
connection
I0711 22:59:15.223800  8500 master.cpp:7773] Authenticating 
slave(747)@172.17.0.4:32791
I0711 22:59:15.223963  8500 authenticator.cpp:414] Starting authentication 
session for crammd5-authenticatee(1376)@172.17.0.4:32791
I0711 22:59:15.224268  8500 authenticator.cpp:98] Creating new server SASL 
connection
I0711 22:59:15.224467  8500 authenticatee.cpp:213] Received SASL authentication 
mechanisms: CRAM-MD5
I0711 22:59:15.224488  8500 authenticatee.cpp:239] Attempting to authenticate 
with mechanism 'CRAM-MD5'
I0711 22:59:15.224565  8500 authenticator.cpp:204] Received SASL authentication 
start
I0711 22:59:15.224609  8500 authenticator.cpp:326] Authentication requires more 
steps
I0711 22:59:15.224685  8500 authenticatee.cpp:259] Received SASL authentication 
step
I0711 22:59:15.224761  8500 authenticator.cpp:232] Received SASL authentication 
step
I0711 22:59:15.224781  8500 auxprop.cpp:109] Request to lookup properties for 
user: 'test-principal' realm: '6e892dc05c61' server FQDN: '6e892dc05c61' 
SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false 
SASL_AUXPROP_AUTHZID: false 
I0711 22:59:15.224792  8500 auxprop.cpp:181] Looking up auxiliary property 
'*userPassword'
I0711 22:59:15.224828  8500 auxprop.cpp:181] Looking up auxiliary property 
'*cmusaslsecretCRAM-MD5'
I0711 22:59:15.224845  8500 auxprop.cpp:109] Request to lookup properties for 
user: 'test-principal' realm: '6e892dc05c61' server FQDN: '6e892dc05c61' 
SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false 
SASL_AUXPROP_AUTHZID: true 
I0711 22:59:15.224853  8500 auxprop.cpp:131] Skipping auxiliary property 
'*userPassword' since SASL_AUXPROP_AUTHZID == true
I0711 22:59:15.224859  8500 auxprop.cpp:131] Skipping auxiliary property 
'*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
I0711 22:59:15.224874  8500 authenticator.cpp:318] Authentication success
I0711 22:59:15.225013  8500 authenticatee.cpp:299] Authentication success
I0711 22:59:15.225075  8500 master.cpp:7803] Successfully authenticated 
principal 'test-principal' at slave(747)@172.17.0.4:32791
I0711 22:59:15.225141  8500 authenticator.cpp:432] Authentication session 
cleanup for crammd5-authenticatee(1376)@172.17.0.4:32791
I0711 22:59:15.225407  8500 slave.cpp:1128] Successfully authenticated with 
master [email protected]:32791
I0711 22:59:15.225633  8500 slave.cpp:1572] Will retry registration in 783987ns 
if necessary
I0711 22:59:15.225926  8500 master.cpp:5677] Received register agent message 
from slave(747)@172.17.0.4:32791 (6e892dc05c61)
I0711 22:59:15.225955  8500 master.cpp:3773] Authorizing agent with principal 
'test-principal'
I0711 22:59:15.226485  8500 master.cpp:5737] Authorized registration of agent 
at slave(747)@172.17.0.4:32791 (6e892dc05c61)
I0711 22:59:15.226609  8500 master.cpp:5830] Registering agent at 
slave(747)@172.17.0.4:32791 (6e892dc05c61) with id 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0
I0711 22:59:15.227051  8491 registrar.cpp:493] Applied 1 operations in 89228ns; 
attempting to update the registry
I0711 22:59:15.227352  8500 slave.cpp:1572] Will retry registration in 
25.413469ms if necessary
I0711 22:59:15.227893  8491 registrar.cpp:550] Successfully updated the 
registry in 773120ns
I0711 22:59:15.227954  8502 master.cpp:5671] Ignoring register agent message 
from slave(747)@172.17.0.4:32791 (6e892dc05c61) as registration is already in 
progress
I0711 22:59:15.228094  8491 master.cpp:5877] Admitted agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791 
(6e892dc05c61)
I0711 22:59:15.228929  8492 slave.cpp:4905] Received ping from 
slave-observer(680)@172.17.0.4:32791
I0711 22:59:15.229359  8482 slave.cpp:1174] Registered with master 
[email protected]:32791; given agent ID 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0
I0711 22:59:15.229082  8491 master.cpp:5908] Registered agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791 
(6e892dc05c61) with 
[{"name":"disk","reservations":[{"role":"role1","type":"STATIC"}],"scalar":{"value":1024.0},"type":"SCALAR"},{"name":"cpus","scalar":{"value":24.0},"type":"SCALAR"},{"name":"mem","scalar":{"value":95614.0},"type":"SCALAR"},{"name":"ports","ranges":{"range":[{"begin":31000,"end":32000}]},"type":"RANGES"}]
I0711 22:59:15.233381  8491 process.cpp:3820] Handling HTTP event for process 
'master' with path: '/master/api/v1'
I0711 22:59:15.229657  8486 status_update_manager.cpp:184] Resuming sending 
status updates
I0711 22:59:15.235273  8491 http.cpp:1149] HTTP POST for /master/api/v1 from 
172.17.0.4:38046
I0711 22:59:15.235756  8482 slave.cpp:1194] Checkpointing SlaveInfo to 
'/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_0yLLp8/meta/slaves/838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0/slave.info'
I0711 22:59:15.229573  8492 hierarchical.cpp:593] Added agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 (6e892dc05c61) with disk(reservations: 
[(STATIC,role1)]):1024; cpus:24; mem:95614; ports:[31000-32000] (allocated: {})
I0711 22:59:15.236582  8492 hierarchical.cpp:1925] No allocations performed
I0711 22:59:15.237116  8492 hierarchical.cpp:1468] Performed allocation for 1 
agents in 695738ns
I0711 22:59:15.237339  8491 http.cpp:660] Processing call CREATE_VOLUMES
I0711 22:59:15.237845  8491 master.cpp:3693] Authorizing principal 
'test-principal' to create volumes 
'[{"disk":{"persistence":{"id":"id1","principal":"test-principal"},"volume":{"container_path":"path1","mode":"RW"}},"name":"disk","reservations":[{"role":"role1","type":"STATIC"}],"scalar":{"value":64.0},"type":"SCALAR"}]'
I0711 22:59:15.236997  8482 slave.cpp:1232] Forwarding total oversubscribed 
resources {}
I0711 22:59:15.239473  8482 master.cpp:6624] Received update of agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791 
(6e892dc05c61) with total oversubscribed resources {}
I0711 22:59:15.242722  8495 master.cpp:9041] Sending updated checkpointed 
resources disk(reservations: [(STATIC,role1)])[id1:path1]:64 to agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791 
(6e892dc05c61)
I0711 22:59:15.244683  8495 slave.cpp:3453] Updated checkpointed resources from 
{} to disk(reservations: [(STATIC,role1)])[id1:path1]:64
I0711 22:59:15.247383  8487 hierarchical.cpp:660] Agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 (6e892dc05c61) updated with total 
resources disk(reservations: [(STATIC,role1)]):1024; cpus:24; mem:95614; 
ports:[31000-32000]
I0711 22:59:15.248096  8487 hierarchical.cpp:1925] No allocations performed
I0711 22:59:15.248278  8487 hierarchical.cpp:1468] Performed allocation for 1 
agents in 340150ns
I0711 22:59:15.248538  8493 process.cpp:3820] Handling HTTP event for process 
'master' with path: '/master/api/v1'
I0711 22:59:15.250051  8489 http.cpp:1149] HTTP POST for /master/api/v1 from 
172.17.0.4:38048
I0711 22:59:15.250254  8489 http.cpp:660] Processing call DESTROY_VOLUMES
I0711 22:59:15.250676  8489 master.cpp:3745] Authorizing principal 
'test-principal' to destroy volumes 
'[{"disk":{"persistence":{"id":"id1","principal":"test-principal"},"volume":{"container_path":"path1","mode":"RW"}},"name":"disk","reservations":[{"role":"role1","type":"STATIC"}],"scalar":{"value":64.0},"type":"SCALAR"}]'
/mesos/src/tests/master_tests.cpp:8455: Failure
Value of: (v1DestroyVolumesResponse).get().status
  Actual: "409 Conflict"
Expected: Accepted().status
Which is: "202 Accepted"
I0711 22:59:15.260431  8492 slave.cpp:843] Agent terminating
I0711 22:59:15.260645  8492 master.cpp:1318] Agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791 
(6e892dc05c61) disconnected
I0711 22:59:15.260676  8492 master.cpp:3271] Disconnecting agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791 
(6e892dc05c61)
I0711 22:59:15.261204  8492 master.cpp:3290] Deactivating agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791 
(6e892dc05c61)
I0711 22:59:15.261584  8496 hierarchical.cpp:690] Agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 deactivated
I0711 22:59:15.267756   608 master.cpp:1160] Master terminating
I0711 22:59:15.268867  8485 hierarchical.cpp:626] Removed agent 
838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0
[  FAILED  ] 
bool/MasterTestPrePostReservationRefinement.CreateAndDestroyVolumesV1/1, where 
GetParam() = false (99 ms)
{noformat}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to