[ https://issues.apache.org/jira/browse/MESOS-7784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Alexander Rukletsov reassigned MESOS-7784:
------------------------------------------

Resolution: Fixed
Assignee: Alexander Rukletsov
Fix Version/s: 1.5.0

Fixed as part of MESOS-8082.

> MasterTestPrePostReservationRefinement.CreateAndDestroyVolumesV1 is flaky
> -------------------------------------------------------------------------
>
> Key: MESOS-7784
> URL: https://issues.apache.org/jira/browse/MESOS-7784
> Project: Mesos
> Issue Type: Bug
> Components: test
> Affects Versions: 1.4.0
> Environment: ASF CI, cmake,clang,--verbose --enable-libevent
> --enable-ssl,GLOG_v=1
> MESOS_VERBOSE=1,ubuntu:14.04,(ubuntu)&&(!ubuntu-us1)&&(!ubuntu-eu2)
> Reporter: Benjamin Bannier
> Assignee: Alexander Rukletsov
> Labels: flaky, flaky-test, mesosphere
> Fix For: 1.5.0
>
>
> Saw
> {{bool/MasterTestPrePostReservationRefinement.CreateAndDestroyVolumesV1/1}}
> fail in ASF CI today.
> {noformat}
> [ RUN ]
> bool/MasterTestPrePostReservationRefinement.CreateAndDestroyVolumesV1/1
> I0711 22:59:15.174615 608 cluster.cpp:162] Creating default 'local'
> authorizer
> I0711 22:59:15.176837 8488 master.cpp:442] Master
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7 (6e892dc05c61) started on
> 172.17.0.4:32791
> I0711 22:59:15.177094 8488 master.cpp:444] Flags at startup: --acls=""
> --agent_ping_timeout="15secs" --agent_reregister_timeout="10mins"
> --allocation_interval="1secs" --allocator="HierarchicalDRF"
> --authenticate_agents="true" --authenticate_frameworks="true"
> --authenticate_http_frameworks="true" --authenticate_http_readonly="true"
> --authenticate_http_readwrite="true" --authenticators="crammd5"
> --authorizers="local" --credentials="/tmp/fTUJFH/credentials"
> --filter_gpu_resources="true" --framework_sorter="drf" --help="false"
> --hostname_lookup="true" --http_authenticators="basic"
> --http_framework_authenticators="basic" --initialize_driver_logging="true"
> --log_auto_initialize="true" --logbufsecs="0" --logging_level="INFO"
> --max_agent_ping_timeouts="5" --max_completed_frameworks="50"
> --max_completed_tasks_per_framework="1000"
> --max_unreachable_tasks_per_framework="1000" --port="5050" --quiet="false"
> --recovery_agent_removal_limit="100%" --registry="in_memory"
> --registry_fetch_timeout="1mins" --registry_gc_interval="15mins"
> --registry_max_agent_age="2weeks" --registry_max_agent_count="102400"
> --registry_store_timeout="100secs" --registry_strict="false"
> --root_submissions="true" --user_sorter="drf" --version="false"
> --webui_dir="/usr/local/share/mesos/webui" --work_dir="/tmp/fTUJFH/master"
> --zk_session_timeout="10secs"
> I0711 22:59:15.177460 8488 master.cpp:494] Master only allowing
> authenticated frameworks to register
> I0711 22:59:15.177477 8488 master.cpp:508] Master only allowing
> authenticated agents to register
> I0711 22:59:15.177487 8488 master.cpp:521] Master only allowing
> authenticated HTTP frameworks to register
> I0711 22:59:15.177497 8488 credentials.hpp:37] Loading credentials for
> authentication from '/tmp/fTUJFH/credentials'
> I0711 22:59:15.177793 8488 master.cpp:566] Using default 'crammd5'
> authenticator
> I0711 22:59:15.177947 8488 http.cpp:1009] Creating default 'basic' HTTP
> authenticator for realm 'mesos-master-readonly'
> I0711 22:59:15.178114 8488 http.cpp:1009] Creating default 'basic' HTTP
> authenticator for realm 'mesos-master-readwrite'
> I0711 22:59:15.178267 8488 http.cpp:1009] Creating default 'basic' HTTP
> authenticator for realm 'mesos-master-scheduler'
> I0711 22:59:15.178421 8488 master.cpp:646] Authorization enabled
> I0711 22:59:15.178761 8482 hierarchical.cpp:171] Initialized hierarchical
> allocator process
> I0711 22:59:15.178794 8484 whitelist_watcher.cpp:77] No whitelist given
> I0711 22:59:15.182126 8486 master.cpp:2163] Elected as the leading master!
> I0711 22:59:15.182142 8486 master.cpp:1702] Recovering from registrar
> I0711 22:59:15.182231 8484 registrar.cpp:345] Recovering registrar
> I0711 22:59:15.182955 8484 registrar.cpp:389] Successfully fetched the
> registry (0B) in 500992ns
> I0711 22:59:15.183069 8484 registrar.cpp:493] Applied 1 operations in
> 42023ns; attempting to update the registry
> I0711 22:59:15.183899 8484 registrar.cpp:550] Successfully updated the
> registry in 573952ns
> I0711 22:59:15.184029 8484 registrar.cpp:422] Successfully recovered
> registrar
> I0711 22:59:15.184540 8486 hierarchical.cpp:209] Skipping recovery of
> hierarchical allocator: nothing to recover
> I0711 22:59:15.184530 8496 master.cpp:1801] Recovered 0 agents from the
> registry (129B); allowing 10mins for agents to re-register
> I0711 22:59:15.192324 608 containerizer.cpp:230] Using isolation:
> posix/cpu,posix/mem,filesystem/posix,network/cni,environment_secret
> W0711 22:59:15.193044 608 backend.cpp:76] Failed to create 'aufs' backend:
> AufsBackend requires root privileges
> W0711 22:59:15.193667 608 backend.cpp:76] Failed to create 'bind' backend:
> BindBackend requires root privileges
> I0711 22:59:15.193964 608 provisioner.cpp:255] Using default backend 'copy'
> I0711 22:59:15.196156 608 cluster.cpp:448] Creating default 'local'
> authorizer
> I0711 22:59:15.198931 8500 slave.cpp:250] Mesos agent started on
> (747)@172.17.0.4:32791
> I0711 22:59:15.199259 8500 slave.cpp:251] Flags at startup: --acls=""
> --appc_simple_discovery_uri_prefix="http://"
> --appc_store_dir="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/store/appc"
> --authenticate_http_executors="true" --authenticate_http_readonly="true"
> --authenticate_http_readwrite="true" --authenticatee="crammd5"
> --authentication_backoff_factor="1secs" --authorizer="local"
> --cgroups_cpu_enable_pids_and_tids_count="false" --cgroups_enable_cfs="false"
> --cgroups_hierarchy="/sys/fs/cgroup" --cgroups_limit_swap="false"
> --cgroups_root="mesos" --container_disk_watch_interval="15secs"
> --containerizers="mesos"
> --credential="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/credential"
> --default_role="*" --disk_watch_interval="1mins" --docker="docker"
> --docker_kill_orphans="true" --docker_registry="https://registry-1.docker.io"
> --docker_remove_delay="6hrs" --docker_socket="/var/run/docker.sock"
> --docker_stop_timeout="0ns"
> --docker_store_dir="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/store/docker"
> --docker_volume_checkpoint_dir="/var/run/mesos/isolators/docker/volume"
> --enforce_container_disk_quota="false"
> --executor_registration_timeout="1mins"
> --executor_reregistration_timeout="2secs"
> --executor_secret_key="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/executor_secret_key"
> --executor_shutdown_grace_period="5secs"
> --fetcher_cache_dir="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/fetch"
> --fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks"
> --gc_disk_headroom="0.1" --hadoop_home="" --help="false"
> --hostname_lookup="true" --http_command_executor="false"
> --http_credentials="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/http_credentials"
> --http_heartbeat_interval="30secs" --initialize_driver_logging="true"
> --isolation="posix/cpu,posix/mem" --launcher="posix"
> --launcher_dir="/mesos/build/src" --logbufsecs="0" --logging_level="INFO"
> --max_completed_executors_per_framework="150"
> --oversubscribed_resources_interval="15secs" --perf_duration="10secs"
> --perf_interval="1mins" --port="5051" --qos_correction_interval_min="0ns"
> --quiet="false" --recover="reconnect" --recovery_timeout="15mins"
> --registration_backoff_factor="10ms" --resources="disk(role1):1024"
> --revocable_cpu_low_priority="true"
> --runtime_dir="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y"
> --sandbox_directory="/mnt/mesos/sandbox" --strict="true"
> --switch_user="true" --systemd_enable_support="true"
> --systemd_runtime_directory="/run/systemd/system" --version="false"
> --work_dir="/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_0yLLp8"
> I0711 22:59:15.200019 8500 credentials.hpp:86] Loading credential for
> authentication from
> '/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/credential'
> I0711 22:59:15.200402 8500 slave.cpp:283] Agent using credential for:
> test-principal
> I0711 22:59:15.200629 8500 credentials.hpp:37] Loading credentials for
> authentication from
> '/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_xMDP9y/http_credentials'
> I0711 22:59:15.201143 8500 http.cpp:1009] Creating default 'basic' HTTP
> authenticator for realm 'mesos-agent-executor'
> I0711 22:59:15.201472 8500 http.cpp:1030] Creating default 'jwt' HTTP
> authenticator for realm 'mesos-agent-executor'
> I0711 22:59:15.201917 8500 http.cpp:1009] Creating default 'basic' HTTP
> authenticator for realm 'mesos-agent-readonly'
> I0711 22:59:15.202338 8500 http.cpp:1030] Creating default 'jwt' HTTP
> authenticator for realm 'mesos-agent-readonly'
> I0711 22:59:15.202867 8500 http.cpp:1009] Creating default 'basic' HTTP
> authenticator for realm 'mesos-agent-readwrite'
> I0711 22:59:15.203336 8500 http.cpp:1030] Creating default 'jwt' HTTP
> authenticator for realm 'mesos-agent-readwrite'
> I0711 22:59:15.205016 8500 slave.cpp:565] Agent resources:
> [{"name":"disk","reservations":[{"role":"role1","type":"STATIC"}],"scalar":{"value":1024.0},"type":"SCALAR"},{"name":"cpus","scalar":{"value":24.0},"type":"SCALAR"},{"name":"mem","scalar":{"value":95614.0},"type":"SCALAR"},{"name":"ports","ranges":{"range":[{"begin":31000,"end":32000}]},"type":"RANGES"}]
> I0711 22:59:15.205534 8500 slave.cpp:573] Agent attributes: [ ]
> I0711 22:59:15.205739 8500 slave.cpp:582] Agent hostname: 6e892dc05c61
> I0711 22:59:15.206575 8485 status_update_manager.cpp:177] Pausing sending
> status updates
> I0711 22:59:15.208063 8494 state.cpp:64] Recovering state from
> '/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_0yLLp8/meta'
> I0711 22:59:15.208487 8494 status_update_manager.cpp:203] Recovering status
> update manager
> I0711 22:59:15.208730 8489 containerizer.cpp:582] Recovering containerizer
> I0711 22:59:15.210507 8487 provisioner.cpp:416] Provisioner recovery complete
> I0711 22:59:15.210856 8484 slave.cpp:6207] Finished recovery
> I0711 22:59:15.211546 8484 slave.cpp:6389] Querying resource estimator for
> oversubscribable resources
> I0711 22:59:15.211982 8481 slave.cpp:6403] Received oversubscribable
> resources {} from the resource estimator
> I0711 22:59:15.212260 8491 slave.cpp:971] New master detected at
> master@172.17.0.4:32791
> I0711 22:59:15.212343 8500 status_update_manager.cpp:177] Pausing sending
> status updates
> I0711 22:59:15.212570 8491 slave.cpp:1006] Detecting new master
> I0711 22:59:15.223120 8500 slave.cpp:1033] Authenticating with master
> master@172.17.0.4:32791
> I0711 22:59:15.223232 8500 slave.cpp:1044] Using default CRAM-MD5
> authenticatee
> I0711 22:59:15.223508 8500 authenticatee.cpp:121] Creating new client SASL
> connection
> I0711 22:59:15.223800 8500 master.cpp:7773] Authenticating
> slave(747)@172.17.0.4:32791
> I0711 22:59:15.223963 8500 authenticator.cpp:414] Starting authentication
> session for crammd5-authenticatee(1376)@172.17.0.4:32791
> I0711 22:59:15.224268 8500 authenticator.cpp:98] Creating new server SASL
> connection
> I0711 22:59:15.224467 8500 authenticatee.cpp:213] Received SASL
> authentication mechanisms: CRAM-MD5
> I0711 22:59:15.224488 8500 authenticatee.cpp:239] Attempting to authenticate
> with mechanism 'CRAM-MD5'
> I0711 22:59:15.224565 8500 authenticator.cpp:204] Received SASL
> authentication start
> I0711 22:59:15.224609 8500 authenticator.cpp:326] Authentication requires
> more steps
> I0711 22:59:15.224685 8500 authenticatee.cpp:259] Received SASL
> authentication step
> I0711 22:59:15.224761 8500 authenticator.cpp:232] Received SASL
> authentication step
> I0711 22:59:15.224781 8500 auxprop.cpp:109] Request to lookup properties for
> user: 'test-principal' realm: '6e892dc05c61' server FQDN: '6e892dc05c61'
> SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false
> SASL_AUXPROP_AUTHZID: false
> I0711 22:59:15.224792 8500 auxprop.cpp:181] Looking up auxiliary property
> '*userPassword'
> I0711 22:59:15.224828 8500 auxprop.cpp:181] Looking up auxiliary property
> '*cmusaslsecretCRAM-MD5'
> I0711 22:59:15.224845 8500 auxprop.cpp:109] Request to lookup properties for
> user: 'test-principal' realm: '6e892dc05c61' server FQDN: '6e892dc05c61'
> SASL_AUXPROP_VERIFY_AGAINST_HASH: false SASL_AUXPROP_OVERRIDE: false
> SASL_AUXPROP_AUTHZID: true
> I0711 22:59:15.224853 8500 auxprop.cpp:131] Skipping auxiliary property
> '*userPassword' since SASL_AUXPROP_AUTHZID == true
> I0711 22:59:15.224859 8500 auxprop.cpp:131] Skipping auxiliary property
> '*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
> I0711 22:59:15.224874 8500 authenticator.cpp:318] Authentication success
> I0711 22:59:15.225013 8500 authenticatee.cpp:299] Authentication success
> I0711 22:59:15.225075 8500 master.cpp:7803] Successfully authenticated
> principal 'test-principal' at slave(747)@172.17.0.4:32791
> I0711 22:59:15.225141 8500 authenticator.cpp:432] Authentication session
> cleanup for crammd5-authenticatee(1376)@172.17.0.4:32791
> I0711 22:59:15.225407 8500 slave.cpp:1128] Successfully authenticated with
> master master@172.17.0.4:32791
> I0711 22:59:15.225633 8500 slave.cpp:1572] Will retry registration in
> 783987ns if necessary
> I0711 22:59:15.225926 8500 master.cpp:5677] Received register agent message
> from slave(747)@172.17.0.4:32791 (6e892dc05c61)
> I0711 22:59:15.225955 8500 master.cpp:3773] Authorizing agent with principal
> 'test-principal'
> I0711 22:59:15.226485 8500 master.cpp:5737] Authorized registration of agent
> at slave(747)@172.17.0.4:32791 (6e892dc05c61)
> I0711 22:59:15.226609 8500 master.cpp:5830] Registering agent at
> slave(747)@172.17.0.4:32791 (6e892dc05c61) with id
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0
> I0711 22:59:15.227051 8491 registrar.cpp:493] Applied 1 operations in
> 89228ns; attempting to update the registry
> I0711 22:59:15.227352 8500 slave.cpp:1572] Will retry registration in
> 25.413469ms if necessary
> I0711 22:59:15.227893 8491 registrar.cpp:550] Successfully updated the
> registry in 773120ns
> I0711 22:59:15.227954 8502 master.cpp:5671] Ignoring register agent message
> from slave(747)@172.17.0.4:32791 (6e892dc05c61) as registration is already in
> progress
> I0711 22:59:15.228094 8491 master.cpp:5877] Admitted agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791
> (6e892dc05c61)
> I0711 22:59:15.228929 8492 slave.cpp:4905] Received ping from
> slave-observer(680)@172.17.0.4:32791
> I0711 22:59:15.229359 8482 slave.cpp:1174] Registered with master
> master@172.17.0.4:32791; given agent ID
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0
> I0711 22:59:15.229082 8491 master.cpp:5908] Registered agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791
> (6e892dc05c61) with
> [{"name":"disk","reservations":[{"role":"role1","type":"STATIC"}],"scalar":{"value":1024.0},"type":"SCALAR"},{"name":"cpus","scalar":{"value":24.0},"type":"SCALAR"},{"name":"mem","scalar":{"value":95614.0},"type":"SCALAR"},{"name":"ports","ranges":{"range":[{"begin":31000,"end":32000}]},"type":"RANGES"}]
> I0711 22:59:15.233381 8491 process.cpp:3820] Handling HTTP event for process
> 'master' with path: '/master/api/v1'
> I0711 22:59:15.229657 8486 status_update_manager.cpp:184] Resuming sending
> status updates
> I0711 22:59:15.235273 8491 http.cpp:1149] HTTP POST for /master/api/v1 from
> 172.17.0.4:38046
> I0711 22:59:15.235756 8482 slave.cpp:1194] Checkpointing SlaveInfo to
> '/tmp/bool_MasterTestPrePostReservationRefinement_CreateAndDestroyVolumesV1_1_0yLLp8/meta/slaves/838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0/slave.info'
> I0711 22:59:15.229573 8492 hierarchical.cpp:593] Added agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 (6e892dc05c61) with
> disk(reservations: [(STATIC,role1)]):1024; cpus:24; mem:95614;
> ports:[31000-32000] (allocated: {})
> I0711 22:59:15.236582 8492 hierarchical.cpp:1925] No allocations performed
> I0711 22:59:15.237116 8492 hierarchical.cpp:1468] Performed allocation for 1
> agents in 695738ns
> I0711 22:59:15.237339 8491 http.cpp:660] Processing call CREATE_VOLUMES
> I0711 22:59:15.237845 8491 master.cpp:3693] Authorizing principal
> 'test-principal' to create volumes
> '[{"disk":{"persistence":{"id":"id1","principal":"test-principal"},"volume":{"container_path":"path1","mode":"RW"}},"name":"disk","reservations":[{"role":"role1","type":"STATIC"}],"scalar":{"value":64.0},"type":"SCALAR"}]'
> I0711 22:59:15.236997 8482 slave.cpp:1232] Forwarding total oversubscribed
> resources {}
> I0711 22:59:15.239473 8482 master.cpp:6624] Received update of agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791
> (6e892dc05c61) with total oversubscribed resources {}
> I0711 22:59:15.242722 8495 master.cpp:9041] Sending updated checkpointed
> resources disk(reservations: [(STATIC,role1)])[id1:path1]:64 to agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791
> (6e892dc05c61)
> I0711 22:59:15.244683 8495 slave.cpp:3453] Updated checkpointed resources
> from {} to disk(reservations: [(STATIC,role1)])[id1:path1]:64
> I0711 22:59:15.247383 8487 hierarchical.cpp:660] Agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 (6e892dc05c61) updated with total
> resources disk(reservations: [(STATIC,role1)]):1024; cpus:24; mem:95614;
> ports:[31000-32000]
> I0711 22:59:15.248096 8487 hierarchical.cpp:1925] No allocations performed
> I0711 22:59:15.248278 8487 hierarchical.cpp:1468] Performed allocation for 1
> agents in 340150ns
> I0711 22:59:15.248538 8493 process.cpp:3820] Handling HTTP event for process
> 'master' with path: '/master/api/v1'
> I0711 22:59:15.250051 8489 http.cpp:1149] HTTP POST for /master/api/v1 from
> 172.17.0.4:38048
> I0711 22:59:15.250254 8489 http.cpp:660] Processing call DESTROY_VOLUMES
> I0711 22:59:15.250676 8489 master.cpp:3745] Authorizing principal
> 'test-principal' to destroy volumes
> '[{"disk":{"persistence":{"id":"id1","principal":"test-principal"},"volume":{"container_path":"path1","mode":"RW"}},"name":"disk","reservations":[{"role":"role1","type":"STATIC"}],"scalar":{"value":64.0},"type":"SCALAR"}]'
> /mesos/src/tests/master_tests.cpp:8455: Failure
> Value of: (v1DestroyVolumesResponse).get().status
> Actual: "409 Conflict"
> Expected: Accepted().status
> Which is: "202 Accepted"
> I0711 22:59:15.260431 8492 slave.cpp:843] Agent terminating
> I0711 22:59:15.260645 8492 master.cpp:1318] Agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791
> (6e892dc05c61) disconnected
> I0711 22:59:15.260676 8492 master.cpp:3271] Disconnecting agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791
> (6e892dc05c61)
> I0711 22:59:15.261204 8492 master.cpp:3290] Deactivating agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 at slave(747)@172.17.0.4:32791
> (6e892dc05c61)
> I0711 22:59:15.261584 8496 hierarchical.cpp:690] Agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0 deactivated
> I0711 22:59:15.267756 608 master.cpp:1160] Master terminating
> I0711 22:59:15.268867 8485 hierarchical.cpp:626] Removed agent
> 838c7e1d-60d1-4aa7-8918-397da9ebcfa7-S0
> [ FAILED ]
> bool/MasterTestPrePostReservationRefinement.CreateAndDestroyVolumesV1/1,
> where GetParam() = false (99 ms)
> {noformat}