[ 
https://issues.apache.org/jira/browse/MESOS-3422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14745611#comment-14745611
 ] 

Guangya Liu commented on MESOS-3422:
------------------------------------

[[email protected]] I'm sorry that I updated the problem description by 
mistake, can you please help update the description again? Thanks!

> MasterSlaveReconciliationTest.ReconcileLostTask test is flaky
> -------------------------------------------------------------
>
>                 Key: MESOS-3422
>                 URL: https://issues.apache.org/jira/browse/MESOS-3422
>             Project: Mesos
>          Issue Type: Bug
>          Components: technical debt, test
>    Affects Versions: 0.25.0
>         Environment: CentOS
>            Reporter: Vinod Kone
>
> [==========] Running 1 test from 1 test case.
> [----------] Global test environment set-up.
> [----------] 1 test from MasterSlaveReconciliationTest
> [ RUN      ] MasterSlaveReconciliationTest.ReconcileLostTask
> Using temporary directory 
> '/tmp/MasterSlaveReconciliationTest_ReconcileLostTask_2tUQZn'
> I0915 22:28:40.800787  3733 leveldb.cpp:176] Opened db in 252.206266ms
> I0915 22:28:40.851069  3733 leveldb.cpp:183] Compacted db in 50.197346ms
> I0915 22:28:40.851210  3733 leveldb.cpp:198] Created db iterator in 63324ns
> I0915 22:28:40.851256  3733 leveldb.cpp:204] Seeked to beginning of db in 
> 4562ns
> I0915 22:28:40.851286  3733 leveldb.cpp:273] Iterated through 0 keys in the 
> db in 322ns
> I0915 22:28:40.871953  3733 replica.cpp:744] Replica recovered with log 
> positions 0 -> 0 with 1 holes and 0 unlearned
> I0915 22:28:40.886368  3756 recover.cpp:449] Starting replica recovery
> I0915 22:28:40.903333  3756 recover.cpp:475] Replica is in EMPTY status
> I0915 22:28:40.916332  3759 replica.cpp:641] Replica in EMPTY status received 
> a broadcasted recover request
> I0915 22:28:40.917351  3756 recover.cpp:195] Received a recover response from 
> a replica in EMPTY status
> I0915 22:28:40.918557  3755 recover.cpp:566] Updating replica status to 
> STARTING
> I0915 22:28:40.928189  3759 master.cpp:380] Master 
> 20150915-222840-16842879-54960-3733 (devstack007.cn.ibm.com) started on 
> 127.0.1.1:54960
> I0915 22:28:40.928261  3759 master.cpp:382] Flags at startup: --acls="" 
> --allocation_interval="1secs" --allocator="HierarchicalDRF" 
> --authenticate="true" --authenticate_slaves="true" --authenticators="crammd5" 
> --authorizers="local" 
> --credentials="/tmp/MasterSlaveReconciliationTest_ReconcileLostTask_2tUQZn/credentials"
>  --framework_sorter="drf" --help="false" --initialize_driver_logging="true" 
> --log_auto_initialize="true" --logbufsecs="0" --logging_level="INFO" 
> --max_slave_ping_timeouts="5" --quiet="false" 
> --recovery_slave_removal_limit="100%" --registry="replicated_log" 
> --registry_fetch_timeout="1mins" --registry_store_timeout="25secs" 
> --registry_strict="true" --root_submissions="true" 
> --slave_ping_timeout="15secs" --slave_reregister_timeout="10mins" 
> --user_sorter="drf" --version="false" 
> --webui_dir="/usr/local/share/mesos/webui" 
> --work_dir="/tmp/MasterSlaveReconciliationTest_ReconcileLostTask_2tUQZn/master"
>  --zk_session_timeout="10secs"
> I0915 22:28:40.993895  3759 master.cpp:427] Master only allowing 
> authenticated frameworks to register
> I0915 22:28:40.993962  3759 master.cpp:432] Master only allowing 
> authenticated slaves to register
> I0915 22:28:40.994010  3759 credentials.hpp:37] Loading credentials for 
> authentication from 
> '/tmp/MasterSlaveReconciliationTest_ReconcileLostTask_2tUQZn/credentials'
> I0915 22:28:40.994776  3759 master.cpp:471] Using default 'crammd5' 
> authenticator
> I0915 22:28:40.995053  3759 authenticator.cpp:512] Initializing server SASL
> I0915 22:28:41.009496  3757 leveldb.cpp:306] Persisting metadata (8 bytes) to 
> leveldb took 90.341573ms
> I0915 22:28:41.009570  3757 replica.cpp:323] Persisted replica status to 
> STARTING
> I0915 22:28:41.010040  3756 recover.cpp:475] Replica is in STARTING status
> I0915 22:28:41.011255  3757 replica.cpp:641] Replica in STARTING status 
> received a broadcasted recover request
> I0915 22:28:41.011551  3752 recover.cpp:195] Received a recover response from 
> a replica in STARTING status
> I0915 22:28:41.012073  3756 recover.cpp:566] Updating replica status to VOTING
> I0915 22:28:41.084720  3753 leveldb.cpp:306] Persisting metadata (8 bytes) to 
> leveldb took 72.469042ms
> I0915 22:28:41.084803  3753 replica.cpp:323] Persisted replica status to 
> VOTING
> I0915 22:28:41.084935  3752 recover.cpp:580] Successfully joined the Paxos 
> group
> I0915 22:28:41.085227  3752 recover.cpp:464] Recover process terminated
> I0915 22:28:41.191287  3759 auxprop.cpp:66] Initialized in-memory auxiliary 
> property plugin
> I0915 22:28:41.191455  3759 master.cpp:508] Authorization enabled
> I0915 22:28:41.192039  3758 hierarchical.hpp:408] Initialized hierarchical 
> allocator process
> I0915 22:28:41.210978  3752 whitelist_watcher.cpp:79] No whitelist given
> I0915 22:28:41.226894  3757 master.cpp:1605] The newly elected leader is 
> [email protected]:54960 with id 20150915-222840-16842879-54960-3733
> I0915 22:28:41.227022  3757 master.cpp:1618] Elected as the leading master!
> I0915 22:28:41.227073  3757 master.cpp:1378] Recovering from registrar
> I0915 22:28:41.227442  3756 registrar.cpp:309] Recovering registrar
> I0915 22:28:41.228864  3759 log.cpp:661] Attempting to start the writer
> I0915 22:28:41.231155  3754 replica.cpp:477] Replica received implicit 
> promise request with proposal 1
> I0915 22:28:41.276180  3754 leveldb.cpp:306] Persisting metadata (8 bytes) to 
> leveldb took 44.960628ms
> I0915 22:28:41.276265  3754 replica.cpp:345] Persisted promised to 1
> I0915 22:28:41.277185  3755 coordinator.cpp:231] Coordinator attemping to 
> fill missing position
> I0915 22:28:41.279559  3755 replica.cpp:378] Replica received explicit 
> promise request for position 0 with proposal 2
> I0915 22:28:41.317904  3755 leveldb.cpp:343] Persisting action (8 bytes) to 
> leveldb took 38.28625ms
> I0915 22:28:41.317952  3755 replica.cpp:679] Persisted action at 0
> I0915 22:28:41.318975  3756 replica.cpp:511] Replica received write request 
> for position 0
> I0915 22:28:41.319077  3756 leveldb.cpp:438] Reading position from leveldb 
> took 48432ns
> I0915 22:28:41.351290  3756 leveldb.cpp:343] Persisting action (14 bytes) to 
> leveldb took 32.131668ms
> I0915 22:28:41.351372  3756 replica.cpp:679] Persisted action at 0
> I0915 22:28:41.352147  3755 replica.cpp:658] Replica received learned notice 
> for position 0
> I0915 22:28:41.384781  3755 leveldb.cpp:343] Persisting action (16 bytes) to 
> leveldb took 32.568205ms
> I0915 22:28:41.384858  3755 replica.cpp:679] Persisted action at 0
> I0915 22:28:41.384902  3755 replica.cpp:664] Replica learned NOP action at 
> position 0
> I0915 22:28:41.385823  3753 log.cpp:677] Writer started with ending position 0
> I0915 22:28:41.388413  3754 leveldb.cpp:438] Reading position from leveldb 
> took 41960ns
> I0915 22:28:41.391221  3759 registrar.cpp:342] Successfully fetched the 
> registry (0B) in 163.655936ms
> I0915 22:28:41.391530  3759 registrar.cpp:441] Applied 1 operations in 
> 83084ns; attempting to update the 'registry'
> I0915 22:28:41.395333  3752 log.cpp:685] Attempting to append 188 bytes to 
> the log
> I0915 22:28:41.395625  3757 coordinator.cpp:341] Coordinator attempting to 
> write APPEND action at position 1
> I0915 22:28:41.396404  3753 replica.cpp:511] Replica received write request 
> for position 1
> I0915 22:28:41.434862  3753 leveldb.cpp:343] Persisting action (207 bytes) to 
> leveldb took 38.376695ms
> I0915 22:28:41.434942  3753 replica.cpp:679] Persisted action at 1
> I0915 22:28:41.435797  3758 replica.cpp:658] Replica received learned notice 
> for position 1
> I0915 22:28:41.484905  3758 leveldb.cpp:343] Persisting action (209 bytes) to 
> leveldb took 49.03218ms
> I0915 22:28:41.484977  3758 replica.cpp:679] Persisted action at 1
> I0915 22:28:41.485021  3758 replica.cpp:664] Replica learned APPEND action at 
> position 1
> I0915 22:28:41.486634  3759 registrar.cpp:486] Successfully updated the 
> 'registry' in 94.96704ms
> I0915 22:28:41.486788  3759 registrar.cpp:372] Successfully recovered 
> registrar
> I0915 22:28:41.486871  3752 log.cpp:704] Attempting to truncate the log to 1
> I0915 22:28:41.487041  3753 coordinator.cpp:341] Coordinator attempting to 
> write TRUNCATE action at position 2
> I0915 22:28:41.487397  3758 master.cpp:1415] Recovered 0 slaves from the 
> Registry (149B) ; allowing 10mins for slaves to re-register
> I0915 22:28:41.488390  3754 replica.cpp:511] Replica received write request 
> for position 2
> I0915 22:28:41.518287  3754 leveldb.cpp:343] Persisting action (16 bytes) to 
> leveldb took 29.818009ms
> I0915 22:28:41.518383  3754 replica.cpp:679] Persisted action at 2
> I0915 22:28:41.519301  3753 replica.cpp:658] Replica received learned notice 
> for position 2
> I0915 22:28:41.551661  3753 leveldb.cpp:343] Persisting action (18 bytes) to 
> leveldb took 32.172645ms
> I0915 22:28:41.551758  3753 leveldb.cpp:401] Deleting ~1 keys from leveldb 
> took 45547ns
> I0915 22:28:41.551798  3753 replica.cpp:679] Persisted action at 2
> I0915 22:28:41.551862  3753 replica.cpp:664] Replica learned TRUNCATE action 
> at position 2
> I0915 22:28:41.582856  3733 containerizer.cpp:160] Using isolation: 
> posix/cpu,posix/mem,filesystem/posix
> I0915 22:28:41.612773  3752 slave.cpp:190] Slave started on 1)@127.0.1.1:54960
> I0915 22:28:41.612828  3752 slave.cpp:191] Flags at startup: 
> --appc_provisioner_backend="copy" --appc_store_dir="/tmp/mesos/store/appc" 
> --authenticatee="crammd5" --cgroups_cpu_enable_pids_and_tids_count="false" 
> --cgroups_enable_cfs="false" --cgroups_hierarchy="/sys/fs/cgroup" 
> --cgroups_limit_swap="false" --cgroups_root="mesos" 
> --container_disk_watch_interval="15secs" --containerizers="mesos" 
> --credential="/tmp/MasterSlaveReconciliationTest_ReconcileLostTask_tymaz7/credential"
>  --default_role="*" --disk_watch_interval="1mins" --docker="docker" 
> --docker_kill_orphans="true" --docker_remove_delay="6hrs" 
> --docker_socket="/var/run/docker.sock" --docker_stop_timeout="0ns" 
> --enforce_container_disk_quota="false" 
> --executor_registration_timeout="1mins" 
> --executor_shutdown_grace_period="5secs" 
> --fetcher_cache_dir="/tmp/MasterSlaveReconciliationTest_ReconcileLostTask_tymaz7/fetch"
>  --fetcher_cache_size="2GB" --frameworks_home="" --gc_delay="1weeks" 
> --gc_disk_headroom="0.1" --hadoop_home="" --help="false" 
> --initialize_driver_logging="true" --isolation="posix/cpu,posix/mem" 
> --launcher_dir="/home/gyliu/src/mesos/bug-fix/mesos/build/src" 
> --logbufsecs="0" --logging_level="INFO" 
> --oversubscribed_resources_interval="15secs" --perf_duration="10secs" 
> --perf_interval="1mins" --qos_correction_interval_min="0ns" --quiet="false" 
> --recover="reconnect" --recovery_timeout="15mins" 
> --registration_backoff_factor="10ms" --resource_monitoring_interval="1secs" 
> --resources="cpus:2;mem:1024;disk:1024;ports:[31000-32000]" 
> --revocable_cpu_low_priority="true" --sandbox_directory="/mnt/mesos/sandbox" 
> --strict="true" --switch_user="true" --version="false" 
> --work_dir="/tmp/MasterSlaveReconciliationTest_ReconcileLostTask_tymaz7"
> I0915 22:28:41.613301  3752 credentials.hpp:85] Loading credential for 
> authentication from 
> '/tmp/MasterSlaveReconciliationTest_ReconcileLostTask_tymaz7/credential'
> I0915 22:28:41.613534  3752 slave.cpp:321] Slave using credential for: 
> test-principal
> I0915 22:28:41.614586  3752 slave.cpp:354] Slave resources: cpus(*):2; 
> mem(*):1024; disk(*):1024; ports(*):[31000-32000]
> I0915 22:28:41.614718  3752 slave.cpp:384] Slave hostname: 
> devstack007.cn.ibm.com
> I0915 22:28:41.614749  3752 slave.cpp:389] Slave checkpoint: true
> I0915 22:28:41.616605  3756 state.cpp:54] Recovering state from 
> '/tmp/MasterSlaveReconciliationTest_ReconcileLostTask_tymaz7/meta'
> I0915 22:28:41.617005  3756 status_update_manager.cpp:202] Recovering status 
> update manager
> I0915 22:28:41.617219  3756 containerizer.cpp:396] Recovering containerizer
> I0915 22:28:41.618137  3733 sched.cpp:164] Version: 0.25.0
> I0915 22:28:41.620059  3752 sched.cpp:262] New master detected at 
> [email protected]:54960
> I0915 22:28:41.620291  3752 sched.cpp:318] Authenticating with master 
> [email protected]:54960
> I0915 22:28:41.620337  3752 sched.cpp:325] Using default CRAM-MD5 
> authenticatee
> I0915 22:28:41.620400  3758 slave.cpp:4077] Finished recovery
> I0915 22:28:41.620708  3759 authenticatee.cpp:91] Initializing client SASL
> I0915 22:28:41.620842  3758 slave.cpp:4234] Querying resource estimator for 
> oversubscribable resources
> I0915 22:28:41.620873  3759 authenticatee.cpp:115] Creating new client SASL 
> connection
> I0915 22:28:41.621410  3752 master.cpp:5089] Authenticating 
> [email protected]:54960
> I0915 22:28:41.621544  3757 authenticator.cpp:407] Starting authentication 
> session for crammd5_authenticatee(1)@127.0.1.1:54960
> I0915 22:28:41.621572  3758 slave.cpp:692] New master detected at 
> [email protected]:54960
> I0915 22:28:41.621656  3758 slave.cpp:755] Authenticating with master 
> [email protected]:54960
> I0915 22:28:41.621686  3758 slave.cpp:760] Using default CRAM-MD5 
> authenticatee
> I0915 22:28:41.621772  3752 status_update_manager.cpp:176] Pausing sending 
> status updates
> I0915 22:28:41.621888  3755 authenticatee.cpp:115] Creating new client SASL 
> connection
> I0915 22:28:41.621942  3758 slave.cpp:728] Detecting new master
> I0915 22:28:41.622141  3758 slave.cpp:4248] Received oversubscribable 
> resources  from the resource estimator
> I0915 22:28:41.621975  3753 authenticator.cpp:92] Creating new server SASL 
> connection
> I0915 22:28:41.622253  3754 master.cpp:5089] Authenticating 
> slave(1)@127.0.1.1:54960
> I0915 22:28:41.622418  3756 authenticator.cpp:407] Starting authentication 
> session for crammd5_authenticatee(2)@127.0.1.1:54960
> I0915 22:28:41.622560  3757 authenticator.cpp:92] Creating new server SASL 
> connection
> I0915 22:28:41.624449  3759 authenticatee.cpp:206] Received SASL 
> authentication mechanisms: CRAM-MD5
> I0915 22:28:41.624451  3755 authenticatee.cpp:206] Received SASL 
> authentication mechanisms: CRAM-MD5
> I0915 22:28:41.624485  3759 authenticatee.cpp:232] Attempting to authenticate 
> with mechanism 'CRAM-MD5'
> I0915 22:28:41.624511  3755 authenticatee.cpp:232] Attempting to authenticate 
> with mechanism 'CRAM-MD5'
> I0915 22:28:41.624595  3759 authenticator.cpp:197] Received SASL 
> authentication start
> I0915 22:28:41.624606  3755 authenticator.cpp:197] Received SASL 
> authentication start
> I0915 22:28:41.624666  3759 authenticator.cpp:319] Authentication requires 
> more steps
> I0915 22:28:41.624698  3755 authenticator.cpp:319] Authentication requires 
> more steps
> I0915 22:28:41.624742  3759 authenticatee.cpp:252] Received SASL 
> authentication step
> I0915 22:28:41.624788  3755 authenticatee.cpp:252] Received SASL 
> authentication step
> I0915 22:28:41.624846  3759 authenticator.cpp:225] Received SASL 
> authentication step
> I0915 22:28:41.624876  3759 auxprop.cpp:102] Request to lookup properties for 
> user: 'test-principal' realm: 'devstack007.cn.ibm.com' server FQDN: 
> 'devstack007.cn.ibm.com' SASL_AUXPROP_VERIFY_AGAINST_HASH: false 
> SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: false 
> I0915 22:28:41.624879  3755 authenticator.cpp:225] Received SASL 
> authentication step
> I0915 22:28:41.624898  3759 auxprop.cpp:174] Looking up auxiliary property 
> '*userPassword'
> I0915 22:28:41.624927  3755 auxprop.cpp:102] Request to lookup properties for 
> user: 'test-principal' realm: 'devstack007.cn.ibm.com' server FQDN: 
> 'devstack007.cn.ibm.com' SASL_AUXPROP_VERIFY_AGAINST_HASH: false 
> SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: false 
> I0915 22:28:41.624956  3755 auxprop.cpp:174] Looking up auxiliary property 
> '*userPassword'
> I0915 22:28:41.625015  3759 auxprop.cpp:174] Looking up auxiliary property 
> '*cmusaslsecretCRAM-MD5'
> I0915 22:28:41.625020  3755 auxprop.cpp:174] Looking up auxiliary property 
> '*cmusaslsecretCRAM-MD5'
> I0915 22:28:41.625049  3759 auxprop.cpp:102] Request to lookup properties for 
> user: 'test-principal' realm: 'devstack007.cn.ibm.com' server FQDN: 
> 'devstack007.cn.ibm.com' SASL_AUXPROP_VERIFY_AGAINST_HASH: false 
> SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: true 
> I0915 22:28:41.625067  3759 auxprop.cpp:124] Skipping auxiliary property 
> '*userPassword' since SASL_AUXPROP_AUTHZID == true
> I0915 22:28:41.625071  3755 auxprop.cpp:102] Request to lookup properties for 
> user: 'test-principal' realm: 'devstack007.cn.ibm.com' server FQDN: 
> 'devstack007.cn.ibm.com' SASL_AUXPROP_VERIFY_AGAINST_HASH: false 
> SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: true 
> I0915 22:28:41.625080  3759 auxprop.cpp:124] Skipping auxiliary property 
> '*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
> I0915 22:28:41.625102  3755 auxprop.cpp:124] Skipping auxiliary property 
> '*userPassword' since SASL_AUXPROP_AUTHZID == true
> I0915 22:28:41.625118  3759 authenticator.cpp:311] Authentication success
> I0915 22:28:41.625123  3755 auxprop.cpp:124] Skipping auxiliary property 
> '*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
> I0915 22:28:41.625154  3755 authenticator.cpp:311] Authentication success
> I0915 22:28:41.625191  3754 authenticatee.cpp:292] Authentication success
> I0915 22:28:41.625288  3753 authenticatee.cpp:292] Authentication success
> I0915 22:28:41.625375  3752 master.cpp:5119] Successfully authenticated 
> principal 'test-principal' at 
> [email protected]:54960
> I0915 22:28:41.625401  3759 authenticator.cpp:425] Authentication session 
> cleanup for crammd5_authenticatee(1)@127.0.1.1:54960
> I0915 22:28:41.625497  3754 sched.cpp:407] Successfully authenticated with 
> master [email protected]:54960
> I0915 22:28:41.625535  3754 sched.cpp:714] Sending SUBSCRIBE call to 
> [email protected]:54960
> I0915 22:28:41.625500  3752 master.cpp:5119] Successfully authenticated 
> principal 'test-principal' at slave(1)@127.0.1.1:54960
> I0915 22:28:41.625704  3759 authenticator.cpp:425] Authentication session 
> cleanup for crammd5_authenticatee(2)@127.0.1.1:54960
> I0915 22:28:41.625695  3754 sched.cpp:747] Will retry registration in 
> 810.177326ms if necessary
> I0915 22:28:41.625833  3755 master.cpp:2174] Received SUBSCRIBE call for 
> framework 'default' at 
> [email protected]:54960
> I0915 22:28:41.625833  3758 slave.cpp:823] Successfully authenticated with 
> master [email protected]:54960
> I0915 22:28:41.625954  3755 master.cpp:1644] Authorizing framework principal 
> 'test-principal' to receive offers for role '*'
> I0915 22:28:41.626005  3758 slave.cpp:1217] Will retry registration in 
> 1.558741ms if necessary
> I0915 22:28:41.628494  3758 slave.cpp:1217] Will retry registration in 
> 4.690825ms if necessary
> I0915 22:28:41.634006  3756 slave.cpp:1217] Will retry registration in 
> 12.908225ms if necessary
> I0915 22:28:41.636726  3755 master.cpp:3816] Registering slave at 
> slave(1)@127.0.1.1:54960 (devstack007.cn.ibm.com) with id 
> 20150915-222840-16842879-54960-3733-S0
> I0915 22:28:41.637073  3755 master.cpp:2244] Subscribing framework default 
> with checkpointing disabled and capabilities [  ]
> I0915 22:28:41.637322  3753 registrar.cpp:441] Applied 1 operations in 
> 55510ns; attempting to update the 'registry'
> I0915 22:28:41.637547  3752 hierarchical.hpp:453] Added framework 
> 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.637742  3755 master.cpp:3804] Ignoring register slave message 
> from slave(1)@127.0.1.1:54960 (devstack007.cn.ibm.com) as admission is 
> already in progress
> I0915 22:28:41.637774  3759 sched.cpp:641] Framework registered with 
> 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.637926  3752 hierarchical.hpp:1147] No resources available to 
> allocate!
> I0915 22:28:41.638077  3752 hierarchical.hpp:1230] No inverse offers to send 
> out!
> I0915 22:28:41.638087  3755 master.cpp:3804] Ignoring register slave message 
> from slave(1)@127.0.1.1:54960 (devstack007.cn.ibm.com) as admission is 
> already in progress
> I0915 22:28:41.638108  3752 hierarchical.hpp:1047] Performed allocation for 0 
> slaves in 202137ns
> I0915 22:28:41.638044  3759 sched.cpp:655] Scheduler::registered took 39399ns
> I0915 22:28:41.640064  3754 log.cpp:685] Attempting to append 366 bytes to 
> the log
> I0915 22:28:41.640266  3755 coordinator.cpp:341] Coordinator attempting to 
> write APPEND action at position 3
> I0915 22:28:41.640923  3756 replica.cpp:511] Replica received write request 
> for position 3
> I0915 22:28:41.647588  3752 slave.cpp:1217] Will retry registration in 
> 148.576472ms if necessary
> I0915 22:28:41.647784  3758 master.cpp:3804] Ignoring register slave message 
> from slave(1)@127.0.1.1:54960 (devstack007.cn.ibm.com) as admission is 
> already in progress
> I0915 22:28:41.693835  3756 leveldb.cpp:343] Persisting action (385 bytes) to 
> leveldb took 52.865341ms
> I0915 22:28:41.693887  3756 replica.cpp:679] Persisted action at 3
> I0915 22:28:41.694504  3752 replica.cpp:658] Replica received learned notice 
> for position 3
> I0915 22:28:41.727273  3752 leveldb.cpp:343] Persisting action (387 bytes) to 
> leveldb took 32.694232ms
> I0915 22:28:41.727345  3752 replica.cpp:679] Persisted action at 3
> I0915 22:28:41.727392  3752 replica.cpp:664] Replica learned APPEND action at 
> position 3
> I0915 22:28:41.728603  3754 registrar.cpp:486] Successfully updated the 
> 'registry' in 91.195136ms
> I0915 22:28:41.728809  3755 log.cpp:704] Attempting to truncate the log to 3
> I0915 22:28:41.728914  3754 coordinator.cpp:341] Coordinator attempting to 
> write TRUNCATE action at position 4
> I0915 22:28:41.729423  3753 replica.cpp:511] Replica received write request 
> for position 4
> I0915 22:28:41.729729  3755 slave.cpp:3105] Received ping from 
> slave-observer(1)@127.0.1.1:54960
> I0915 22:28:41.729907  3758 master.cpp:3884] Registered slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com) with cpus(*):2; mem(*):1024; disk(*):1024; 
> ports(*):[31000-32000]
> I0915 22:28:41.730042  3755 hierarchical.hpp:612] Added slave 
> 20150915-222840-16842879-54960-3733-S0 (devstack007.cn.ibm.com) with 
> cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] (allocated: )
> I0915 22:28:41.730104  3754 slave.cpp:867] Registered with master 
> [email protected]:54960; given slave ID 20150915-222840-16842879-54960-3733-S0
> I0915 22:28:41.730140  3754 fetcher.cpp:77] Clearing fetcher cache
> I0915 22:28:41.730262  3757 status_update_manager.cpp:183] Resuming sending 
> status updates
> I0915 22:28:41.730599  3755 hierarchical.hpp:1230] No inverse offers to send 
> out!
> I0915 22:28:41.730633  3755 hierarchical.hpp:1065] Performed allocation for 
> slave 20150915-222840-16842879-54960-3733-S0 in 548692ns
> I0915 22:28:41.731081  3759 master.cpp:4918] Sending 1 offers to framework 
> 20150915-222840-16842879-54960-3733-0000 (default) at 
> [email protected]:54960
> I0915 22:28:41.731506  3759 sched.cpp:811] Scheduler::resourceOffers took 
> 76181ns
> I0915 22:28:41.733994  3758 master.cpp:2878] Processing ACCEPT call for 
> offers: [ 20150915-222840-16842879-54960-3733-O0 ] on slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com) for framework 
> 20150915-222840-16842879-54960-3733-0000 (default) at 
> [email protected]:54960
> I0915 22:28:41.734100  3758 master.cpp:2682] Authorizing framework principal 
> 'test-principal' to launch task 1 as user 'gyliu'
> W0915 22:28:41.736322  3758 validation.cpp:419] Executor default for task 1 
> uses less CPUs (None) than the minimum required (0.01). Please update your 
> executor, as this will be mandatory in future releases.
> W0915 22:28:41.736408  3758 validation.cpp:431] Executor default for task 1 
> uses less memory (None) than the minimum required (32MB). Please update your 
> executor, as this will be mandatory in future releases.
> I0915 22:28:41.737035  3758 master.hpp:176] Adding task 1 with resources 
> cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] on slave 
> 20150915-222840-16842879-54960-3733-S0 (devstack007.cn.ibm.com)
> I0915 22:28:41.737259  3758 master.cpp:3208] Launching task 1 of framework 
> 20150915-222840-16842879-54960-3733-0000 (default) at 
> [email protected]:54960 with resources 
> cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] on slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com)
> I0915 22:28:41.820271  3754 slave.cpp:890] Checkpointing SlaveInfo to 
> '/tmp/MasterSlaveReconciliationTest_ReconcileLostTask_tymaz7/meta/slaves/20150915-222840-16842879-54960-3733-S0/slave.info'
> I0915 22:28:41.820550  3754 slave.cpp:926] Forwarding total oversubscribed 
> resources 
> I0915 22:28:41.821171  3756 master.cpp:4226] Received update of slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com) with total oversubscribed resources 
> I0915 22:28:41.821429  3755 hierarchical.hpp:672] Slave 
> 20150915-222840-16842879-54960-3733-S0 (devstack007.cn.ibm.com) updated with 
> oversubscribed resources  (total: cpus(*):2; mem(*):1024; disk(*):1024; 
> ports(*):[31000-32000], allocated: cpus(*):2; mem(*):1024; disk(*):1024; 
> ports(*):[31000-32000])
> I0915 22:28:41.821674  3755 hierarchical.hpp:1147] No resources available to 
> allocate!
> I0915 22:28:41.821709  3755 hierarchical.hpp:1230] No inverse offers to send 
> out!
> I0915 22:28:41.821738  3755 hierarchical.hpp:1065] Performed allocation for 
> slave 20150915-222840-16842879-54960-3733-S0 in 264094ns
> I0915 22:28:41.822713  3756 status_update_manager.cpp:176] Pausing sending 
> status updates
> I0915 22:28:41.822727  3758 slave.cpp:692] New master detected at 
> [email protected]:54960
> I0915 22:28:41.822803  3758 slave.cpp:755] Authenticating with master 
> [email protected]:54960
> I0915 22:28:41.822823  3758 slave.cpp:760] Using default CRAM-MD5 
> authenticatee
> I0915 22:28:41.822923  3758 slave.cpp:728] Detecting new master
> I0915 22:28:41.823016  3752 authenticatee.cpp:115] Creating new client SASL 
> connection
> I0915 22:28:41.823370  3754 master.cpp:5089] Authenticating 
> slave(1)@127.0.1.1:54960
> I0915 22:28:41.823542  3758 authenticator.cpp:407] Starting authentication 
> session for crammd5_authenticatee(3)@127.0.1.1:54960
> I0915 22:28:41.823711  3752 authenticator.cpp:92] Creating new server SASL 
> connection
> I0915 22:28:41.823891  3756 authenticatee.cpp:206] Received SASL 
> authentication mechanisms: CRAM-MD5
> I0915 22:28:41.823927  3756 authenticatee.cpp:232] Attempting to authenticate 
> with mechanism 'CRAM-MD5'
> I0915 22:28:41.824025  3758 authenticator.cpp:197] Received SASL 
> authentication start
> I0915 22:28:41.824177  3758 authenticator.cpp:319] Authentication requires 
> more steps
> I0915 22:28:41.824311  3758 authenticatee.cpp:252] Received SASL 
> authentication step
> I0915 22:28:41.824445  3758 authenticator.cpp:225] Received SASL 
> authentication step
> I0915 22:28:41.824507  3758 auxprop.cpp:102] Request to lookup properties for 
> user: 'test-principal' realm: 'devstack007.cn.ibm.com' server FQDN: 
> 'devstack007.cn.ibm.com' SASL_AUXPROP_VERIFY_AGAINST_HASH: false 
> SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: false 
> I0915 22:28:41.824544  3758 auxprop.cpp:174] Looking up auxiliary property 
> '*userPassword'
> I0915 22:28:41.824609  3758 auxprop.cpp:174] Looking up auxiliary property 
> '*cmusaslsecretCRAM-MD5'
> I0915 22:28:41.824658  3758 auxprop.cpp:102] Request to lookup properties for 
> user: 'test-principal' realm: 'devstack007.cn.ibm.com' server FQDN: 
> 'devstack007.cn.ibm.com' SASL_AUXPROP_VERIFY_AGAINST_HASH: false 
> SASL_AUXPROP_OVERRIDE: false SASL_AUXPROP_AUTHZID: true 
> I0915 22:28:41.824692  3758 auxprop.cpp:124] Skipping auxiliary property 
> '*userPassword' since SASL_AUXPROP_AUTHZID == true
> I0915 22:28:41.824717  3758 auxprop.cpp:124] Skipping auxiliary property 
> '*cmusaslsecretCRAM-MD5' since SASL_AUXPROP_AUTHZID == true
> I0915 22:28:41.824753  3758 authenticator.cpp:311] Authentication success
> I0915 22:28:41.824874  3756 authenticatee.cpp:292] Authentication success
> I0915 22:28:41.824914  3757 master.cpp:5119] Successfully authenticated 
> principal 'test-principal' at slave(1)@127.0.1.1:54960
> I0915 22:28:41.825027  3759 authenticator.cpp:425] Authentication session 
> cleanup for crammd5_authenticatee(3)@127.0.1.1:54960
> I0915 22:28:41.825168  3758 slave.cpp:823] Successfully authenticated with 
> master [email protected]:54960
> I0915 22:28:41.825443  3758 slave.cpp:1217] Will retry registration in 
> 13.419746ms if necessary
> I0915 22:28:41.825734  3752 master.cpp:3976] Re-registering slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com)
> W0915 22:28:41.826159  3752 master.cpp:5186] Task 1 of framework 
> 20150915-222840-16842879-54960-3733-0000 unknown to the slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com) during re-registration: reconciling with the slave
> W0915 22:28:41.826668  3752 master.cpp:5268] Executor default of framework 
> 20150915-222840-16842879-54960-3733-0000 possibly unknown to the slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com)
> I0915 22:28:41.826745  3756 slave.cpp:967] Re-registered with master 
> [email protected]:54960
> I0915 22:28:41.826797  3752 master.cpp:6118] Removing executor 'default' with 
> resources  of framework 20150915-222840-16842879-54960-3733-0000 on slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com)
> I0915 22:28:41.826820  3757 status_update_manager.cpp:183] Resuming sending 
> status updates
> I0915 22:28:41.826843  3756 slave.cpp:1003] Forwarding total oversubscribed 
> resources 
> W0915 22:28:41.827033  3756 slave.cpp:1043] Slave reconciling task 1 of 
> framework 20150915-222840-16842879-54960-3733-0000 in state TASK_LOST: task 
> unknown to the slave
> I0915 22:28:41.827244  3752 master.cpp:4164] Sending updated checkpointed 
> resources  to slave 20150915-222840-16842879-54960-3733-S0 at 
> slave(1)@127.0.1.1:54960 (devstack007.cn.ibm.com)
> I0915 22:28:41.827461  3752 master.cpp:4226] Received update of slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com) with total oversubscribed resources 
> I0915 22:28:41.827873  3757 hierarchical.hpp:672] Slave 
> 20150915-222840-16842879-54960-3733-S0 (devstack007.cn.ibm.com) updated with 
> oversubscribed resources  (total: cpus(*):2; mem(*):1024; disk(*):1024; 
> ports(*):[31000-32000], allocated: cpus(*):2; mem(*):1024; disk(*):1024; 
> ports(*):[31000-32000])
> I0915 22:28:41.828115  3757 hierarchical.hpp:1147] No resources available to 
> allocate!
> I0915 22:28:41.828151  3757 hierarchical.hpp:1230] No inverse offers to send 
> out!
> I0915 22:28:41.828176  3757 hierarchical.hpp:1065] Performed allocation for 
> slave 20150915-222840-16842879-54960-3733-S0 in 260424ns
> I0915 22:28:41.828800  3752 status_update_manager.cpp:322] Received status 
> update TASK_LOST (UUID: 6a8c03f0-509f-48e6-8fd5-ed7b47eb44df) for task 1 of 
> framework 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.828882  3752 status_update_manager.cpp:499] Creating 
> StatusUpdate stream for task 1 of framework 
> 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.829025  3756 slave.cpp:2235] Updated checkpointed resources 
> from  to 
> I0915 22:28:41.829366  3752 status_update_manager.cpp:376] Forwarding update 
> TASK_LOST (UUID: 6a8c03f0-509f-48e6-8fd5-ed7b47eb44df) for task 1 of 
> framework 20150915-222840-16842879-54960-3733-0000 to the slave
> I0915 22:28:41.829633  3755 slave.cpp:2983] Forwarding the update TASK_LOST 
> (UUID: 6a8c03f0-509f-48e6-8fd5-ed7b47eb44df) for task 1 of framework 
> 20150915-222840-16842879-54960-3733-0000 to [email protected]:54960
> I0915 22:28:41.829941  3755 slave.cpp:2907] Status update manager 
> successfully handled status update TASK_LOST (UUID: 
> 6a8c03f0-509f-48e6-8fd5-ed7b47eb44df) for task 1 of framework 
> 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.830066  3758 master.cpp:4366] Status update TASK_LOST (UUID: 
> 6a8c03f0-509f-48e6-8fd5-ed7b47eb44df) for task 1 of framework 
> 20150915-222840-16842879-54960-3733-0000 from slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com)
> I0915 22:28:41.830114  3758 master.cpp:4405] Forwarding status update 
> TASK_LOST (UUID: 6a8c03f0-509f-48e6-8fd5-ed7b47eb44df) for task 1 of 
> framework 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.830272  3758 master.cpp:6021] Updating the latest state of 
> task 1 of framework 20150915-222840-16842879-54960-3733-0000 to TASK_LOST
> I0915 22:28:41.830363  3759 sched.cpp:918] Scheduler::statusUpdate took 
> 54887ns
> I0915 22:28:41.830675  3757 hierarchical.hpp:954] Recovered cpus(*):2; 
> mem(*):1024; disk(*):1024; ports(*):[31000-32000] (total: cpus(*):2; 
> mem(*):1024; disk(*):1024; ports(*):[31000-32000], allocated: ) on slave 
> 20150915-222840-16842879-54960-3733-S0 from framework 
> 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.835749  3753 leveldb.cpp:343] Persisting action (16 bytes) to 
> leveldb took 106.276892ms
> I0915 22:28:41.835824  3753 replica.cpp:679] Persisted action at 4
> I0915 22:28:41.836457  3753 replica.cpp:658] Replica received learned notice 
> for position 4
> I0915 22:28:41.836736  3758 master.cpp:6089] Removing task 1 with resources 
> cpus(*):2; mem(*):1024; disk(*):1024; ports(*):[31000-32000] of framework 
> 20150915-222840-16842879-54960-3733-0000 on slave 
> 20150915-222840-16842879-54960-3733-S0 at slave(1)@127.0.1.1:54960 
> (devstack007.cn.ibm.com)
> I0915 22:28:41.836931  3758 master.cpp:3560] Processing ACKNOWLEDGE call 
> 6a8c03f0-509f-48e6-8fd5-ed7b47eb44df for task 1 of framework 
> 20150915-222840-16842879-54960-3733-0000 (default) at 
> [email protected]:54960 on slave 
> 20150915-222840-16842879-54960-3733-S0
> I0915 22:28:41.837369  3758 status_update_manager.cpp:394] Received status 
> update acknowledgement (UUID: 6a8c03f0-509f-48e6-8fd5-ed7b47eb44df) for task 
> 1 of framework 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.837505  3758 status_update_manager.cpp:530] Cleaning up status 
> update stream for task 1 of framework 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.837813  3758 slave.cpp:2306] Status update manager 
> successfully handled status update acknowledgement (UUID: 
> 6a8c03f0-509f-48e6-8fd5-ed7b47eb44df) for task 1 of framework 
> 20150915-222840-16842879-54960-3733-0000
> E0915 22:28:41.837849  3758 slave.cpp:2317] Status update acknowledgement 
> (UUID: 6a8c03f0-509f-48e6-8fd5-ed7b47eb44df) for task 1 of unknown framework 
> 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.877821  3753 leveldb.cpp:343] Persisting action (18 bytes) to 
> leveldb took 41.327821ms
> I0915 22:28:41.877909  3753 leveldb.cpp:401] Deleting ~2 keys from leveldb 
> took 47279ns
> I0915 22:28:41.877940  3753 replica.cpp:679] Persisted action at 4
> I0915 22:28:41.877981  3753 replica.cpp:664] Replica learned TRUNCATE action 
> at position 4
> I0915 22:28:41.906275  3754 process.cpp:3021] Handling HTTP event for process 
> 'metrics' with path: '/metrics/snapshot'
> I0915 22:28:41.918052  3733 sched.cpp:1754] Asked to stop the driver
> I0915 22:28:41.918167  3754 sched.cpp:1040] Stopping framework 
> '20150915-222840-16842879-54960-3733-0000'
> I0915 22:28:41.918184  3752 master.cpp:921] Master terminating
> I0915 22:28:41.918462  3755 hierarchical.hpp:643] Removed slave 
> 20150915-222840-16842879-54960-3733-S0
> I0915 22:28:41.918792  3755 hierarchical.hpp:490] Removed framework 
> 20150915-222840-16842879-54960-3733-0000
> I0915 22:28:41.919427  3756 slave.cpp:3151] [email protected]:54960 exited
> W0915 22:28:41.919839  3756 slave.cpp:3154] Master disconnected! Waiting for 
> a new master to be elected
> I0915 22:28:41.923146  3753 slave.cpp:572] Slave terminating
> [       OK ] MasterSlaveReconciliationTest.ReconcileLostTask (1443 ms)
> [----------] 1 test from MasterSlaveReconciliationTest (1443 ms total)
>  
> [----------] Global test environment tear-down
> [==========] 1 test from 1 test case ran. (1780 ms total)
> [  PASSED  ] 1 test.
> make[3]: Leaving directory `/home/gyliu/src/mesos/bug-fix/mesos/build/src'
> make[2]: Leaving directory `/home/gyliu/src/mesos/bug-fix/mesos/build/src'
> make[1]: Leaving directory `/home/gyliu/src/mesos/bug-fix/mesos/build/src'
> gyliu@devstack007:



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to