[jira] [Updated] (MESOS-7106) Test ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1 segfaults

2017-04-27 Thread Vinod Kone (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kone updated MESOS-7106:
--
Sprint: Mesosphere Sprint 52, Mesosphere Sprint 53  (was: Mesosphere Sprint 
52, Mesosphere Sprint 53, Mesosphere Sprint 55)

> Test ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1 segfaults
> 
>
> Key: MESOS-7106
> URL: https://issues.apache.org/jira/browse/MESOS-7106
> Project: Mesos
> Issue Type: Bug
> Affects Versions: 1.2.0
> Environment: centos7, SSL build
> Reporter: Benjamin Bannier
> Assignee: Joseph Wu
> Labels: flaky-test, mesosphere, test
>
> {{ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1}} segfaulted 
> in our internal CI:
> {noformat}
> [ RUN  ] ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1
> W0210 03:08:05.018744  1020 process.cpp:3029] Attempted to spawn a process 
> (__http_connection__(1079)@10.168.212.35:42363) after finalizing libprocess!
> *** Aborted at 1486696085 (unix time) try "date -d @1486696085" if you are 
> using GNU date ***
> I0210 03:08:05.023609  6019 process.cpp:1246] libprocess is initialized on 
> 10.168.212.35:44850 with 8 worker threads
> I0210 03:08:05.024163  6019 cluster.cpp:160] Creating default 'local' 
> authorizer
> I0210 03:08:05.025065  1025 master.cpp:383] Master 
> 7adcbe15-38a9-4512-aa9c-8d5f7538e4ee (ip-10-168-212-35.ec2.internal) started 
> on 10.168.212.35:44850
> I0210 03:08:05.025089  1025 master.cpp:385] Flags at startup: --acls="" 
> --agent_ping_timeout="15secs" --agent_reregister_timeout="10mins" 
> --allocation_interval="1secs" --allocator="HierarchicalDRF" 
> --authenticate_agents="true" --authenticate_frameworks="true" 
> --authenticate_http_frameworks="true" --authenticate_http_readonly="true" 
> --authenticate_http_readwrite="true" --authenticators="crammd5" 
> --authorizers="local" --credentials="/tmp/5DRa8u/credentials" 
> --framework_sorter="drf" --help="false" --hostname_lookup="true" 
> --http_authenticators="basic" --http_framework_authenticators="basic" 
> --initialize_driver_logging="true" --log_auto_initialize="true" 
> --logbufsecs="0" --logging_level="INFO" --max_agent_ping_timeouts="5" 
> --max_completed_frameworks="50" --max_completed_tasks_per_framework="1000" 
> --max_unreachable_tasks_per_framework="1000" --quiet="false" 
> --recovery_agent_removal_limit="100%" --registry="in_memory" 
> --registry_fetch_timeout="1mins" --registry_gc_interval="15mins" 
> --registry_max_agent_age="2weeks" --registry_max_agent_count="102400" 
> --registry_store_timeout="100secs" --registry_strict="false" 
> --root_submissions="true" --user_sorter="drf" --version="false" 
> --webui_dir="/usr/local/share/mesos/webui" --work_dir="/tmp/5DRa8u/master" 
> --zk_session_timeout="10secs"
> I0210 03:08:05.025264  1025 master.cpp:435] Master only allowing 
> authenticated frameworks to register
> I0210 03:08:05.025276  1025 master.cpp:449] Master only allowing 
> authenticated agents to register
> I0210 03:08:05.025285  1025 master.cpp:462] Master only allowing 
> authenticated HTTP frameworks to register
> I0210 03:08:05.025293  1025 credentials.hpp:37] Loading credentials for 
> authentication from '/tmp/5DRa8u/credentials'
> I0210 03:08:05.025387  1025 master.cpp:507] Using default 'crammd5' 
> authenticator
> I0210 03:08:05.025441  1025 http.cpp:919] Using default 'basic' HTTP 
> authenticator for realm 'mesos-master-readonly'
> I0210 03:08:05.025512  1025 http.cpp:919] Using default 'basic' HTTP 
> authenticator for realm 'mesos-master-readwrite'
> I0210 03:08:05.025560  1025 http.cpp:919] Using default 'basic' HTTP 
> authenticator for realm 'mesos-master-scheduler'
> I0210 03:08:05.025619  1025 master.cpp:587] Authorization enabled
> I0210 03:08:05.025728  1023 hierarchical.cpp:161] Initialized hierarchical 
> allocator process
> I0210 03:08:05.025754  1027 whitelist_watcher.cpp:77] No whitelist given
> PC: @ 0x7f69d2296012 process::ProcessManager::spawn()
> *** SIGSEGV (@0x0) received by PID 6019 (TID 0x7f69c46d5700) from PID 0; 
> stack trace: ***
> @ 0x7f69c2408725 (unknown)
> I0210 03:08:05.026340  1023 master.cpp:2124] Elected as the leading master!
> I0210 03:08:05.026357  1023 master.cpp:1646] Recovering from registrar
> I0210 03:08:05.026406  1025 registrar.cpp:329] Recovering registrar
> @ 0x7f69c240d2f1 (unknown)
> @ 0x7f69c24011e8 (unknown)
> I0210 03:08:05.027294  1024 registrar.cpp:362] Successfully fetched the 
> registry (0B) in 865024ns
> I0210 03:08:05.027330  1024 registrar.cpp:461] Applied 1 operations in 
> 2848ns; attempting to update the registry
> @ 0x7f69d027b370 (unknown)
> I0210 03:08:05.028261  1028 registrar.cpp:506] Successfully updated the 
> registry in 916992ns
> I0210 

[jira] [Updated] (MESOS-7106) Test ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1 segfaults

2017-04-13 Thread Adam B (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adam B updated MESOS-7106:
--
Sprint: Mesosphere Sprint 52, Mesosphere Sprint 53  (was: Mesosphere Sprint 
52, Mesosphere Sprint 53, Mesosphere Sprint 54)


[jira] [Updated] (MESOS-7106) Test ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1 segfaults

2017-04-13 Thread Adam B (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adam B updated MESOS-7106:
--
Sprint: Mesosphere Sprint 52, Mesosphere Sprint 53, Mesosphere Sprint 55  
(was: Mesosphere Sprint 52, Mesosphere Sprint 53)


[jira] [Updated] (MESOS-7106) Test ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1 segfaults

2017-03-31 Thread Adam B (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adam B updated MESOS-7106:
--
Sprint: Mesosphere Sprint 52, Mesosphere Sprint 53, Mesosphere Sprint 54  
(was: Mesosphere Sprint 52, Mesosphere Sprint 53)


[jira] [Updated] (MESOS-7106) Test ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1 segfaults

2017-03-23 Thread Artem Harutyunyan (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Artem Harutyunyan updated MESOS-7106:
-
Sprint: Mesosphere Sprint 52, Mesosphere Sprint 53  (was: Mesosphere Sprint 
52)


[jira] [Updated] (MESOS-7106) Test ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1 segfaults

2017-03-03 Thread Joseph Wu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joseph Wu updated MESOS-7106:
-
Sprint: Mesosphere Sprint 52
Story Points: 2

Ran into this as I was enabling SSL on the CMake build.

The problem is a race between {{ProcessManager::spawn}} and 
{{ProcessManager::finalize}}.  The bad interleaving is:

|| {{ProcessManager::finalize}} || {{ProcessManager::spawn}} ||
| | New process is spawned, wanting {{gc}} management |
| | Check for {{finalizing}} atomic bool passes |
| {{finalizing}} set to true | Waits on {{processes_mutex}} |
| Takes {{processes_mutex}} (1+ times) | |
| Lets go of {{processes_mutex}} and terminates one process; repeats until no processes are left | |
| | Takes {{processes_mutex}} |
| Terminates {{gc}} | Adds new process to {{processes}} mapping |
| | Tries to add new process to {{gc}}; segfault because {{gc}} is gone |
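
The interleaving in the table can be reproduced deterministically with a small model. This is only a sketch under stated assumptions, not the libprocess code and not necessarily the fix that went into Mesos: {{MiniManager}}, {{Gc}}, {{size()}} and the {{interleave}} hook are hypothetical names made up for illustration, and the model keeps {{gc}} behind {{processes_mutex}}, which the real code may not do. It shows one possible mitigation: checking {{finalizing}} a second time after taking {{processes_mutex}}, so a spawn that passed the early check but lost the race is rejected instead of touching a {{gc}} that is already gone.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <functional>
#include <map>
#include <memory>
#include <mutex>
#include <string>

// Hypothetical stand-in for the garbage-collecting process (gc).
struct Gc {
  std::map<std::string, bool> managed;
  void manage(const std::string& name) { managed[name] = true; }
};

// Simplified model of the manager; NOT the libprocess implementation.
class MiniManager {
public:
  MiniManager() : finalizing(false), gc(new Gc()) {}

  // Returns false if the spawn is rejected because finalization began.
  // `interleave` is a test-only hook that runs between the early check
  // and taking the lock, which is where the race window sits.
  bool spawn(const std::string& name,
             bool manage,
             const std::function<void()>& interleave = nullptr) {
    if (finalizing.load()) {
      return false;  // Early, lock-free check (first spawn row in the table).
    }

    if (interleave) {
      interleave();  // finalize() may run to completion right here.
    }

    std::lock_guard<std::mutex> lock(processes_mutex);

    // Second check, now under the lock. finalize() sets the flag before
    // it takes the mutex, so a spawn that arrives late observes it.
    if (finalizing.load()) {
      return false;
    }

    processes[name] = true;
    if (manage) {
      assert(gc != nullptr);  // Holds in this model because of the re-check.
      gc->manage(name);
    }
    return true;
  }

  void finalize() {
    finalizing.store(true);
    std::lock_guard<std::mutex> lock(processes_mutex);
    processes.clear();
    gc.reset();  // Dereferencing gc after this point would crash.
  }

  std::size_t size() {
    std::lock_guard<std::mutex> lock(processes_mutex);
    return processes.size();
  }

private:
  std::atomic<bool> finalizing;
  std::mutex processes_mutex;
  std::map<std::string, bool> processes;
  std::unique_ptr<Gc> gc;
};
```

Calling {{spawn("b", true, [&m]() { m.finalize(); })}} on a {{MiniManager m}} forces the bad ordering: the early check passes, finalization completes, and only then does the spawn take the mutex. With the second check the call returns {{false}}. Without it, the call would reach {{gc->manage}} on a null pointer, which corresponds to the last row of the table.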

> Test ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1 segfaults
> 
>
> Key: MESOS-7106
> URL: https://issues.apache.org/jira/browse/MESOS-7106
> Project: Mesos
> Issue Type: Bug
> Affects Versions: 1.2.0
> Environment: centos7, SSL build
> Reporter: Benjamin Bannier
> Assignee: Joseph Wu
> Labels: flaky-test, test
>

[jira] [Updated] (MESOS-7106) Test ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1 segfaults

2017-02-09 Thread Benjamin Bannier (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-7106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Bannier updated MESOS-7106:

Affects Version/s: 1.2.0

> Test ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1 segfaults
> 
>
> Key: MESOS-7106
> URL: https://issues.apache.org/jira/browse/MESOS-7106
> Project: Mesos
> Issue Type: Bug
> Affects Versions: 1.2.0
> Environment: centos7, SSL build
> Reporter: Benjamin Bannier
> Labels: flaky-test, test
>
> {{ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1}} segfaulted 
> in our internal CI:
> {noformat}
> [ RUN  ] ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/1
> W0210 03:08:05.018744  1020 process.cpp:3029] Attempted to spawn a process 
> (__http_connection__(1079)@10.168.212.35:42363) after finalizing libprocess!
> *** Aborted at 1486696085 (unix time) try "date -d @1486696085" if you are 
> using GNU date ***
> I0210 03:08:05.023609  6019 process.cpp:1246] libprocess is initialized on 
> 10.168.212.35:44850 with 8 worker threads
> I0210 03:08:05.024163  6019 cluster.cpp:160] Creating default 'local' 
> authorizer
> I0210 03:08:05.025065  1025 master.cpp:383] Master 
> 7adcbe15-38a9-4512-aa9c-8d5f7538e4ee (ip-10-168-212-35.ec2.internal) started 
> on 10.168.212.35:44850
> I0210 03:08:05.025089  1025 master.cpp:385] Flags at startup: --acls="" 
> --agent_ping_timeout="15secs" --agent_reregister_timeout="10mins" 
> --allocation_interval="1secs" --allocator="HierarchicalDRF" 
> --authenticate_agents="true" --authenticate_frameworks="true" 
> --authenticate_http_frameworks="true" --authenticate_http_readonly="true" 
> --authenticate_http_readwrite="true" --authenticators="crammd5" 
> --authorizers="local" --credentials="/tmp/5DRa8u/credentials" 
> --framework_sorter="drf" --help="false" --hostname_lookup="true" 
> --http_authenticators="basic" --http_framework_authenticators="basic" 
> --initialize_driver_logging="true" --log_auto_initialize="true" 
> --logbufsecs="0" --logging_level="INFO" --max_agent_ping_timeouts="5" 
> --max_completed_frameworks="50" --max_completed_tasks_per_framework="1000" 
> --max_unreachable_tasks_per_framework="1000" --quiet="false" 
> --recovery_agent_removal_limit="100%" --registry="in_memory" 
> --registry_fetch_timeout="1mins" --registry_gc_interval="15mins" 
> --registry_max_agent_age="2weeks" --registry_max_agent_count="102400" 
> --registry_store_timeout="100secs" --registry_strict="false" 
> --root_submissions="true" --user_sorter="drf" --version="false" 
> --webui_dir="/usr/local/share/mesos/webui" --work_dir="/tmp/5DRa8u/master" 
> --zk_session_timeout="10secs"
> I0210 03:08:05.025264  1025 master.cpp:435] Master only allowing 
> authenticated frameworks to register
> I0210 03:08:05.025276  1025 master.cpp:449] Master only allowing 
> authenticated agents to register
> I0210 03:08:05.025285  1025 master.cpp:462] Master only allowing 
> authenticated HTTP frameworks to register
> I0210 03:08:05.025293  1025 credentials.hpp:37] Loading credentials for 
> authentication from '/tmp/5DRa8u/credentials'
> I0210 03:08:05.025387  1025 master.cpp:507] Using default 'crammd5' 
> authenticator
> I0210 03:08:05.025441  1025 http.cpp:919] Using default 'basic' HTTP 
> authenticator for realm 'mesos-master-readonly'
> I0210 03:08:05.025512  1025 http.cpp:919] Using default 'basic' HTTP 
> authenticator for realm 'mesos-master-readwrite'
> I0210 03:08:05.025560  1025 http.cpp:919] Using default 'basic' HTTP 
> authenticator for realm 'mesos-master-scheduler'
> I0210 03:08:05.025619  1025 master.cpp:587] Authorization enabled
> I0210 03:08:05.025728  1023 hierarchical.cpp:161] Initialized hierarchical 
> allocator process
> I0210 03:08:05.025754  1027 whitelist_watcher.cpp:77] No whitelist given
> PC: @ 0x7f69d2296012 process::ProcessManager::spawn()
> *** SIGSEGV (@0x0) received by PID 6019 (TID 0x7f69c46d5700) from PID 0; 
> stack trace: ***
> @ 0x7f69c2408725 (unknown)
> I0210 03:08:05.026340  1023 master.cpp:2124] Elected as the leading master!
> I0210 03:08:05.026357  1023 master.cpp:1646] Recovering from registrar
> I0210 03:08:05.026406  1025 registrar.cpp:329] Recovering registrar
> @ 0x7f69c240d2f1 (unknown)
> @ 0x7f69c24011e8 (unknown)
> I0210 03:08:05.027294  1024 registrar.cpp:362] Successfully fetched the 
> registry (0B) in 865024ns
> I0210 03:08:05.027330  1024 registrar.cpp:461] Applied 1 operations in 
> 2848ns; attempting to update the registry
> @ 0x7f69d027b370 (unknown)
> I0210 03:08:05.028261  1028 registrar.cpp:506] Successfully updated the 
> registry in 916992ns
> I0210 03:08:05.028313  1028 registrar.cpp:392] Successfully recovered 
> registrar
> I0210 03:08:05.028419  1028 master.cpp:1762] Recovered