[ https://issues.apache.org/jira/browse/MESOS-7686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jack Crawford updated MESOS-7686:
---------------------------------
    Description: 
The Mesos master fails to update the registrar and then aborts.

Running with mesos 1.2.0, 1 master

```
Jun 16 02:20:22 ava-mesosmasterl001 mesos-master[9415]: E0616 02:20:22.562098  9422 registrar.cpp:528] Registrar aborting: Failed to update registry: Failed to perform store within 20secs
Jun 16 02:20:32 ava-mesosmasterl001 mesos-master[9415]: F0616 02:20:32.965498  9419 master.cpp:6420] Failed to mark agent acc02700-53c8-4961-b8f4-d952e58432c3-S742 at slave(1)@10.0.239.60:5051 (ip-10-0-239-60) un
Jun 16 02:20:53 ava-mesosmasterl001 mesos-master[9415]: E0616 02:20:36.198673  9426 process.cpp:2426] Failed to shutdown socket with fd 18: Transport endpoint is not connected
Jun 16 02:20:53 ava-mesosmasterl001 mesos-master[9415]: *** Check failure stack trace: ***
Jun 16 02:21:24 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c7923cd  google::LogMessage::Fail()
Jun 16 02:21:34 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c794180  google::LogMessage::SendToLog()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c791fb3  google::LogMessage::Flush()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c794ba9  google::LogMessageFatal::~LogMessageFatal()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696bb48e6d  mesos::internal::master::Master::_markUnreachable()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c6f1f0c  process::ProcessBase::visit()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c704933  process::ProcessManager::resume()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c70f537  _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUt_vEEE6_M_runEv
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696a950c80  (unknown)
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696a1636ba  start_thread
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f6969e9982d  (unknown)
Jun 16 02:23:40 ava-mesosmasterl001 systemd[1]: mesos-master.service: Main process exited, code=killed, status=6/ABRT
Jun 16 02:23:40 ava-mesosmasterl001 systemd[1]: mesos-master.service: Unit entered failed state.
```

  was:
The Mesos master fails to update the registry and then aborts.

Running with mesos 1.2.0, 1 master

```
Jun 16 02:20:22 ava-mesosmasterl001 mesos-master[9415]: E0616 02:20:22.562098  9422 registrar.cpp:528] Registrar aborting: Failed to update registry: Failed to perform store within 20secs
Jun 16 02:20:32 ava-mesosmasterl001 mesos-master[9415]: F0616 02:20:32.965498  9419 master.cpp:6420] Failed to mark agent acc02700-53c8-4961-b8f4-d952e58432c3-S742 at slave(1)@10.0.239.60:5051 (ip-10-0-239-60) un
Jun 16 02:20:53 ava-mesosmasterl001 mesos-master[9415]: E0616 02:20:36.198673  9426 process.cpp:2426] Failed to shutdown socket with fd 18: Transport endpoint is not connected
Jun 16 02:20:53 ava-mesosmasterl001 mesos-master[9415]: *** Check failure stack trace: ***
Jun 16 02:21:24 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c7923cd  google::LogMessage::Fail()
Jun 16 02:21:34 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c794180  google::LogMessage::SendToLog()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c791fb3  google::LogMessage::Flush()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c794ba9  google::LogMessageFatal::~LogMessageFatal()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696bb48e6d  mesos::internal::master::Master::_markUnreachable()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c6f1f0c  process::ProcessBase::visit()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c704933  process::ProcessManager::resume()
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c70f537  _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUt_vEEE6_M_runEv
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696a950c80  (unknown)
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696a1636ba  start_thread
Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f6969e9982d  (unknown)
Jun 16 02:23:40 ava-mesosmasterl001 systemd[1]: mesos-master.service: Main process exited, code=killed, status=6/ABRT
Jun 16 02:23:40 ava-mesosmasterl001 systemd[1]: mesos-master.service: Unit entered failed state.
```


> registrar aborting, failed to mark agent causes fatal error
> -----------------------------------------------------------
>
>                 Key: MESOS-7686
>                 URL: https://issues.apache.org/jira/browse/MESOS-7686
>             Project: Mesos
>          Issue Type: Bug
>            Reporter: Jack Crawford
>
> The Mesos master fails to update the registrar and then aborts.
> Running with mesos 1.2.0, 1 master
> ```
> Jun 16 02:20:22 ava-mesosmasterl001 mesos-master[9415]: E0616 02:20:22.562098  9422 registrar.cpp:528] Registrar aborting: Failed to update registry: Failed to perform store within 20secs
> Jun 16 02:20:32 ava-mesosmasterl001 mesos-master[9415]: F0616 02:20:32.965498  9419 master.cpp:6420] Failed to mark agent acc02700-53c8-4961-b8f4-d952e58432c3-S742 at slave(1)@10.0.239.60:5051 (ip-10-0-239-60) un
> Jun 16 02:20:53 ava-mesosmasterl001 mesos-master[9415]: E0616 02:20:36.198673  9426 process.cpp:2426] Failed to shutdown socket with fd 18: Transport endpoint is not connected
> Jun 16 02:20:53 ava-mesosmasterl001 mesos-master[9415]: *** Check failure stack trace: ***
> Jun 16 02:21:24 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c7923cd  google::LogMessage::Fail()
> Jun 16 02:21:34 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c794180  google::LogMessage::SendToLog()
> Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c791fb3  google::LogMessage::Flush()
> Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c794ba9  google::LogMessageFatal::~LogMessageFatal()
> Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696bb48e6d  mesos::internal::master::Master::_markUnreachable()
> Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c6f1f0c  process::ProcessBase::visit()
> Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c704933  process::ProcessManager::resume()
> Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696c70f537  _ZNSt6thread5_ImplISt12_Bind_simpleIFZN7process14ProcessManager12init_threadsEvEUt_vEEE6_M_runEv
> Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696a950c80  (unknown)
> Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f696a1636ba  start_thread
> Jun 16 02:21:44 ava-mesosmasterl001 mesos-master[9415]:     @     0x7f6969e9982d  (unknown)
> Jun 16 02:23:40 ava-mesosmasterl001 systemd[1]: mesos-master.service: Main process exited, code=killed, status=6/ABRT
> Jun 16 02:23:40 ava-mesosmasterl001 systemd[1]: mesos-master.service: Unit entered failed state.
> ```



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
