Dmytro Sen created AMBARI-18922:
-----------------------------------
Summary: Agent Auto Restart Doesn't Release Ping Port
Key: AMBARI-18922
URL: https://issues.apache.org/jira/browse/AMBARI-18922
Project: Ambari
Issue Type: Bug
Components: ambari-agent
Affects Versions: 3.0.0
Reporter: Dmytro Sen
Assignee: Dmytro Sen
Priority: Critical
Fix For: 3.0.0
Agent auto-restart fails because the ping port (8670) is not released; the restarted agent cannot start its ping port listener:
{code}
INFO 2016-11-10 17:56:58,319 security.py:148 - Encountered communication error.
Details: error(104, 'Connection reset by peer')
ERROR 2016-11-10 17:56:58,320 Controller.py:425 - Connection to 192.168.64.1
was lost (details=Request to
https://192.168.64.1:8441/agent/v1/heartbeat/c6401.ambari.apache.org failed due
to Error occured during connecting to the server: [Errno 104] Connection reset
by peer)
INFO 2016-11-10 17:57:33,233 Controller.py:285 - Heartbeat (response id = 1157)
with server is running...
INFO 2016-11-10 17:57:33,233 NetUtil.py:62 - Connecting to
https://192.168.64.1:8440/connection_info
INFO 2016-11-10 17:57:33,300 security.py:100 - SSL Connect being called..
connecting to the server
INFO 2016-11-10 17:57:33,366 security.py:61 - SSL connection established.
Two-way SSL authentication is turned off on the server.
ERROR 2016-11-10 17:57:33,368 Controller.py:349 - Error in responseId sequence
- restarting
INFO 2016-11-10 17:57:33,369 ExitHelper.py:53 - Performing cleanup before
exiting...
INFO 2016-11-10 17:57:33,369 threadpool.py:112 - Shutting down thread pool
INFO 2016-11-10 17:57:33,409 scheduler.py:607 - Scheduler has been shut down
INFO 2016-11-10 17:57:33,409 threadpool.py:52 - Started thread pool with 3 core
threads and 20 maximum threads
INFO 2016-11-10 17:57:33,410 AlertSchedulerHandler.py:166 - [AlertScheduler]
Stopped the alert scheduler.
INFO 2016-11-10 17:57:33,410 threadpool.py:112 - Shutting down thread pool
INFO 2016-11-10 17:57:33,410 ExitHelper.py:67 - Cleanup finished, exiting with
code:77
INFO 2016-11-10 17:57:33,544 main.py:96 - loglevel=logging.INFO
INFO 2016-11-10 17:57:33,544 main.py:96 - loglevel=logging.INFO
INFO 2016-11-10 17:57:33,544 main.py:96 - loglevel=logging.INFO
INFO 2016-11-10 17:57:33,545 DataCleaner.py:39 - Data cleanup thread started
INFO 2016-11-10 17:57:33,547 DataCleaner.py:120 - Data cleanup started
INFO 2016-11-10 17:57:33,548 DataCleaner.py:122 - Data cleanup finished
ERROR 2016-11-10 17:57:33,573 main.py:377 - Failed to start ping port listener
of: Could not open port 8670 because port already used by another process:
UID PID PPID C STIME TTY TIME CMD
root 4750 1 0 17:34 pts/0 00:00:00 /usr/bin/python /usr/lib/python2
INFO 2016-11-10 17:57:33,574 PingPortListener.py:61 - Ping port listener killed
INFO 2016-11-10 17:57:33,574 ExitHelper.py:53 - Performing cleanup before
exiting...
{code}
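
The log suggests that the listening socket on port 8670 is still held when the restarted agent tries to bind it. A minimal sketch of the general remedy, assuming the listener owns a plain TCP socket that can be closed during cleanup: `PingListener` and `can_bind` are hypothetical names for illustration, not the actual ambari-agent classes, and the sketch uses an ephemeral port rather than 8670.

```python
import socket


class PingListener(object):
    """Hypothetical stand-in for the agent's ping port listener."""

    def __init__(self, host="127.0.0.1", port=0):
        # port=0 asks the OS for a free ephemeral port (assumption for the
        # sketch; the real agent listens on a configured port such as 8670).
        self.sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
        self.sock.bind((host, port))
        self.sock.listen(1)
        self.port = self.sock.getsockname()[1]

    def close(self):
        # Release the port explicitly so a restarted process can bind it.
        self.sock.close()


def can_bind(port, host="127.0.0.1"):
    """Return True if a new socket can bind to (host, port) right now."""
    probe = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        probe.bind((host, port))
        return True
    except socket.error:
        return False
    finally:
        probe.close()


if __name__ == "__main__":
    listener = PingListener()
    print("bindable while listener is open: %s" % can_bind(listener.port))
    listener.close()  # the step that would belong in the pre-restart cleanup
    print("bindable after close: %s" % can_bind(listener.port))
```

In this sketch the port becomes bindable again only after `close()` is called, which is the behaviour the restart path would need before the new agent instance starts its own listener.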