Dmytro Sen created AMBARI-18922:
-----------------------------------

             Summary: Agent Auto Restart Doesn't Release Ping Port
                 Key: AMBARI-18922
                 URL: https://issues.apache.org/jira/browse/AMBARI-18922
             Project: Ambari
          Issue Type: Bug
          Components: ambari-agent
    Affects Versions: 3.0.0
            Reporter: Dmytro Sen
            Assignee: Dmytro Sen
            Priority: Critical
             Fix For: 3.0.0


Agent auto-restart fails with
{code}
INFO 2016-11-10 17:56:58,319 security.py:148 - Encountered communication error. 
Details: error(104, 'Connection reset by peer')
ERROR 2016-11-10 17:56:58,320 Controller.py:425 - Connection to 192.168.64.1 
was lost (details=Request to 
https://192.168.64.1:8441/agent/v1/heartbeat/c6401.ambari.apache.org failed due 
to Error occured during connecting to the server: [Errno 104] Connection reset 
by peer)
INFO 2016-11-10 17:57:33,233 Controller.py:285 - Heartbeat (response id = 1157) 
with server is running...
INFO 2016-11-10 17:57:33,233 NetUtil.py:62 - Connecting to 
https://192.168.64.1:8440/connection_info
INFO 2016-11-10 17:57:33,300 security.py:100 - SSL Connect being called.. 
connecting to the server
INFO 2016-11-10 17:57:33,366 security.py:61 - SSL connection established. 
Two-way SSL authentication is turned off on the server.
ERROR 2016-11-10 17:57:33,368 Controller.py:349 - Error in responseId sequence 
- restarting
INFO 2016-11-10 17:57:33,369 ExitHelper.py:53 - Performing cleanup before 
exiting...
INFO 2016-11-10 17:57:33,369 threadpool.py:112 - Shutting down thread pool
INFO 2016-11-10 17:57:33,409 scheduler.py:607 - Scheduler has been shut down
INFO 2016-11-10 17:57:33,409 threadpool.py:52 - Started thread pool with 3 core 
threads and 20 maximum threads
INFO 2016-11-10 17:57:33,410 AlertSchedulerHandler.py:166 - [AlertScheduler] 
Stopped the alert scheduler.
INFO 2016-11-10 17:57:33,410 threadpool.py:112 - Shutting down thread pool
INFO 2016-11-10 17:57:33,410 ExitHelper.py:67 - Cleanup finished, exiting with 
code:77
INFO 2016-11-10 17:57:33,544 main.py:96 - loglevel=logging.INFO
INFO 2016-11-10 17:57:33,544 main.py:96 - loglevel=logging.INFO
INFO 2016-11-10 17:57:33,544 main.py:96 - loglevel=logging.INFO
INFO 2016-11-10 17:57:33,545 DataCleaner.py:39 - Data cleanup thread started
INFO 2016-11-10 17:57:33,547 DataCleaner.py:120 - Data cleanup started
INFO 2016-11-10 17:57:33,548 DataCleaner.py:122 - Data cleanup finished
ERROR 2016-11-10 17:57:33,573 main.py:377 - Failed to start ping port listener 
of: Could not open port 8670 because port already used by another process:
UID        PID  PPID  C STIME TTY          TIME CMD
root      4750     1  0 17:34 pts/0    00:00:00 /usr/bin/python /usr/lib/python2

INFO 2016-11-10 17:57:33,574 PingPortListener.py:61 - Ping port listener killed
INFO 2016-11-10 17:57:33,574 ExitHelper.py:53 - Performing cleanup before 
exiting...
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to