saurabh1284 commented on issue #2264: Unable to connect to the controller 
docker: Controller docker not starting
URL: 
https://github.com/apache/incubator-openwhisk/issues/2264#issuecomment-313165119
 
 
   @rabbah I tried redeploying again, but I am still getting the same issue. 
Steps performed -->>
   
   > Removed all running containers (except registry and nginx), as below ==
            -> docker ps --format "{{.Names}} - {{.Image}} - {{.Ports}}"
                nginx - nginx:1.11 - 0.0.0.0:80->80/tcp, 0.0.0.0:443->443/tcp, 
0.0.0.0:8443->8443/tcp
                whisk_docker_registry - registry:2.3.1 - 0.0.0.0:5000->5000/tcp
   > Ran the following Ansible playbooks ==
            -> ansible-playbook -i environments/distributed couchdb.yml
            -> ansible-playbook -i environments/distributed initdb.yml
            -> ansible-playbook -i environments/distributed wipe.yml
            <<  All of the above playbooks ran without any errors >>
            -> ansible-playbook -i environments/distributed openwhisk.yml
                 It failed with this error >>
                 TASK [controller : wait until the Controller in this host is 
up and running] 
                 *********************************
                Wednesday 05 July 2017  15:59:56 +0000 (0:01:04.168)       
0:34:19.560 ********
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (12 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (11 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (10 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (9 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (8 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (7 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (6 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (5 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (4 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (3 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (2 retries left).
                FAILED - RETRYING: wait until the Controller in this host is up 
and running (1 retries left).
                fatal: [10.11.12.24]: FAILED! => {"attempts": 12, "changed": 
false, "content": "", "failed": true, 
                "msg": "Status code was not [200]: Request failed: <urlopen 
error [Errno 111] Connection 
                 refused>", "redirected": false, "status": -1, "url": 
"http://10.11.12.24:10001/ping"}
   
                 PLAY RECAP 
***************************************************************************************************
   10.11.12.24                : ok=25   changed=20   unreachable=0    failed=1
   10.11.12.25                : ok=4    changed=2    unreachable=0    failed=0
   ansible                    : ok=2    changed=1    unreachable=0    failed=0
   
   Wednesday 05 July 2017  16:07:32 +0000 (0:07:35.596)       0:41:55.233 
********
   
===============================================================================
   controller : wait until the Controller in this host is up and running - 
455.69s
   kafka : wait until the kafka server started up ------------------------ 
316.73s
   kafka : create the health topic --------------------------------------- 
260.57s
   kafka : create the active-ack topics ---------------------------------- 
196.70s
   kafka : create the invoker topics ------------------------------------- 
193.12s
   kafka : wait until the Zookeeper in this host is up and running ------- 
122.22s
   consul : fill consul kv ----------------------------------------------- 
104.41s
   Gathering Facts -------------------------------------------------------- 
94.08s
   Gathering Facts -------------------------------------------------------- 
84.56s
   kafka : (re)start zookeeper -------------------------------------------- 
66.54s
   controller : (re)start controller -------------------------------------- 
64.17s
   consul : pull the consul:0.7.0 image ----------------------------------- 
59.16s
   Gathering Facts -------------------------------------------------------- 
52.96s
   consul : (re)start consul server/agent --------------------------------- 
46.37s
   consul : copy template from local to remote in consul config directory -- 
43.64s
   kafka : (re)start kafka ------------------------------------------------ 
37.65s
   consul : start registrator using docker cli ---------------------------- 
36.61s
   controller : pull the latest image of controller ----------------------- 
32.88s
   kafka : pull the zookeeper:3.4 image ----------------------------------- 
32.54s
   Gathering Facts -------------------------------------------------------- 
30.18s
   
   ==============================
   ## Additional information you deem important:
   
   -> docker ps --format "{{.Names}} - {{.Image}} - {{.Ports}}"
   controller0 - 10.11.12.24:5000/whisk/controller:latest - 
0.0.0.0:10001->8080/tcp
   kafka - ches/kafka:0.10.2.1 - 7203/tcp, 0.0.0.0:9092->9092/tcp
   zookeeper - zookeeper:3.4 - 2888/tcp, 0.0.0.0:2181->2181/tcp, 3888/tcp
   registrator - gliderlabs/registrator -
   consul - consul:0.7.0 - 0.0.0.0:8300-8302->8300-8302/tcp, 
0.0.0.0:8400->8400/tcp, 0.0.0.0:8301-8302->8301-8302/udp, 
0.0.0.0:8500->8500/tcp, 0.0.0.0:8600->8600/udp, 8600/tcp
   couchdb - couchdb:1.6 - 0.0.0.0:5984->5984/tcp
   nginx - nginx:1.11 - 0.0.0.0:80->80/tcp, 0.0.0.0:443->443/tcp, 
0.0.0.0:8443->8443/tcp
   whisk_docker_registry - registry:2.3.1 - 0.0.0.0:5000->5000/tcp
   
   -> ps -eaf | grep docker
   root      1091     1  5 14:58 ?        00:06:00 /usr/bin/dockerd -H 
tcp://10.11.12.24:4243 -H unix:///var/run/docker.sock --storage-driver=aufs 
--insecure-registry 10.11.12.24:5000 --raw-logs
   root      1159  1091  2 14:58 ?        00:02:38 docker-containerd -l 
unix:///var/run/docker/libcontainerd/docker-containerd.sock --shim 
docker-containerd-shim --metrics-interval=0 --start-timeout 2m --state-dir 
/var/run/docker/libcontainerd/containerd --runtime docker-runc
   .
   .
   
   -> telnet 10.11.12.24 4243
   Trying 10.11.12.24...
   Connected to 10.11.12.24.
   Escape character is '^]'.
   ^C
   Connection closed by foreign host.
   
   -> curl http://10.11.12.24:4243/info
   
{"ID":"VK7F:GMEU:K6W4:3K5Y:ELVI:AIAG:LRW3:2N5T:K7PN:W5OR:MN5J:5KBA","Containers":8,"ContainersRunning":7,"ContainersPaused":0,"ContainersStopped":1,"Images":134,"Driver":"aufs","DriverStatus":[["Root
 Dir","/var/lib/docker/aufs"],["Backing 
Filesystem","extfs"],["Dirs","176"],["Dirperm1 
Supported","true"]],"SystemStatus":null,"Plugins":{"Volume":["local"],"Network":["bridge","overlay","host","null"],"Authorization":null},"MemoryLimit":true,"SwapLimit":false,"KernelMemory":true,"CpuCfsPeriod":true,"CpuCfsQuota":true,"CPUShares":true,"CPUSet":true,"IPv4Forwarding":true,"BridgeNfIptables":true,"BridgeNfIp6tables":true,"Debug":false,"NFd":45,"OomKillDisable":true,"NGoroutines":72,"SystemTime":"2017-07-05T16:52:19.604149476Z","ExecutionDriver":"","LoggingDriver":"json-file","CgroupDriver":"cgroupfs","NEventsListener":1,"KernelVersion":"4.4.0-31-generic","OperatingSystem":"Ubuntu
 14.04.5 
LTS","OSType":"linux","Architecture":"x86_64","IndexServerAddress":"https://index.docker.io/v1/","RegistryConfig":{"InsecureRegistryCIDRs":["127.0.0.0/8"],"IndexConfigs":{"10.11.12.24:5000":{"Name":"10.11.12.24:5000","Mirrors":[],"Secure":false,"Official":false},"docker.io":{"Name":"docker.io","Mirrors":null,"Secure":true,"Official":true}},"Mirrors":null},"NCPU":4,"MemTotal":8371515392,"DockerRootDir":"/var/lib/docker","HttpProxy":"","HttpsProxy":"","NoProxy":"","Name":"skg-faas-infra","Labels":null,"ExperimentalBuild":false,"ServerVersion":"1.12.0","ClusterStore":"","ClusterAdvertise":"","SecurityOptions":["apparmor"],"Runtimes":{"runc":{"path":"docker-runc"}},"DefaultRuntime":"runc","Swarm":{"NodeID":"","NodeAddr":"","LocalNodeState":"inactive","ControlAvailable":false,"Error":"","RemoteManagers":null,"Nodes":0,"Managers":0,"Cluster":{"ID":"","Version":{},"CreatedAt":"0001-01-01T00:00:00Z","UpdatedAt":"0001-01-01T00:00:00Z","Spec":{"Orchestration":{},"Raft":{},"Dispatcher":{},"CAConfig":{},"TaskDefaults":{}}}},"LiveRestoreEnabled":false}
   
   -> netstat -aptn
   Active Internet connections (servers and established)
   Proto Recv-Q Send-Q Local Address           Foreign Address         State    
   PID/Program name
   tcp        0      0 10.11.12.24:4243        0.0.0.0:*               LISTEN   
   1091/dockerd
   tcp        0      0 0.0.0.0:22              0.0.0.0:*               LISTEN   
   1372/sshd
   tcp        0      0 10.11.12.24:55964       10.11.12.24:4243        
TIME_WAIT   -
   tcp        0      0 172.17.0.1:48284        172.17.0.8:9092         
ESTABLISHED 7134/docker-proxy
   tcp        0      0 10.11.12.24:22          10.20.2.4:58544         
ESTABLISHED 1815/sshd: ubuntu [
   tcp        0      0 10.11.12.24:22          10.20.2.4:58592         
ESTABLISHED 1837/sshd: ubuntu [
   tcp        0      0 172.17.0.1:48332        172.17.0.8:9092         
ESTABLISHED 7134/docker-proxy
   tcp        0      0 172.17.0.1:48336        172.17.0.8:9092         
ESTABLISHED 7134/docker-proxy
   tcp        0      0 10.11.12.24:22          10.20.2.4:33246         
ESTABLISHED 2690/sshd: ubuntu [
   tcp        0      0 172.17.0.1:48428        172.17.0.8:9092         
ESTABLISHED 7134/docker-proxy
   tcp        0      0 172.17.0.1:48280        172.17.0.8:9092         
ESTABLISHED 7134/docker-proxy
   tcp        0      0 172.17.0.1:49428        172.17.0.5:8500         
ESTABLISHED 5897/docker-proxy
   tcp6       0      0 :::10001                :::*                    LISTEN   
   9648/docker-proxy
   tcp6       0      0 :::8500                 :::*                    LISTEN   
   5897/docker-proxy
   tcp6       0      0 :::22                   :::*                    LISTEN   
   1372/sshd
   tcp6       0      0 :::443                  :::*                    LISTEN   
   1720/docker-proxy
   tcp6       0      0 :::8443                 :::*                    LISTEN   
   1710/docker-proxy
   tcp6       0      0 :::5984                 :::*                    LISTEN   
   3121/docker-proxy
   tcp6       0      0 :::9092                 :::*                    LISTEN   
   7134/docker-proxy
   tcp6       0      0 :::2181                 :::*                    LISTEN   
   6799/docker-proxy
   tcp6       0      0 :::5000                 :::*                    LISTEN   
   1739/docker-proxy
   tcp6       0      0 :::8300                 :::*                    LISTEN   
   5955/docker-proxy
   tcp6       0      0 :::8301                 :::*                    LISTEN   
   5935/docker-proxy
   tcp6       0      0 :::8302                 :::*                    LISTEN   
   5917/docker-proxy
   tcp6       0      0 :::8400                 :::*                    LISTEN   
   5908/docker-proxy
   tcp6       0      0 :::80                   :::*                    LISTEN   
   1729/docker-proxy
   tcp6       0      0 10.11.12.24:9092        172.17.0.9:60928        
ESTABLISHED 7134/docker-proxy
   tcp6       0      0 10.11.12.24:9092        172.17.0.9:60980        
ESTABLISHED 7134/docker-proxy
   tcp6       0      0 10.11.12.24:10001       172.17.0.5:41858        
TIME_WAIT   -
   tcp6       0      0 10.11.12.24:9092        172.17.0.9:60976        
ESTABLISHED 7134/docker-proxy
   tcp6       0      0 10.11.12.24:8500        172.17.0.6:49122        
ESTABLISHED 5897/docker-proxy
   tcp6       0      0 10.11.12.24:9092        172.17.0.9:60924        
ESTABLISHED 7134/docker-proxy
   tcp6       0      0 10.11.12.24:9092        172.17.0.9:32840        
ESTABLISHED 7134/docker-proxy
   
   -> curl http://10.11.12.24:10001/ping
   curl: (7) Failed to connect to 10.11.12.24 port 10001: Connection refused
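
   To repeat this check without rerunning the playbook, here is a rough 
   Python sketch of a polling loop similar in spirit to the Ansible wait 
   task. The URL and the 12 retries come from the output above. The 
   function name `wait_for_ping`, the 5 second delay and the 5 second 
   request timeout are assumptions, not values taken from the playbook.

```python
import time
import urllib.error
import urllib.request


def wait_for_ping(url, retries=12, delay=5.0, fetch=None, sleep=time.sleep):
    """Poll `url` until it answers HTTP 200.

    Returns the attempt number that succeeded, or None if every attempt
    failed. `fetch` and `sleep` are injectable so the loop can be exercised
    without a network; both are hypothetical helpers for this sketch.
    """
    if fetch is None:
        def fetch(target):
            # urlopen raises URLError (an OSError) on "Connection refused"
            with urllib.request.urlopen(target, timeout=5) as resp:
                return resp.status

    for attempt in range(1, retries + 1):
        try:
            if fetch(url) == 200:
                return attempt
        except (urllib.error.URLError, OSError) as exc:
            print(f"attempt {attempt}/{retries} failed: {exc}")
        if attempt < retries:
            sleep(delay)
    return None
```

   Calling `wait_for_ping("http://10.11.12.24:10001/ping")` on the 
   controller host should return an attempt number once the controller 
   answers, and `None` while the connection is still refused.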
   
   -> cat /tmp/wsklogs/controller0/controller0_logs.log
   .
   .
   [2017-07-05T16:07:23.645Z] [INFO] Slf4jLogger started
   [2017-07-05T16:08:27.321Z] [INFO] Slf4jLogger started
   [2017-07-05T16:08:47.422Z] [INFO] [??] [Config] environment set value for 
consulserver.host
   [2017-07-05T16:08:47.596Z] [INFO] [??] [Config] environment set value for 
whisk.version.date
   [2017-07-05T16:08:47.608Z] [INFO] [??] [Config] environment set value for 
whisk.version.buildno
   [2017-07-05T16:08:47.620Z] [INFO] [??] [Config] environment set value for 
consul.host.port4
   [2017-07-05T16:08:47.628Z] [INFO] [??] [Config] environment set value for 
port
   [2017-07-05T16:08:47.652Z] [INFO] [??] [WhiskConfig] reading properties from 
consul at 10.11.12.24:8500
   [2017-07-05T16:10:29.082Z] [WARN] [??] [WhiskConfig] failed to read 
properties from consul: Futures timed out after [1 minute]
   [2017-07-05T16:10:30.297Z] [ERROR] [??] **[Config] required property db.port 
still not set**
   [2017-07-05T16:10:30.441Z] [ERROR] [??] **[Controller] Bad configuration, 
cannot start.**
   [2017-07-05T16:11:31.111Z] [INFO] Slf4jLogger started
   [2017-07-05T16:11:47.210Z] [INFO] [??] [Config] environment set value for 
consulserver.host
   [2017-07-05T16:11:47.518Z] [INFO] [??] [Config] environment set value for 
whisk.version.date
   [2017-07-05T16:11:47.611Z] [INFO] [??] [Config] environment set value for 
whisk.version.buildno
   [2017-07-05T16:11:47.616Z] [INFO] [??] [Config] environment set value for 
consul.host.port4
   [2017-07-05T16:11:47.764Z] [INFO] [??] [Config] environment set value for 
port
   [2017-07-05T16:11:47.769Z] [INFO] [??] [WhiskConfig] reading properties from 
consul at 10.11.12.24:8500
   [2017-07-05T16:13:21.270Z] [WARN] [??] [WhiskConfig] failed to read 
properties from consul: Futures timed out after [1 minute]
   [2017-07-05T16:13:22.227Z] **[ERROR] [??] [Config] required property db.port 
still not set**
   [2017-07-05T16:13:22.307Z] **[ERROR] [??] [Controller] Bad configuration, 
cannot start.**
   [2017-07-05T16:14:29.321Z] [INFO] Slf4jLogger started
   [2017-07-05T16:14:47.801Z] [INFO] [??] [Config] environment set value for 
consulserver.host
   [2017-07-05T16:14:47.988Z] [INFO] [??] [Config] environment set value for 
whisk.version.date
   [2017-07-05T16:14:48.006Z] [INFO] [??] [Config] environment set value for 
whisk.version.buildno
   [2017-07-05T16:14:48.016Z] [INFO] [??] [Config] environment set value for 
consul.host.port4
   [2017-07-05T16:14:48.020Z] [INFO] [??] [Config] environment set value for 
port
   [2017-07-05T16:14:48.052Z] [INFO] [??] [WhiskConfig] reading properties from 
consul at 10.11.12.24:8500
   [2017-07-05T16:16:37.235Z] [INFO] [#sid_130] [Controller] starting 
controller instance 0 [marker:controller_startup0_count:110630]
   [2017-07-05T16:16:39.753Z] [INFO] [??] [LoadBalancerService] 
blackboxFraction = 0.1
   [2017-07-05T16:17:03.289Z] [INFO] [#sid_120] [WhiskAction] [GET] serving 
from datastore: id: whisk.system/invokerHealthTestAction 
[marker:database_cacheMiss_count:137033]
   .
   .
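
   The restart cycles in the log above can be summarised with a small 
   script. This is only a sketch: `summarize_attempts` and the substrings 
   it matches are my own choices based on the lines shown here, and it 
   assumes the log is read from the file on disk, where each entry is on a 
   single line (not wrapped as in this message).

```python
import re
from datetime import datetime

# Matches the leading timestamp of a controller log line, e.g.
# [2017-07-05T16:08:47.652Z] [INFO] ...
LINE = re.compile(r"\[(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}\.\d{3})Z\]\s*(.*)")


def summarize_attempts(log_text):
    """Group log lines into start attempts and report how each one ended."""
    attempts = []
    current = None
    for raw in log_text.splitlines():
        match = LINE.search(raw)
        if not match:
            continue
        stamp = datetime.strptime(match.group(1), "%Y-%m-%dT%H:%M:%S.%f")
        message = match.group(2)
        if "Slf4jLogger started" in message:
            # Each JVM start opens a new attempt.
            current = {"started": stamp, "consul_read": None,
                       "outcome": "unknown", "wait_seconds": None}
            attempts.append(current)
        elif current is None:
            continue
        elif "reading properties from" in message:
            current["consul_read"] = stamp
        elif "Bad configuration" in message or "[Controller] starting" in message:
            current["outcome"] = ("failed" if "Bad configuration" in message
                                  else "started")
            if current["consul_read"] is not None:
                # Time between the consul read and the final outcome.
                delta = stamp - current["consul_read"]
                current["wait_seconds"] = delta.total_seconds()
    return attempts
```

   Run against the excerpt above, this should show two attempts that end 
   in "failed" after the consul read times out, and a later attempt that 
   ends in "started".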
   =======
   Please suggest.
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services
