Hi Ashish,

We recommend using 'sudo service <process> stop' to restart a service. This works because monit automatically restarts the service when it notices that it has stopped.
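A minimal sketch of this stop-and-wait approach is below. It is an illustration, not part of Clearwater: the helper names (is_running, stop_and_wait) are hypothetical, and it assumes 'monit summary' prints lines of the form "Process 'sprout'    Running", which may differ between monit versions.

```python
import subprocess
import time


def is_running(summary_text, process):
    """Return True if the summary text reports the process as Running.

    Assumes lines shaped like: Process 'sprout'    Running
    (an assumption; the layout varies between monit versions).
    """
    prefix = f"Process '{process}'"
    for line in summary_text.splitlines():
        line = line.strip()
        if line.startswith(prefix):
            return line[len(prefix):].strip().lower() == "running"
    return False


def _run(cmd):
    """Run a command and return its standard output as text."""
    result = subprocess.run(cmd, capture_output=True, text=True, check=False)
    return result.stdout


def stop_and_wait(process, run=_run, sleep=time.sleep, attempts=12, delay=5.0):
    """Stop the service, then poll monit until it has restarted it.

    Returns True once monit reports the process as running, or False if
    it has not come back after `attempts` polls.
    """
    run(["sudo", "service", process, "stop"])
    for _ in range(attempts):
        sleep(delay)
        if is_running(run(["sudo", "monit", "summary"]), process):
            return True
    return False
```

Calling stop_and_wait("sprout") would stop Sprout and poll for up to a minute. The run and sleep parameters exist only so the logic can be exercised without touching a real system.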
* Using 'service <process> restart' can cause multiple instances of the same process to run, as both monit and service could end up creating a new instance of the process.
* Using 'monit restart <process>' can cause the poll_process scripts to fail while monit is restarting the process, which in turn causes monit to abort the process - see https://github.com/Metaswitch/homestead/issues/149 for an example of this.

If Homer and Homestead-prov don't start running by themselves, then (if you're running a version of Clearwater older than Yoshi's Island) you may have hit this issue: https://github.com/Metaswitch/crest/issues/192. To check this, look for the line 'Couldn't listen on any:/tmp/.<process>-sock-0: [Errno 98] Address already in use' in the logs at /var/log/<process>/<process>-err.log. If the line is present, delete the file '/tmp/.<process>-sock-0', and the process should restart.

There is no overall command to start/stop all Clearwater services.

Hope this helps,
Ellie

From: Ashish Maheshwari -X (ashimahe - L & T TECHNOLOGY SERVICES LIMITED at Cisco) [mailto:[email protected]]
Sent: 24 February 2015 14:38
To: Eleanor Merry; [email protected]
Cc: Chinna Morampalle (cmorampa); Satheesh Marappan (sathkuma)
Subject: RE: SCSCF sending repetetive MAR to HSS

Hi Ellie,

I would like to share a few of the issues I am facing with CW IMS:

Whenever I make config changes in /etc/clearwater/config and restart the service, I later observe that a few of the services have failed, and they do not come back up even after a restart. I have a few queries: when you suggest using monit to restart a service, does that mean just running "monit" as the command, or is "monit start service" the right way to start it? And what is the command to restart all services in one go?
When I used the config below to integrate with an external HSS and then restarted the services, I observed that a few services failed:

# Deployment definitions
home_domain=example.com
sprout_hostname=50.50.13.6
hs_hostname=50.50.13.6:8888
#hs_provisioning_hostname=10.105.244.84:8889
xdms_hostname=50.50.13.6:7888
chronos_hostname=

# Local IP configuration
local_ip=50.50.13.6
public_ip=10.105.244.84
public_hostname=10.105.244.84

# Email server configuration
smtp_smarthost=127.0.0.1
smtp_username=username
smtp_password=password
[email protected]

# I-CSCF/S-CSCF configuration
upstream_hostname=50.50.13.6

#HSS Configuration
hss_hostname=50.50.13.100
hss_host=3868

monit status

Program 'poll_homestead-prov'
  status                  Status failed
  monitoring status       Monitored
  last started            Tue, 24 Feb 2015 07:02:11
  last exit value         1
  data collected          Tue, 24 Feb 2015 07:02:28

Process 'homestead-prov'
  status                  Execution failed
  monitoring status       Monitored
  data collected          Tue, 24 Feb 2015 07:02:58

Program 'poll_homer'
  status                  Status failed
  monitoring status       Monitored
  last started            Tue, 24 Feb 2015 07:02:58
  last exit value         1
  data collected          Tue, 24 Feb 2015 07:02:58

Process 'homer'
  status                  Does not exist
  monitoring status       Monitored
  data collected          Tue, 24 Feb 2015 07:02:11

I restarted the homestead services, but these services still didn't come up. I just want to know whether these services are independent or are associated with homestead, and whether there is any need to restart them separately.

I am attaching a pcap file in which I am seeing a 403 error. Please help me with the config file, because once I made the changes in the config a few services stopped.
Thanks & Regards,
Ashish Maheshwari

From: Eleanor Merry [mailto:[email protected]]
Sent: 20 February 2015 19:11
To: Ashish Maheshwari -X (ashimahe - L & T TECHNOLOGY SERVICES LIMITED at Cisco); [email protected]
Cc: Chinna Morampalle (cmorampa); Satheesh Marappan (sathkuma)
Subject: RE: SCSCF sending repetetive MAR to HSS

Hi Ashish,

It looks like there's a problem with memcached - from the Sprout logs:

20-02-2015 04:07:13.921 UTC Debug memcachedstore.cpp:649: memcached_add command for av\\[email protected]\46fa0be12dbafeef failed on replica 0, rc = 47 (SERVER HAS FAILED AND IS DISABLED UNTIL TIMED RETRY), expiry = 40 (140124113689584) SERVER HAS FAILED AND IS DISABLED UNTIL TIMED RETRY, host: 10.105.244.117:11211 -> libmemcached/connect.cc:633
20-02-2015 04:07:13.921 UTC Error memcachedstore.cpp:708: Failed to write data for av\\[email protected]\46fa0be12dbafeef to 1 replicas
20-02-2015 04:07:13.921 UTC Error avstore.cpp:73: Failed to write Authentication Vector for private_id [email protected]

Are there any memcached logs in /var/log/memcached.log or in /var/log/syslog? Also, can you check whether any errors about memcached are reported when Sprout initially attempts to connect to it? You'll need to restart Sprout to see this ('sudo service sprout stop' - it'll be restarted by monit).

The memcached error is causing the repetitive MARs. When the S-CSCF receives the REGISTER from the P-CSCF, it first checks its authorization store for the subscriber. On the initial REGISTER this is empty, so the S-CSCF makes a MAR to the HSS. It then writes the returned auth information into the store (this step is failing in your test), and returns a 401 to the P-CSCF. When the S-CSCF then receives the REGISTER with authorization, it checks this against its stored information. However, in your case, Sprout looks up the subscriber in the auth store, doesn't find anything (as the earlier write failed), and so makes another MAR to the HSS.
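The flow described above can be sketched as a toy model. This is not Sprout's code: the AuthStore class and handle_register function are hypothetical names used only for illustration, credential checking is omitted, and the sketch assumes that a successful authenticated REGISTER leads to a SAR.

```python
class AuthStore:
    """Toy stand-in for the memcached-backed authentication vector store."""

    def __init__(self, writes_fail=False):
        self.writes_fail = writes_fail
        self._vectors = {}

    def get(self, private_id):
        """Return the stored vector for this subscriber, or None."""
        return self._vectors.get(private_id)

    def put(self, private_id, vector):
        """Store a vector; return False if the write is lost."""
        if self.writes_fail:
            return False  # models "Failed to write Authentication Vector"
        self._vectors[private_id] = vector
        return True


def handle_register(store, private_id, diameter_log):
    """Process one REGISTER and return the SIP status sent to the P-CSCF.

    Each Diameter request made to the HSS is appended to diameter_log.
    """
    if store.get(private_id) is None:
        # Nothing stored for this subscriber: fetch a vector from the HSS,
        # try to store it, and challenge the UE. The write result is
        # ignored here, which is why a failed write still yields a 401.
        diameter_log.append("MAR")
        store.put(private_id, "vector-from-MAA")
        return 401
    # A vector is stored, so the credentials could be checked against it
    # (omitted in this sketch) and the registration recorded with a SAR.
    diameter_log.append("SAR")
    return 200


if __name__ == "__main__":
    for label, store in (("healthy store", AuthStore()),
                         ("failing store", AuthStore(writes_fail=True))):
        log = []
        codes = [handle_register(store, "alice", log) for _ in range(2)]
        print(label, codes, log)
```

With a healthy store, the two REGISTERs produce a MAR followed by a SAR. With writes failing, both REGISTERs produce a MAR and a 401, which matches the repeated MARs seen in the trace.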
We should probably return a better code to the P-CSCF if the write fails (a 500 rather than a 401) - I'll look into this. In the meantime, can you check the memcached process?

Ellie

From: Ashish Maheshwari -X (ashimahe - L & T TECHNOLOGY SERVICES LIMITED at Cisco) [mailto:[email protected]]
Sent: 20 February 2015 11:45
To: Eleanor Merry; [email protected]
Cc: Chinna Morampalle (cmorampa); Satheesh Marappan (sathkuma)
Subject: SCSCF sending repetetive MAR to HSS

Hi Ellie,

I am facing an issue where the SCSCF sends repetitive MAR requests to the HSS instead of a SAR, even though the first MAA reports Diameter success. The detailed call flow is:

UE----SIP Register -------PCSCF(Bono)
PCSCF-----sip reg frwrd------SCSCF(sprout)
SCSCF-------MAR------------HSS
HSS-------MAA------------SCSCF
SCSCF--------401 unaut------PCSCF
PCSCF----------401 unaut----UE
UE----SIP Register -------PCSCF(Bono)
PCSCF-----sip reg frwrd------SCSCF(sprout)
SCSCF-------MAR------------HSS (instead of SAR)

I am attaching the pcap, monit status, config, and the bono and sprout log files.

Regards,
Ashish

Thanks & Regards,
Ashish Maheshwari
_______________________________________________
Clearwater mailing list
[email protected]
http://lists.projectclearwater.org/listinfo/clearwater
