Hi, I'm having a strange issue. I have agents that normally report to the 
manager just fine, but after an undetermined amount of time, this appears 
in the logs

2019/12/16 01:20:55 rootcheck: INFO: Starting rootcheck scan.
2019/12/16 01:40:58 rootcheck: INFO: Ending rootcheck scan.
2019/12/16 13:18:52 ossec-agentd: WARN: Server unavailable. Setting lock.
2019/12/16 13:19:13 ossec-agentd(4101): WARN: Waiting for server reply (not 
started). Tried: 'SERVER-IP'.
2019/12/16 13:19:15 ossec-agentd: INFO: Trying to connect to server SERVER-
IP, port 1514.
2019/12/16 13:19:15 INFO: Connected to SERVER-IP at address SERVER-IP, port 
1514
2019/12/16 13:19:36 ossec-agentd(4101): WARN: Waiting for server reply (not 
started). Tried: 'SERVER-IP'.
2019/12/16 13:19:56 ossec-agentd: INFO: Trying to connect to server SERVER-
IP, port 1514.
2019/12/16 13:19:56 INFO: Connected to SERVER-IP at address SERVER-IP, port 
1514
2019/12/16 13:20:17 ossec-agentd(4101): WARN: Waiting for server reply (not 
started). Tried: 'SERVER-IP'.
2019/12/16 13:20:51 ossec-logcollector: WARN: Process locked. Waiting for 
permission...
2019/12/16 13:20:55 ossec-agentd: INFO: Trying to connect to server SERVER-
IP, port 1514.
2019/12/16 13:20:55 INFO: Connected to SERVER-IP at address SERVER-IP, port 
1514
2019/12/16 13:21:16 ossec-agentd(4101): WARN: Waiting for server reply (not 
started). Tried: 'SERVER-IP'.
2019/12/16 13:22:12 ossec-agentd: INFO: Trying to connect to server SERVER-
IP, port 1514.
2019/12/16 13:22:12 INFO: Connected to SERVER-IP at address SERVER-IP, port 
1514
2019/12/16 13:22:33 ossec-agentd(4101): WARN: Waiting for server reply (not 
started). Tried: 'SERVER-IP'.
2019/12/16 13:23:47 ossec-agentd: INFO: Trying to connect to server SERVER-
IP, port 1514.
2019/12/16 13:23:47 INFO: Connected to SERVER-IP at address SERVER-IP, port 
1514

There's nothing in the manager logs to indicate any sort of issue, and 
other agents that are connected to the same manager keep on reporting fine. 
I have some agents that disconnect after a few hours, and others that have 
been connected for weeks without issue, though the large majority do end up 
disconnecting at some point. If I manually remove the agent from the 
manager, and then get a new key with `agent-auth` & `agent-authd` it 
continues working as normal. I've already tried configuring the 
`notify_time` to 60. I also have turned on debugging for a few agents, but 
due to the seeming randomness of the disconnects, I'd like to avoid waiting 
weeks to finally get a useful log / disconnect. The server is v3.3.0 and 
agents are generally either v3.2.0 or v3.3.0

Also I'm aware I can try switching from UDP to TCP, however that would 
require reconfiguring 100s of servers across a half dozen environments, so 
I'd like to avoid doing that unless I'm certain it will be the fix.


-- 
This email and its attachments are confidential and may be privileged.  Any 
unauthorized use or disclosure is prohibited.  If you receive this email in 
error, please notify the sender and permanently delete the original without 
forwarding, making any copies or disclosing its contents. NextCapital is a 
brand name representing NextCapital Group, Inc. and its subsidiaries, 
NextCapital Software, Inc. and NextCapital Advisers, Inc. 

-- 

--- 
You received this message because you are subscribed to the Google Groups 
"ossec-list" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to ossec-list+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/ossec-list/6d4bbd12-8031-4be2-be52-0aeeecc4772f%40googlegroups.com.

Reply via email to