I've had moderate success by changing the ControlPersist in ssh_args parameter in ansible.cfg:
ssh_args = -o ControlMaster=auto -o ControlPersist=300s That said, on big playbooks with a couple hundred hosts/tasks I generally still have at least one or two generic connection failures. Like you it happens on different tasks/hosts every time for me. On Friday, 29 July 2016 10:49:17 UTC-6, Jeff Richards wrote: > > Hi, > > Is there any way, in Ansible 1.9.6, of having Ansible retry a failed > connection attempt x times at y intervals? We are seeing flaky network > behavior when deploying on AWS, with apparently random ssh failures causing > runs to break. > > It's never in the same place twice so it would be nice if we could have > Ansible back off and retry before bailing. > > If not in 1.9.6 (which we're stuck on for another few months .. .OpenShift > reasons), how about 2.x? > > Jeff > -- You received this message because you are subscribed to the Google Groups "Ansible Project" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/ansible-project/5901060b-8862-4531-9260-5f414e4d9199%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
