A couple hundred hosts we manage with ansible are HPC nodes. Sometimes during normal use a portion of these hosts may be heavily loaded. In this case the hosts will allow an ssh connection within the timeout constraints, but tasks exec'd by ansible may take a very long time to finish, delaying completion of the ansible-playbook completion for this group by hours. In our environment ansible runs nightly with the latest configs from git and at this point it's been two days since this group of research systems has completed a run because any given night a few hosts have very high load. Certainly I could script a check for this situation that excludes hosts, but is there another configuration option than ssh timeout that would better handle this situation? Is anyone else experiencing this situation?
-- You received this message because you are subscribed to the Google Groups "Ansible Project" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/ansible-project/65445d47-38e4-448d-a410-ab7d6b6ab2ad%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
