Bump Ansible SSH ControlPersist param to 300s #2510

Draft
wants to merge 1 commit into
base: main

Commits on Nov 4, 2024

  1. Bump Ansible SSH ControlPersist param to 300s

    We see the following error downstream [1]. [2] indicates that it is
    caused by a failure of the SSH socket Ansible creates to multiplex
    connections.
    
    Rather than handling the failure, Ansible fails the task. This
    reportedly happens around 30% of the time on a very long-running CI job.
    
    300s was selected as it's used by the openstack-ansible project [3] [4].
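    
    For illustration only, a minimal sketch of what such a setting can look
    like, assuming it is applied through the ssh_args option in ansible.cfg
    (the file this change actually touches is not shown here). The other
    flags are Ansible's documented defaults; only ControlPersist differs:
    
    ```ini
    # Hypothetical ansible.cfg fragment, not taken from this change.
    [ssh_connection]
    # Keep the multiplexing control socket alive for 300s after the last
    # connection closes (Ansible's default is ControlPersist=60s).
    ssh_args = -C -o ControlMaster=auto -o ControlPersist=300s
    ```
    
    The same string can also be supplied through the ANSIBLE_SSH_ARGS
    environment variable.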
    
    Jira: https://issues.redhat.com/browse/OSPRH-10719
    
    [1]
    ```
    TASK [reproducer : Ensure we can ping controller-0 from ctlplane _raw_params=ping -c2 controller-0.utility] ***
    fatal: [hypervisor -> ceph-1(ceph-1.hypervisor)]: FAILED! => changed=false
      module_stderr: ''
      module_stdout: ''
      msg: |-
        MODULE FAILURE
        See stdout/stderr for the exact error
      rc: -13
    ```
    
    [2] ansible/ansible#78344
    [3]
    https://opendev.org/openstack/openstack-ansible/src/commit/32c6aa2cec1a2145e2c20a37df23f8b4e4b93e4c/scripts/openstack-ansible.rc#L52
    [4] https://opendev.org/openstack/openstack-ansible/commit/cbdba67ad0b5a3e29db390c8e6b66721719184c0
    lewisdenny committed Nov 4, 2024
    791fbf3