# Ruck Rover 2022-08-12 to 2022-08-18 ###### tags: `ruck_rover` ###### Previous RR notes: https://hackmd.io/5zEei6JoTSmZ9mXlN5bxow [Cockpit](http://dashboard-ci.tripleo.org/d/HkOLImOMk/upstream-and-rdo-promotions?orgId=1) [Downstream cockpit](http://tripleo-cockpit.lab4.eng.bos.redhat.com) [OpenStack Program Meeting 2022]( https://docs.google.com/document/d/1n6ArkMh68R9zivjlyGbpedkggk1wMwEIcrMZSN2uIjc/edit) [Downstream promoter](http://10.0.110.143/promoter_logs/) --- Ongoing Bugs Blocking: Wallaby Check/Gate * https://bugs.launchpad.net/tripleo/+bug/1986960 Wallaby Ceph based scenarios(SC01/SC04/SC010) are failing with "ERROR: Container release quincy != cephadm release pacific * https://bugs.launchpad.net/tripleo/+bug/1986755 Master/wallaby Security component "Standalone with ipa" and fs039 job are failing with ERROR! couldn't resolve module/action 'freeipa.ansible_freeipa.ipahost'. This often indicates a misspelling, missing collection, or incorrect module path. * Xek proposed: https://review.opendev.org/853478 * ipa job running in check itself - need to work with security team to continue debug/fixing. Degraded: https://bugzilla.redhat.com/show_bug.cgi?id=2119349 -Component and Integration line jobs intermittently failing with retry_limit * https://bugs.launchpad.net/tripleo/+bug/1983817 - Jobs failing with retry * Still happening - Need to capture a node for debug, but job is failing at early stage - hard to hold a node because issue is random + our keys not pushed till that point.(so even if we hold the node - without infra can't enter node and infra is out today because of recharge day) * https://bugs.launchpad.net/tripleo/+bug/1986708 - opendev.org ssh failure * Seems to be affecting all jobs randomly. Could be helpful to enable ansible debug but it's tricky to catch a failing job. * Doesn't seem to be happening anymore. https://review.opendev.org/c/openstack/project-config/+/853536 proposed by opendev folks to mitigate. * https://trello.com/c/3p8i2YdZ/2639-cixlp1982874tripleociproa-testcreateobjectwithtransferencoding-is-failing-on-tripleo-jobs * Christian is out and will be after next week, we will share node with him once he is back. --- ### New bug: * https://bugzilla.redhat.com/show_bug.cgi?id=2119349 Component and Integration line jobs intermittently failing with retry_limit * https://bugs.launchpad.net/tripleo/+bug/1986960 Wallaby Ceph based scenarios(SC01/SC04/SC010) are failing with "ERROR: Container release quincy != cephadm release pacific #### 18th Aug Check/Gate following in recheck * https://review.opendev.org/c/openstack/tripleo-common/+/848597/ * https://review.opendev.org/c/openstack/python-tripleoclient/+/852645/ ssh error seen in one more job:- https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_649/852532/5/gate/tripleo-ci-centos-9-content-provider/649da27/job-output.txt ~~~ 2022-08-18 01:14:06.982440 | primary | failed: [undercloud] (item=quay.io/prometheus/node-exporter:v1.3.1) => {"ansible_loop_var": "item", "item": "quay.io/prometheus/node-exporter:v1.3.1", "msg": "Failed to connect to the host via ssh: kex_exchange_identification: Connection closed by remote host\r\nConnection closed by 127.0.0.2 port 22", "unreachable": true} ~~~ --- #### 17th Aug #### New Bugs https://bugs.launchpad.net/tripleo/+bug/1986755 Master/wallaby Security component "Standalone with ipa" and fs039 job are failing with ERROR! couldn't resolve module/action 'freeipa.ansible_freeipa.ipahost'. This often indicates a misspelling, missing collection, or incorrect module path. * Xek proposed: https://review.opendev.org/853478 * ipa job running in check itself - need to work with security team to continue debug/fixing. #### Check/gate Below patches in gate failed with https://bugs.launchpad.net/tripleo/+bug/1986708 - random ssh issue , All below patches are in recheck. https://review.opendev.org/c/openstack/python-tripleoclient/+/852645/ https://review.opendev.org/c/openstack/tripleo-ansible/+/852513/ https://review.opendev.org/c/openstack/tripleo-common/+/848597/ #### Master - last promotion - 17th (Green) Tracking today's hash here: https://review.rdoproject.org/r/c/testproject/+/38348 #### Master - component * security - fs039/ipa - bug reported - https://bugs.launchpad.net/tripleo/+bug/1986755 #### Wallaby c9 - last promotion - 17th Tracking today's hash here: https://review.rdoproject.org/r/c/testproject/+/39357 #### Wallaby c9 components * security - ipa/fs039 - bug reported - https://bugs.launchpad.net/tripleo/+bug/1986755 rerunning here: https://review.rdoproject.org/r/c/testproject/+/44516 #### Wallaby c8 - last promotion - 15th - Green No new hash when I checked in morning. #### Wallaby c8 components - All green #### Train c8 - last promotion - 16th - Green No new hash when I checked in morning. #### Train c8 components - All green #### 16.2/8 - promoted 17th Aug - All green #### 16.2/8 - component - Green now #### 17/9 - promoted 17th Aug - All green #### 17/9 - component - All green now #### 17.1/9 - promoted 17th Aug - All green #### 17/8 Rerunning failing jobs here: https://code.engineering.redhat.com/gerrit/c/testproject/+/403445 #### 17/8 component line * cloudops * tripleo - ovb Rerunning here: https://code.engineering.redhat.com/gerrit/c/testproject/+/424969 --- 16th Aug #### Master - last promotion - 12th * Tracking here: fs39 remaining * https://review.rdoproject.org/r/c/testproject/+/38348 * ~~https://code.engineering.redhat.com/gerrit/c/testproject/+/201382~~ #### Master - component * baremetal - fs001 * security - many jobs failing(line running right now) Didn't debug, blank recheck first here: https://review.rdoproject.org/r/c/testproject/+/28446 #### Wallaby c9 - last promotion - 12th * Tracking here: fs035 left * https://review.rdoproject.org/r/c/testproject/+/44516 * ~~https://code.engineering.redhat.com/gerrit/c/testproject/+/209874~~ #### Wallaby c9 - component * baremetal - ovb failing * tripleo - sc04/ovb failing * security - ipa/fs039 failing Didn't debug, blank recheck first here: https://review.rdoproject.org/r/c/testproject/+/42657 #### Wallaby c8 - last promotion - 15th(Green) #### Wallaby c8 component line - green #### Train C8 - Last promotion - 16th(Green) #### Train c8 component line - green #### Downstream We changed private network - wohoo passed earlier metadata issue. https://code.engineering.redhat.com/gerrit/c/openstack/sf-config/+/424881 #### 17/9 * bm fs001 failed * https://code.engineering.redhat.com/gerrit/c/testproject/+/424969 #### 16.2/8 * No new hash for integration line #### 16.2/8 components * baremetal - ovb https://code.engineering.redhat.com/gerrit/c/testproject/+/209874 --- 15th Aug ## New bugs ~~https://bugs.launchpad.net/tripleo/+bug/1986502~~ - master/wallaby/train lines are impacted with node_failure, started on 13th Aug, 2022 ~~https://bugs.launchpad.net/tripleo/+bug/1985981~~ - standalone job failing with Error: container-init binary not found on the host: stat /usr/libexec/podman/catatonit: no such file or directory" ## Check/Gate ### Promotion status ## Master - 12th Aug Blocked on https://bugs.launchpad.net/tripleo/+bug/1985981 and https://bugs.launchpad.net/tripleo/+bug/1986502 trying to fix catatonit issue via https://review.rdoproject.org/r/c/testproject/+/44527 ## Wallaby C9 - 12th Aug https://bugs.launchpad.net/tripleo/+bug/1986502 - master/wallaby/train lines are impacted with node_failure, started on 13th Aug, 2022 ## Wallaby C8 - 15th Aug ~~chasing promotion here: https://review.rdoproject.org/r/c/testproject/+/44522~~ ## Train - 13th Aug chasing promotion here: https://review.rdoproject.org/r/c/testproject/+/44526 ## Downstream Explored possibility of enabling config-drive in downstream but need some info from infra first * Left comment here: https://trello.com/c/6ZWMl7pM/2662-cixbz2116287osp162osp17no-promotions-occurs-due-to-nodefailure-at-downstream-on-weekend-since-7th-august 12th Aug ## New bugs https://bugs.launchpad.net/tripleo/+bug/1985981 - Sc010 kvm internal job failing with Error: container-init binary not found on the host: stat /usr/libexec/podman/catatonit: no such file or directory" Gate blocker * ~~https://bugs.launchpad.net/tripleo/+bug/1984175/~~ * Mirror came back in sync:- * https://lists.centos.org/pipermail/centos-devel/2022-August/120525.html Master promotion blocker ~~https://bugs.launchpad.net/tripleo/+bug/1984184~~ * Fix: https://review.opendev.org/c/openstack/tripleo-heat-templates/+/852790 merged * fs035 passed with above fix - https://review.rdoproject.org/r/c/testproject/+/36254 * ~~https://bugs.launchpad.net/tripleo/+bug/1984453~~ * ~~Same mirror issue - rekicked fs002 via testproject to confirm green run https://review.rdoproject.org/r/c/testproject/+/28537~~ ## Check/Gate * https://5e6d48fe25fecf1b2eda-cc994f454b3da94a07b55a097da7db60.ssl.cf5.rackcdn.com/852880/3/check/openstack-tox-linters/0abde65/job-output.txt ~~~ 2022-08-11 15:29:01.824649 | ubuntu-focal | TypeError: 'NoneType' object is not iterable 2022-08-11 15:29:01.824656 | ubuntu-focal | error: Support for editable installs via PEP 660 was recently introduced 2022-08-11 15:29:01.824668 | ubuntu-focal | in `setuptools`. If you are seeing this error, please report to: 2022-08-11 15:29:01.824676 | ubuntu-focal | 2022-08-11 15:29:01.824683 | ubuntu-focal | https://github.com/pypa/setuptools/issues 2022-08-11 15:29:01.824690 | ubuntu-focal | 2022-08-11 15:29:01.824697 | ubuntu-focal | Meanwhile you can try the legacy behavior by setting an 2022-08-11 15:29:01.824704 | ubuntu-focal | environment variable and trying to install again: 2022-08-11 15:29:01.824711 | ubuntu-focal | 2022-08-11 15:29:01.824718 | ubuntu-focal | SETUPTOOLS_ENABLE_FEATURES="legacy-editable" 2022-08-11 15:29:01.824725 | ubuntu-focal | [end of output] ~~~ Seen last night - rlandy updated tox.ini in undercloud non-voting patch, but tox-py jobs were getting timedout. ysandeep have removed tox.ini changes and linters passed in current run.(not seeing the issue anymore) ### Promotion status # Master - 01st Aug (11 days old) * blocked till tht merges: https://review.rdoproject.org/r/c/testproject/+/36254 (+wed) * fs035 passed with this patch in depends-on * tracking two hashes with depends on above patch * 4e352c3ada5a2e91b161ff220dc42d85 : https://review.rdoproject.org/r/c/testproject/+/38348 * Need to skip internal sc10 kvm job - bug reported(as same job passing in vexx I think safe to skip and promote if 035/34 passes) * 120b06809f1abde369d06cef81ded6ab: https://review.rdoproject.org/r/c/testproject/+/36254 * just waiting for 002 # Wallaby C9 - 09th Aug (03 days old) * 120b06809f1abde369d06cef81ded6ab: fs20 failed * rerunning here https://review.rdoproject.org/r/c/testproject/+/39357 # Wallaby C8 - 11th Aug (Green) # Train - 09th Aug * tracking here: https://review.rdoproject.org/r/c/testproject/+/44516 # Downstream Explored possibility of enabling config-drive in downstream but need some info from infra first * Left comment here: https://trello.com/c/6ZWMl7pM/2662-cixbz2116287osp162osp17no-promotions-occurs-due-to-nodefailure-at-downstream-on-weekend-since-7th-august