###### tags: `ruck_rover` ###### Previous RR notes: https://hackmd.io/oEM6x1zCS0y9WhH2XGQI0w ##### ruck & rover: Cedric ##### ruck & rover alias: @ruckroverciframework [Next Gen cockpit](https://prometheus.monitoring.softwarefactory-project.io/grafana/d/5JC-ru3Vz/github-projects-workflow-status?orgId=1) * [doc](https://docs.google.com/document/d/14CcHuMEACfXpAy1HkSc8s_CpN1KNC_POG2lzLnbKYLM/edit) [RDO Cockpit](http://dashboard-ci.tripleo.org/d/HkOLImOMk/upstream-and-rdo-promotions?orgId=1) / [RHOS Cockpit](http://tripleo-cockpit.lab4.eng.bos.redhat.com) [RDO Promoter](http://promoter.rdoproject.org/promoter_logs/) / [RHOS Promoter](http://10.0.110.143/promoter_logs/) [OpenStack Program Meeting indexes](https://docs.engineering.redhat.com/pages/viewpage.action?spaceKey=PRODCHAIN&title=Meeting+notes) [Modernization CI](https://hackmd.io/f9JzEVjMQC-hkJu1L9eolw) Zuul Status: * [opendev.org:openstack](https://zuul.opendev.org/t/openstack/status/) * [rdoproject.org:rdoproject.org](https://review.rdoproject.org/zuul/status) * [redhat.com:tripleo-ci-internal](https://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/status) Prow Status * [Operators - Check/Gate failures](https://prow.ci.openshift.org/?job=pull-ci-openstack-k8s-operators*&state=failure) * For now, we just look (and investigate) if a test has multiple failures across different PRs * [Operators - Check/Gate status](https://prow.ci.openshift.org/?job=pull-ci-openstack-k8s-operators*) * [pre-commit jobs](https://prow.ci.openshift.org/?job=pull-ci-openstack-k8s-operators*-main-precommit-check&state=failure) * [pre-commit jobs status](https://prow.ci.openshift.org/?job=pull-ci-openstack-k8s-operators*-main-precommit-check) * [unit tests jobs](https://prow.ci.openshift.org/?job=pull-ci-openstack-k8s-operators*-operator-main-unit&state=failure) * [kuttl tests Jobs](https://prow.ci.openshift.org/?job=pull-ci-openstack-k8s-operators*-operator-build-deploy-kuttl&state=failure) * [Tempest Jobs](https://prow.ci.openshift.org/?job=pull-ci-openstack-k8s-operators*-operator-build-deploy-tempest&state=failure) [Review List](/FGMoCiRfSNa8puA1BpTQ-Q) #### Podifed Container Build Lines * [openstack-periodic-container-master-centos9](https://review.rdoproject.org/zuul/buildsets?pipeline=openstack-periodic-container-master-centos9) * [openstack-periodic-container-antelope-centos9](https://review.rdoproject.org/zuul/buildsets?pipeline=openstack-periodic-container-antelope-centos9&skip=0) --- ## Bugs - https://issues.redhat.com/browse/OSPCIX-40 - test_server_basic_ops tempest test failing on EDPM antelope Zuul job - https://issues.redhat.com/browse/OSPCIX-41 - OSP-18 EDPM image build is failing with cp: error writing No space left on device - https://issues.redhat.com/browse/OSP-27549 - EDPM/Baremetal Jobs fails randomly with no endpoints available for service "nmstate-webhook - https://issues.redhat.com/browse/OSPCIX-42 - The ssl_verify_client parameter is required when setting ssl_ca (file: /etc/puppet/modules/horizon/manifests/wsgi/apache.pp on cs9 wallaby - https://issues.redhat.com/browse/OSPCIX-45 - openstack-baremetal-operator-crc-podified-edpm-baremetal fails when /spec/roles/edpm-compute/baremetalSetTemplate/ctlplaneInterface is set to enp1s0 --- ## Promotion Status ### Upstream - Podified Master: 2023-08-17 (recovering from vexx) - Podified Antelope: 2023-08-24 - CentOS 9 Wallaby : 2023-08-15 (recovering from vexx) - CentOS 8 Wallaby: 2023-08-18 (recovering from vexx) - CentOS 8 Train: 2023-08-18 (no new hash - but should get new hash on Fri Aug 25) ### Downstream * Podified RHEL 9 RHOS 18.0: 2023-08-16 (blocked on ospcix-41) * RHEL 9 RHOS 17.1: 2023-08-20 (no new hash) * RHEL 9 RHOS 17.0: 2023-08-09 (bm_envB still failing) * RHEL 8 RHOS 17.1: 2023-08-17 (no new hash) * RHEL 8 RHOS 16.2: 2023-08-20 (no new hash) --- ## Aug 24 - RDO seems to recover from Vexx outage - re-kicked Wallaby lines: https://softwarefactory-project.io/zuul/t/rdoproject.org/status/change/272138c - (re-kick failed in 2 seconds with RETRY/RETRY_LIMIT...) - Re-kicked Train promotion, we should have new content (Networking at least) - https://review.rdoproject.org/r/c/testproject/+/46686 ## Aug 23 - RDO Zuul faces mirror issues - probably related to Vexx changes. - https://issues.redhat.com/browse/RHOSZUUL-1423 - envB issue for 17.0 on el9: apparently actual env issue, now all controllers are OK, but compute is flapping (timeout while waiting for it to boot) - https://issues.redhat.com/browse/OSPCIX-46 - IMHO we may indeed discard that job and promote as-is. IIRC envB already proved unstable in the past. (PR pushed downstream against tripleo-environment) ## Aug 22 - RDO Zull still hitting node_failures, retry and related issues. NOT re-kicking lines yet. ## Aug 21 - RHEL 9 RHOS 17.0 re-run failed job: https://code.engineering.redhat.com/gerrit/c/testproject/+/446068 - RDO ZUUL down: https://redhat-internal.slack.com/archives/C042U5CT6TH/p1692592842118939