# Ruck and Rover notes #40
###### tags: `ruck_rover`
:::info
Important links for ruck/rover: [ruck/rover links to help](https://hackmd.io/07z0xroHTFi2IbX93P5ZfQ)
**Ruck Rover - Unified Sprint #<fix>**
Dates: Feb 4 - Feb 25
TripleO CI team ruck|rover: arxcruz, ysandeep
OSP CI team ruck|rover: <fix>Names</fix>
Previous notes: [link](https://hackmd.io/_XvcCzQlQ1-A9ygYMlmvKA)
:::
[TOC]
### Issues to track on-going
put these issues in the spoiler.
:::danger
#### tripleo
check/Gate:
Stein branch check/Gate jobs are failing because of missing container images, Error - ImageNotFoundException
https://bugs.launchpad.net/tripleo/+bug/1915921
promotions:
**Master: 17th Feb** (Yellow)
Jobs are green. We haven't had a promotion recently because the promoter was stopped earlier, and we have also stopped triggering the master line for the c8 stream work: https://review.rdoproject.org/r/#/c/32064/
* Last run from 22nd: https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-integration-main
sc01/02 only failed once, passed in testproject
We had a bug for fs39 on master; it is fixed now, but the fix still needs to hit the integration line.
* [Master Promotion blocker] Heat stack creation failing for featureset039 in Master branch with "ResourceInError: resources.NovaCompute: Went to status ERROR due to "Message: MessagingTimeout, Code: 500""
https://bugs.launchpad.net/tripleo/+bug/1916445
* Fix merged https://review.opendev.org/c/openstack/tripleo-common/+/776870
We also need to talk with security dfg about fs039 - need to drop/migrate this job
**Victoria - Green - last promoted yesterday 23rd Feb**
* Launchpad bug 1916742 in tripleo "Victoria Container build job is failing, buildah deprecations warning is added as a first item in the list instead of container name" [Critical,Triaged]
https://bugs.launchpad.net/tripleo/+bug/1916742
**Ussuri - Green - last promoted yesterday 23rd Feb**
**c8 train - Yellow - Last promoted on 17th Feb, 2021**
Waiting for the promoter server to be ready for c8 train; the last run was completely green: https://review.rdoproject.org/zuul/buildset/0c074bf21d4e463d82d51e42baf7400b
**c7 train - Red - 23rd Jan**
[CIX][LP:1915519][tripleoci][proa][Train][CentOS7][scenario004] Failing with Error: 'ip-192.168.24.3' already exists. Too many tries
https://bugs.launchpad.net/tripleo/+bug/1915519
* Stein - Green - Promoted on 24th Feb
* Rocky - Red
periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset010-rocky is failing with NeutronError: "Invalid input for operation: segmentation_id requires physical_network for VLAN provider network"
https://bugs.launchpad.net/tripleo/+bug/1916695
* Queens - Promoted on 11th Feb
:::
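
The Zuul build listings linked in these notes are ordinary query-string filters on the builds page (`pipeline=...`, `result=...`). A minimal Python sketch for composing such links follows. The helper name `zuul_builds_url` is hypothetical, and only the two filter keys that already appear in these notes are assumed to work.

```python
from urllib.parse import urlencode

# Base URL of the RDO Zuul builds page, as linked in these notes.
ZUUL_BUILDS = "https://review.rdoproject.org/zuul/builds"


def zuul_builds_url(**filters: str) -> str:
    """Return a builds-page link filtered by the given query parameters.

    Hypothetical helper: only `pipeline` and `result` are taken from the
    links in these notes; other filter names are not verified here.
    """
    if not filters:
        return ZUUL_BUILDS
    return f"{ZUUL_BUILDS}?{urlencode(filters)}"


print(zuul_builds_url(pipeline="openstack-periodic-integration-main"))
print(zuul_builds_url(result="NODE_FAILURE"))
```

Keeping the base URL in one constant means a link for another pipeline only needs a different `pipeline` value.
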
:::info
Add dates in descending order so the latest date is at the top. Break out TripleO and OSP sections.
:::
## Feb 25th
### Tripleo
## Feb 24th
### Tripleo
* Launchpad bug 1916742 in tripleo "Victoria Container build job is failing, buildah deprecations warning is added as a first item in the list instead of container name" [Critical,Triaged]
https://bugs.launchpad.net/tripleo/+bug/1916742
* https://review.opendev.org/c/openstack/tripleo-ci/+/777378
* Rocky - periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset010-rocky is failing with NeutronError: "Invalid input for operation: segmentation_id requires physical_network for VLAN provider network"
https://bugs.launchpad.net/tripleo/+bug/1916695
* Upgrade jobs looking for wrong hash
https://bugs.launchpad.net/tripleo/+bug/1916689
Promotions:
Victoria: the jobs below failed in the last run; monitor today's run
~~~
periodic-tripleo-ci-centos-8-scenario010-standalone-victoria
periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-victoria
periodic-tripleo-ci-centos-8-ovb-1ctlr_2comp-featureset020-victoria
~~~
Ussuri: the job below failed yesterday; cleared in today's run
~~~
periodic-tripleo-ci-centos-8-standalone-on-multinode-ipa-ussuri
~~~
## Feb 23rd
### Tripleo
* [Fixed] Promotion blocker - https://bugs.launchpad.net/tripleo/+bug/1916561
Launchpad bug 1916561 in tripleo "tripleo-ci-centos-7-containers-multinode does not build with patches" [High,Triaged]
* https://review.opendev.org/c/openstack/tripleo-quickstart/+/777051
## Feb 22nd
### Tripleo
* [Promotion blocker] Heat stack creation failing for featureset039 in Master branch with "ResourceInError: resources.NovaCompute: Went to status ERROR due to "Message: MessagingTimeout, Code: 500""
https://bugs.launchpad.net/tripleo/+bug/1916445
* https://review.opendev.org/c/openstack/tripleo-common/+/776870 will fix job.
* We also need to talk with security dfg about fs039 - need to drop/migrate this job
* tripleo-ci-centos-7-containers-multinode-queens failing on TASK [tripleo-bootstrap : Set 'dns=none' in /etc/NetworkManager/NetworkManager.conf] with "error while evaluating conditional (ansible_facts.services['NetworkManager.service']['status'] == 'enabled'): 'dict object' has no attribute 'status'"
https://bugs.launchpad.net/tripleo/+bug/1916459
https://review.opendev.org/c/openstack/tripleo-common/+/776925 fix merged today
* periodic-tripleo-ci-centos-8-scenario010-standalone-master failed twice on tempest - ran successfully with testproject
* periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-victoria failed in last run - will monitor today's run
* periodic-tripleo-ci-centos-8-standalone-on-multinode-ipa-ussuri failed twice - rerunning via testproject https://review.rdoproject.org/r/#/c/28446/
## Feb 19th
### Tripleo
* Promotions:-
* Master - Green
* Victoria - periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-victoria failed yesterday, will monitor today's run
* Ussuri - periodic-tripleo-ci-centos-8-standalone-full-tempest-api-ussuri & periodic-tripleo-ci-centos-8-scenario002-standalone-ussuri failed in today's run, Cleared in rerun: https://review.rdoproject.org/r/#/c/28446
* c8 train: All green today
## Feb 18th
### Tripleo
* Promotions:-
* Master - Good
* Victoria: periodic-tripleo-centos-8-buildimage-overcloud-full-victoria failed, other ovb jobs skipped; failed job rerun with https://review.rdoproject.org/r/#/c/28446/, awaiting results
* Ussuri - periodic-tripleo-ci-centos-8-standalone-ussuri failed in last run, cleared in rerun https://review.rdoproject.org/r/#/c/31612/
* c8 - Train: periodic-tripleo-ci-centos-8-multinode-1ctlr-featureset010-train - Failed in tempest, Next scheduled run is ongoing
## Feb 17th
### Tripleo
* [Fixed][Promotion blocker]Launchpad bug 1915932 in tripleo "Centos7 Stein Jobs failed with error - sudo: pip3: command not found" [Critical,Triaged]
https://bugs.launchpad.net/tripleo/+bug/1915932
* https://review.opendev.org/c/openstack/tripleo-quickstart/+/776391
* [Promotion blocker]Launchpad bug 1915921 in tripleo "Stein branch check/Gate jobs are failing because of missing container images, Error - ImageNotFoundException" [Critical,Triaged]
https://bugs.launchpad.net/tripleo/+bug/1915921
* Good news: all c8 lines promoted today
## Feb 16th
### Tripleo
* Victoria branch Scenario10 log collection is not collecting all the directories and files.
https://bugs.launchpad.net/tripleo/+bug/1915778
* https://review.opendev.org/c/openstack/tripleo-quickstart/+/771593
## Feb 15th
### Tripleo
* Good news - master/v/u/c8 train all promoted yesterday
## Feb 12th
### Tripleo
* [Fixed] Issue with upstream mirror - error: Status code: 403
https://bugs.launchpad.net/tripleo/+bug/1915487
* Workaround:-
Infra team added the AFS servers afsdb01 and afsdb02 to the emergency disable list and added back the missing public UDP ports in the firewall rules.
Permanent solution:-
https://review.opendev.org/c/opendev/system-config/+/775311
## Feb 11th
### TripleO
* Promotion status
* Master - 10th Feb - Green
* Victoria - 05th Feb (06 days old now) - Red
* Ussuri - 05th Feb (06 days old now) - Red
* C8 train - 07th Feb (04 days old now) - Yellow
**Let's wait for today's run**
* Seeing some node failures in rdo - https://review.rdoproject.org/zuul/builds?result=NODE_FAILURE
* Pinged on the internal #rhos-ops channel; dpawlik updated the ticket with Vexxhost
* NB: planned outage at Vexxhost today http://lists.openstack.org/pipermail/openstack-discuss/2021-February/020347.html
* Infra issue with some ovb jobs:-
https://logserver.rdoproject.org/openstack-periodic-integration-stable2/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-1ctlr_2comp-featureset020-ussuri/aa5e30e/job-output.txt
~~~
2021-02-11 05:58:08.405522 | TASK [ovb-manage : Attach instance to public OVB network]
2021-02-11 05:58:28.797485 | primary | Failed to attach network adapter device to bcb82723-5001-42fd-95fa-6ccde37417a2 (HTTP 500) (Request-ID: req-095b7c67-e1b1-479a-882e-875a7e3b88e1)
~~~
* Reported to #rhos-ops
* Check/Gate patches failures
* https://review.opendev.org/c/openstack/tripleo-heat-templates/+/774779/ - Ussuri upgrade job failed. Looks like a network failure between the node and the content provider job - posted recheck
* https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/774892 - 500 Server Error: Internal Server Error for url: https://opendev.org/openstack/requirements/raw/branch/stable/ussuri/upper-constraints.txt - posted recheck
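
The "days old" figures in the promotion status list above are the difference between the date of the note and the last promotion date. A minimal Python sketch of that arithmetic follows, using the Feb 11th entries. The year 2021 is taken from dates elsewhere in these notes, and the function and variable names are illustrative only.

```python
from datetime import date


def promotion_age_days(last_promoted: date, today: date) -> int:
    """Number of days since the last promotion (illustrative helper)."""
    return (today - last_promoted).days


# Last promotion dates as recorded in the Feb 11th entry.
today = date(2021, 2, 11)
last_promoted = {
    "Master": date(2021, 2, 10),
    "Victoria": date(2021, 2, 5),
    "Ussuri": date(2021, 2, 5),
    "C8 train": date(2021, 2, 7),
}

for release, promoted in last_promoted.items():
    print(f"{release}: {promotion_age_days(promoted, today)} days old")
```
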
## Feb 10th
### TripleO
* Promotion - Master
* periodic-tripleo-ci-centos-8-ovb-1ctlr_2comp-featureset020-master and periodic-tripleo-ci-centos-8-standalone-full-tempest-api-master failed on tempest - Will keep an eye on next scheduled run.
* [Fixed] Promotion blocker [Train] [CentOS7] Undercloud jobs puppet task certmonger_certificate[haproxy-external-cert] fails with Unrecognized parameter or wrong value type
https://bugs.launchpad.net/tripleo/+bug/1915242
## Feb 09th
### TripleO
* [Fixed][promotion-blocker] A number of jobs fail with RETRY_LIMIT, No module named 'setuptools_rust'
https://bugs.launchpad.net/tripleo/+bug/1915101
Patch
https://review.opendev.org/c/openstack/requirements/+/774593
We need new pip:-
https://review.opendev.org/c/openstack/tripleo-ci/+/774603
## Feb 08th
### TripleO
* Good news
- **Master promoted - 08th Feb**
* sc-10 still timing out, removed from criteria (I think Wes removed it)
* https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/774419 - Skipping some tests to avoid timeout
- **Train promoted - 07th Feb**
* periodic-tripleo-ci-centos-8-scenario004-standalone-master failed in last periodic run, Cleared in testproject rerun
* periodic-tripleo-ci-centos-8-scenario004-standalone-master https://review.rdoproject.org/zuul/build/1735f67f22014c35afec550efc257772 : SUCCESS in 1h 29m 45s
* [Fixed][Promotion blocker]Victoria/Ussuri fs020 failed with Error: The --deployed-server cannot be used without the --disable-validations https://bugs.launchpad.net/tripleo/+bug/1914982
* https://review.opendev.org/c/openstack/tripleo-quickstart/+/774416
* Testing here - https://review.rdoproject.org/r/#/c/28446/
* [Not observed again]Again observing issues with limestone mirror - https://bugs.launchpad.net/tripleo/+bug/1914585
* Chatting with #opendev infra
* [Fixed] Jobs fail on Stein and older branches with ERROR! the role 'tripleo_validations' was not found (https://bugs.launchpad.net/tripleo/+bug/1914993)
* Patch - https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/774445
## Feb 05th
### TripleO
* [Fixed]Promotion blocker [Master featureset020/fs039 failed while trying to upload image, HttpException: 500: Server Error for url: https://192.168.24.2:13292/v2/images/b898eee8-66c3-4a27-bbad-e6ca19d1f8d8/file, Internal Server Error](https://bugs.launchpad.net/tripleo/+bug/1914735)
* Patches are up for fs39, testproject is running for fs020
* periodic-tripleo-ci-centos-8-scenario010-standalone-network-master is timing out
* We need network component promotion for new python-ovn-octavia-provider patch - https://review.opendev.org/c/openstack/ovn-octavia-provider/+/771889/
* Context: Brian comment on https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/774079
* periodic-tripleo-ci-centos-8-scenario010-ovn-provider-standalone-ussuri/victoria failed on tempest
* Victoria(No logs)
* ussuri failed on TrafficOperationsScenarioTest
https://logserver.rdoproject.org/openstack-periodic-integration-stable2/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario010-ovn-provider-standalone-ussuri/9773373/logs/undercloud/var/log/tempest/stestr_results.html.gz
## Feb 04th
### TripleO
Check:-
* [Bug#1914585 Content provider jobs failed after failing to connect to mirrors, Failed to connect to mirror.regionone.limestone.opendev.org port 8080: No route to host](https://bugs.launchpad.net/tripleo/+bug/1914585)
* Pinged on #opendev for someone from infra to look
* [Bug#1914600 "periodic-tripleo-ci-centos-8-scenario010-ovn-provider-standalone-master is failing on tempest test with Exception: Server 192.168.24.122 on port 60092/8085 did not begin passing traffic within the timeout period."](https://bugs.launchpad.net/tripleo/+bug/1914600)
* Failing tempest test was not running on last green run, this test was removed from skiplist recently
* https://opendev.org/openstack/openstack-tempest-skiplist/commit/ecb8c966af1fbc544fd69d3b7b185ea807713a91
* https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/774079 - we added the test back in skiplist
Promotions:-
* Master promotion line - Fs001/35 failed on tempest
* Cleared with testproject rerun https://review.rdoproject.org/r/#/c/28458/
~~~
periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-master https://review.rdoproject.org/zuul/build/3903c34aa7cf425ba4b59153d6f5ba9c : SUCCESS in 3h 38m 04s
periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-master https://review.rdoproject.org/zuul/build/23bb024a00664f3cb50bfdd86f61af0f : SUCCESS in 3h 52m 02s
~~~
* Ussuri promotion line - periodic-tripleo-ci-centos-8-standalone-on-multinode-ipa-ussuri failed
* Rerunning with testproject - https://review.rdoproject.org/r/#/c/23626/
* Train promotion line - Some jobs were skipped due to node_failure on 1 job
* Rerunning with testproject - https://review.rdoproject.org/r/#/c/28537/
### OSP