ruck_rover
Important links for ruck rover's ruck/rover links to help Ruck Rover - Unified Sprint #36 (NovDec 2020 timeframe) Dates: Nov 11 - Jan 11
Tripleo CI team ruck|rover: marios, rlandy, wes OSP CI team ruck|rover:
Previous notes: https://hackmd.io/a_PhAlijTr-KTWHzGuf3qQ
RUCK / ROVER BUGS: https://docs.google.com/spreadsheets/d/1M1U-ekjEsec-bRjRq7q5rzjbJWKE2uT-ESX4SkeC0Uc/edit#gid=0
Please try this out instead of cutting pasting bugs. Give feedback to wes please
http://dashboard-ci.tripleo.org/d/HkOLImOMk/upstream-and-rdo-promotions?orgId=1
add dates in decending order so the latest date is at the top. Break out TripleO and OSP sections.
Tempest volume tests filures
Ara installation is failing
Destination directory /home/zuul/workspace/logs does not exists
Mirros are down
Tempest test [Victoria] FloatingIpTestCasesAdmin.test_two_vms_fips
Python deps error in content provider
FS037-updates stein job fails consistenly. https://bugs.launchpad.net/tripleo/+bug/1910399
Fix : https://review.opendev.org/c/openstack/tripleo-ci/+/769536
test project passed - https://review.rdoproject.org/r/#/c/31424/
https://bugs.launchpad.net/tripleo/+bug/1909658
periodic-tripleo-ci-centos-8-singlenode-featureset050-upgrades-ussuri is failing during undercloud install with message - error checking if pulling from registry is blocked: unable to parse the registries configuration (/etc/containers/registries.conf): mixing sysregistry v1/v2 is not supported
Fixed https://bugs.launchpad.net/tripleo/+bug/1909671
Centos 7 based jobs are failing on TASK [Enable container-tools and disable rhel-modules] with error: /bin/sh: line 1: dnf: command not found
Component pipeline https://bugs.launchpad.net/tripleo/+bug/1909574 "periodic-tripleo-ci-centos-8-scenario010-kvm-standalone-octavia-master/victoria/ussuri failing on tempest tests with SSHTimeout issue with error message "socket.timeout: timed out""
train promotion blocker
Fixed https://bugs.launchpad.net/tripleo/+bug/1909582 "Introspection failing in train branch with message: mistral.exceptions.InvalidActionException: Failed to find action [action_name=baremetal_introspection.get_status]\n', 'introspection_attempt': 0}" [Critical,Triaged] [High,Triaged]
master:
victoria:
Gate:
master:
https://bugs.launchpad.net/tripleo/+bug/1909105 ( ERROR openstack Stderr: 'level=debug msg="Pull Policy for pull [PullIfNewer]"\nerror building at STEP "RUN ln -s /usr/share/openstack-tripleo-common/healthcheck/swift-account failing on build-containers-ubi-8-push master )
https://bugs.launchpad.net/tripleo/+bug/1908976 ( ovsdbapp.backend.ovs_idl.idlutils.RowNotFound: Cannot find Logical_Router with name=neutron-<uuid> is failing on featureset001 and featureset035 master)
victoria:
train:
master:
https://bugs.launchpad.net/tripleo/+bug/1908976 ( ovsdbapp.backend.ovs_idl.idlutils.RowNotFound: Cannot find Logical_Router with name=neutron-<uuid> is failing on featureset001 and featureset035 master)
https://bugs.launchpad.net/tripleo/+bug/1907193 ( package podman-1.6.4-23.module_el8.3.0+566+4759265c.x86_64 is filtered out by modular filtering) - failing featureset050-upgrades-master, multinode-oooq-container-updates-master and featureset037-updates-master
https://bugs.launchpad.net/tripleo/+bug/1909008 ( tempest.lib.exceptions.PreconditionFailed: Precondition Failed on standalone-full-tempest-api-master)
victoria:
master:
periodic-tripleo-ci-centos-8-multinode-1ctlr-featureset037-updates-master and scenario000-multinode-oooq-container-updates-master is failing = https://bugs.launchpad.net/tripleo/+bug/1907193
Depsolve Error occured: \n Problem: cannot install both podman-1.6.4-23.module_el8.3.0+566+4759265c.x86_64 and podman-2.0.5-5.module_el8.3.0+512+b3b58dca.x86_64\n - package podman-catatonit-2.0.5-5.module_el8.3.0+512+b3b58dca.x86_64 requires podman = 2.0.5-5.module_el8.3.0+512+b3b58dca, but none of the providers can be installed
ovb-1ctlr_1comp-featureset002-master is failing
featureset035-master is failing because of some tempest test failure
master tempest failures .. start debugging from https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-master/43f38e4/logs/overcloud-controller-0/var/log/extra/errors.txt.txt.gz
Victoria:
pypy depsolv, ubi-container in check, centos 8.2 - 8.3, content-provider patches all contributing to failures upstream atm. We're through most issues atm. OVB in check is full red, but as I said looks to be coming back to green via periodic promotions.
* pypy - lower-constraints https://review.opendev.org/q/topic:%2522tripleo-lower%2522+(status:open+OR+status:merged)
* ubi-8 - container-builds e.g. https://review.opendev.org/c/openstack/tripleo-ci/+/766839
* c8-train -> ussuri upgrade jobs failing on pacemaker mismatch - requires promotion to fix or method to inject change to content-providers.
* content-provider ip failures https://bugs.launchpad.net/tripleo/+bug/1907657
* overcloud-full-hardened moved to nv until we promote all c8 branches https://bugs.launchpad.net/tripleo/+bug/1907457
master appears to be back and healthy enough for a promotion. fs01/035 starting to pass again. wes to make sure master is promoted soon.
c8-train is the most import and a problem. container-builds are failing and tripleo-common needs patches to help us diagnose. See https://review.rdoproject.org/r/#/c/31336/
need to land to fix ovb 3rd party https://review.opendev.org/c/openstack/paunch/+/766779 https://bugs.launchpad.net/tripleo/+bug/1907833
Need help landing https://review.opendev.org/c/openstack/tripleo-common/+/766516
Please review: https://review.opendev.org/c/openstack/tripleo-ci/+/766621
Please focus on unblocking: https://review.opendev.org/#/q/owner:"amolkahat+%253Camolkahat%2540gmail.com%253E"+branch:stable/ussuri
gate - release issue https://bugs.launchpad.net/tripleo/+bug/1907122 https://review.opendev.org/q/topic:"release-aware-gate"+(status:open OR status:merged)
Not sure if https://bugs.launchpad.net/tripleo/+bug/1907006 is related to the above issue or not
new bugs in https://docs.google.com/spreadsheets/d/1M1U-ekjEsec-bRjRq7q5rzjbJWKE2uT-ESX4SkeC0Uc/edit#gid=0
https://review.opendev.org/c/openstack/openstack-ansible-os_tempest/+/764055 Fix stackviz for failed tempest runs [WIP] [NEW]
periodic-tripleo-ci-centos-8-standalone-full-tempest-scenario-compute-master
https://bugs.launchpad.net/tripleo/+bug/1905348 has a fix https://review.opendev.org/c/openstack/python-tripleoclient/+/763919
https://bugs.launchpad.net/tripleo/+bug/1905421 - Tempest error
https://bugs.launchpad.net/tripleo/+bug/1905418 - Tempest error
Error in train jobs due python 2.7 and pymod2pkg
introspection bug 1904936 fix
(details below…)
https://bugs.launchpad.net/tripleo/+bug/1905036 Scenario10 octavia jobs failing deploy stack creation with "Error Referenced Attribute (OctaviaBase update_tasks) is incorrect 15:40 < beagles> rlandy|rover: marios|ruck https://review.opendev.org/#/c/763561/
https://bugs.launchpad.net/tripleo/+bug/1903581 tripleo-ci-centos-8-scenario000-multinode-oooq-container-updates is pulling containers from docker.io
https://bugs.launchpad.net/tripleo/+bug/1905034 repo-setup fail: undercloud upgrade for ussuri fails on package dep python3-openstackclient >= 5.2.0 waiting to merge: https://review.opendev.org/c/openstack/tripleo-quickstart/+/763633
15:40 < beagles> rlandy|rover: marios|ruck https://review.opendev.org/#/c/763561/
09:32 < chandankumar> marios: sshnaidm I am tracking rdocloud nodeset removal here https://etherpad.opendev.org/p/remove-rdocloud-jobs
12:07 < ykarel> marios|ruck, fyi ^ in case u see something wron gin ussuri jobs, let us know 12:07 < ykarel> we tested multiple jobs and it went good, but just in case if there is some issue
Should have been fixed by: https://review.opendev.org/#/c/762871/
RUCK / ROVER BUGS: https://docs.google.com/spreadsheets/d/1M1U-ekjEsec-bRjRq7q5rzjbJWKE2uT-ESX4SkeC0Uc/edit#gid=0
Please try this out instead of cutting pasting bugs. Give feedback to wes please
blocks victoria gate fix at https://review.opendev.org/#/c/763005/ testing with https://review.rdoproject.org/r/31112
blocks victoria gate fix at https://review.opendev.org/#/c/763005/ testing with https://review.rdoproject.org/r/31112
reopened bug
fix at https://review.rdoproject.org/r/#/c/31114/
tested at https://review.rdoproject.org/r/#/c/25325/
@marios pls confirm that the upgrades/updates jobs are ok with the one playbook for multinode.
https://7085ed736f9157d6d701-223d8b88d73ea59070ac36b627fdc3bc.ssl.cf1.rackcdn.com/762658/1/check/tripleo-ci-centos-8-undercloud-upgrade-victoria/ad84735/logs/undercloud/home/zuul/undercloud_upgrade.log tripleo_common.image.exception.ImageNotFoundException: Not found image: http://23.253.57.32:5001/v2/tripleovictoria/openstack-keystone/manifests/7f974e10d7184d5fc45445a3073333bd"], "stdout": "", "stdout_lines": []}
https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-1ctlr_2comp-featureset020-master/835c3e6/logs/undercloud/var/log/extra/baremetal_list.txt.gz https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-master/b99e7f1/job-output.txt https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-master/2726291/job-output.txt https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-1ctlr_2comp-featureset020-master/970a3ea/job-output.txt https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-1ctlr_1comp-featureset002-master/72f523b/job-output.txt
Related fixes:
https://review.opendev.org/#/c/763097/ Make victoria upgrades job non-voting until fixed
~~https://review.rdoproject.org/r/31120 Add undercloud fs050 upgrades to master/v/u periodic ~~
@marios : your thoughts here?
where though? are you referring to upstream undercloud upgrade job fixed by https://review.opendev.org/#/c/763005/ ? or periodic job? can't be periodic since you're adding it with https://review.rdoproject.org/r/31120
shoudl be fixed by https://review.opendev.org/#/c/762935 (will need cherry-pick to train, stein, rocky) Related fix on removing rt-repo: https://review.opendev.org/#/c/762926/
https://review.opendev.org/#/c/761892/ (it is the fix for another bug https://bugs.launchpad.net/tripleo/+bug/1903498)
https://review.opendev.org/#/c/761892/ (it is the fix for another bug https://bugs.launchpad.net/tripleo/+bug/1903498)
Additional fix: https://review.opendev.org/#/c/762871/
.
Last promotion 10th Nov Last buildset - https://review.rdoproject.org/zuul/buildset/f8e8812c49c14d2c8915af307d85731c
victoria promotions blocked on https://bugs.launchpad.net/tripleo/+bug/1903508/comments/6
periodic-tripleo-ci-centos-8-standalone-on-multinode-ipa-victoria(In criteria) & periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp_1supp-featureset039-victoria, Bug: https://bugs.launchpad.net/tripleo/+bug/1903508 Cixed to security team, Provided reproducer to martin today, Patch is up - https://review.opendev.org/#/c/762497
Master/Ussuri/Victoria promotion blocker
https://bugs.launchpad.net/tripleo/+bug/1900949
Ovb jobs are failing on master branch, imported nodes are not transitioning to manageable state with ‘Error: Unable to establish IPMI v2 / RMCP+ session\n’
Last promotion 11/13
master scenario2 standalone failures https://zuul.openstack.org/builds?job_name=tripleo-ci-centos-8-scenario002-standalone&branch=master * blocks https://review.opendev.org/#/c/762497/4
c8 should promote again today https://review.rdoproject.org/zuul/buildset/1f410357edb34516ab3ce4cadffa491f
c7 is in trouble: (rlandy investigating) https://bugs.launchpad.net/tripleo/+bug/1904214 - centos-7 train jobs are failing on dependency resolution: glibc-common = 2.17-307.el7.1 and python3-libs(x86-64) = 3.6.8-13.el7
<amoralej> yes, 7.9.20009 was pushed yesterday to mirrors
<rlandy|rover> that's probably it
<amoralej> let me dig into it
<jpena> amoralej: that server mounts opendev's AFS, just like the rest of the mirrors we have in opendev <jpena> looking at https://grafana.opendev.org/d/HxrNXt2Gk/afs?orgId=1, it looks like the mirror was synced 1 hour ago <jpena> we might check where does it sync from, maybe that one seed does not have 7.9 yet <jpena> so, what I see in https://opendev.org/opendev/system-config/src/branch/master/playbooks/roles/mirror-update/files/centos-mirror-update#L51 is that the opendev AFS mirrors sync from mirror.lstn.net/centos <jpena> looking at http://mirror.lstn.net/centos/7/os/x86_64/Packages/?C=M;O=D, it looks like they don't have the 7.9 content yet
Blocking Sagi's patch will solve the issue https://review.opendev.org/#/c/761892/ for https://bugs.launchpad.net/tripleo/+bug/1903498
2020-11-13 12:22:22.839674 | primary | Error: Package: glibc-2.17-307.el7.1.i686 (base)
2020-11-13 12:22:22.839814 | primary | Requires: glibc-common = 2.17-307.el7.1
2020-11-13 12:22:22.839848 | primary | Installed: glibc-common-2.17-317.el7.x86_64 (@base)
2020-11-13 12:22:22.839870 | primary | glibc-common = 2.17-317.el7
2020-11-13 12:22:22.839898 | primary | Available: glibc-common-2.17-307.el7.1.x86_64 (base)
2020-11-13 12:22:22.839920 | primary | glibc-common = 2.17-307.el7.1
2020-11-13 12:22:22.839943 | primary | Error: Package: python3-devel-3.6.8-13.el7.x86_64 (base)
2020-11-13 12:22:22.839989 | primary | Requires: python3-libs(x86-64) = 3.6.8-13.el7
2020-11-13 12:22:22.840015 | primary | Installed: python3-libs-3.6.8-17.el7.x86_64 (@base)
2020-11-13 12:22:22.840037 | primary | python3-libs(x86-64) = 3.6.8-17.el7
2020-11-13 12:22:22.840062 | primary | Available: python3-libs-3.6.8-13.el7.x86_64 (base)
2020-11-13 12:22:22.840083 | primary | python3-libs(x86-64) = 3.6.8-13.el7
2020-11-13 12:22:22.840125 | primary | You could try using --skip-broken to work around the problem
2020-11-13 12:22:23.195322 | primary | You could try running: rpm -Va --nofiles --nodigest
* rekicked on vexx https://review.rdoproject.org/r/#/c/29969/16/.zuul.yaml
* blocked on https://bugs.launchpad.net/tripleo/+bug/1904214
* promoted yesterday - leaving for weekend run
* testproject there (rdo) https://review.rdoproject.org/r/30906 per https://bugs.launchpad.net/tripleo/+bug/1903996/comments/1 but failing so try vexx
* testproject there (vexx) https://review.rdoproject.org/r/#/c/25325/
* blocked on https://bugs.launchpad.net/tripleo/+bug/1904214
<amoralej> we are updating ovs/ovn 2.13 for master and victoria today. We've gated it with different jobs so i expect to go smooth but let us know if you find any issue
[Blocking check jobs]tripleo-ci-centos-8-scenario000-multinode-oooq-container-updates-train job is failing on TASK [undercloud-deploy : Write containers-prepare-parameter.yaml] with error 'dict object' has no attribute 'registry_ip_address_branch' https://bugs.launchpad.net/tripleo/+bug/1903980
[Promotion blocker for c7 train/stein/rocky]Intermittently some jobs are timing out while gathering facts on different tasks : [tripleo-inventory : Ensure gather_facts has been run against localhost] or [validate-undercloud : gather facts used by role] https://bugs.launchpad.net/tripleo/+bug/1903961
"Intemittenly tripleo-ci-centos-8-standalone-upgrade-ussuri timeouts while running tempest tests." https://bugs.launchpad.net/tripleo/+bug/1903993
Queens periodic jobs are failing on tempest test: tempest.scenario.test_network_basic_ops.TestNetworkBasicOps with testtools.matchers._impl.MismatchError: 'ACTIVE' != u'DOWN': - FloatingIP: is at status: DOWN. failed to reach status: ACTIVE https://bugs.launchpad.net/tripleo/+bug/1903996
Ran rocky /stein promotion … https://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-integration-stable4-5
stein should promote. rocky failed container builds again. need to look at increasing the timeout there