Ruck Rover - 2022-06-10 - 2022-06-16

tags: ruck_rover
Previous RR notes: https://hackmd.io/HMjD8MouQXmGWMJP5o76JQ

Cockpit

Downstream cockpit

Handover Notes

Active Bugs

Ceph

NotCeph

CIXes to close or revisit

Jobs to actively monitor

  • kvm internal, fs035, fs01, fs039

Promotion Status

  • Master :- 15 june
  • wallaby c9:- 13 june
  • wallaby c8:- 13 june
  • train c8:- 13 june

Downstream

  • RHOS-17 RHEL-9 - 13th
  • RHOS-17 RHEL-8 - 15th
  • RHOS-16 RHEL-8 - 14th

Active Bugs

  • Check handover bugs list

16 June

component pipelines

Ignore everything below this line

15 June

Active Bugs

cs9 wallaby tempest component

component pipelines

CIX cards we can clear today

14 June

components pipeline

13 June

Promotions

  • wallaby c9 promoted
  • wallaby c8 should promote with current testproject run
  • rhos-17 on rhel 9 should promote with current testproject run
  • working on promoting wallaby c9 components (promoted so far: security, validation, baremetal, network) tempest and tripleo are in rerun

patches to clear fs02//fs01 / image build

Tempest patches

CS9 Master:

Active Bugs

CS9 Wallaby / Master

CS9 Master

CS9 Wallaby

CS8 Wallaby

CS8 Train

RHEL 9 RHOS17

RHEL 8 RHOS17

Other

13 June

CS9 Master:

Fri, Jun 10

Promotions

CS9 Wallaby / Master

CS8 Wallaby

RHEL-8 RHOS-16.2

Thu Jun 09

Promotions

CS9 Wallaby / Master

13 June

https://bugs.launchpad.net/tripleo/+bug/1978298

  • reckick the job

Transient Bugs

fs039 - series of unknown issues, difficult to triage 1

They all seem to be unrelated, but they're causing havoc to the line.

"Can't run container"

2 2022-06-09 22:58:35 | 2022-06-09 22:58:35.006643 | | WARNING | ERROR: Can't run container nova_api_ensure_default_cells [] 2022-06-09 22:58:35 | 2022-06-09 22:58:35.010171 | | WARNING | ERROR: Can't run container placement_api_db_sync []

Internal Server Error Keystone

3 2022-06-10 15:08:13 | 2022-06-10 15:08:13.341227 | fa163e2d-7640-c8bc-5072-00000000a20c | TIMING | tripleo_keystone_resources : Create identity service | undercloud | 0:29:54.730355 | 1.67s 2022-06-10 15:08:13 | 2022-06-10 15:08:13.352842 | fa163e2d-7640-c8bc-5072-00000000a20d | TASK | Create identity public endpoint 2022-06-10 15:08:17 | An exception occurred during task execution. To see the full traceback, use -vvv. The error was: keystoneauth1.exceptions.http.InternalServerError: Internal Server Error (HTTP 500)

4 [Fri Jun 10 15:08:17.218323 2022] [wsgi:error] [pid 17:tid 38] [remote 172.17.0.184:33542] mod_wsgi (pid=17): Exception occurred processing WSGI script '/var/www/cgi-bin/keystone/keystone'.

SSH Permission denied

5 2022-06-10 18:05:50 | 2022-06-10 18:05:50.412897 | fa163e10-0939-a651-5d12-000000001759 | FATAL | Run tripleo_os_net_config_module with network_config | overcloud-controller-2 | error={"msg": "Data could not be sent to remote host "192.168.24.30". Make sure this host can be reached over ssh: Warning: Permanently added '192.168.24.30' (ED25519) to the list of known hosts.\r\nheat-admin@192.168.24.30: Permission denied (publickey,gssapi-keyex,gssapi-with-mic,keyboard-interactive).\r\n"}

Cannot download ansible-macros

6 2022-06-09 15:03:03 | Error: Error downloading packages: 2022-06-09 15:03:03 | ansible-macros-2021.1.2-2.el9s.noarch: Cannot download, all mirrors were already tried without success

Failed to download packages: mod_lua-2.4.51-8

7 2022-06-06 22:01:03.474740 | fa163ec0-bb0e-7746-1f99-000000000cb2 | FATAL | ensure apache is installed | undercloud | error={"changed": false, "msg": "Failed to download packages: mod_lua-2.4.51-8.el9.x86_64: Cannot download, all mirrors were already tried without success", "results": []}

overcloud-2 didn't start

8 One of 3 overcloud nodes didn't start.

Select a repo