Deployers Team Meeting
Triage Query:
https://github.com/pulp/pulp-oci-images/issues?q=is%3Aissue+is%3Aopen+label%3ATriage-Needed
https://github.com/pulp/pulp-operator/issues?q=is%3Aissue+is%3Aopen+label%3ATriage-Needed
Oct 22
- Mike to focus on documenting / polishing things over the next week or 2
- Showed off Konflux CI 1 2 3
Oct 1
- More CI improvements planned
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
Sep 24
- Working on more CI improvements
- Removing galaxy images
- Only ansible core uses it in their CI, they use a specific tag: 4.7.1
- Adding the UI to the Pulp (s6) images
- Discussed user operator PR to make some secret fields no longer immutable
Sep 17
Jul 30th
Jul 16th
Jul 9th
Jul 2nd
- FYI: Konflux CI opens up new possiblitiies
- Haven't looked into how much RAM, etc is available
- Demo on Konflux CI
Jun 25th
- On the services, will use virtualenv to avoid /usr/lib being modified.
- Might fix multiple python versions
June 18th
- Python versions configurable: Reached out to Colin Walters, agreed to talk to PyGObject devs
- pulp-oci-images links: Was advised by Pedro Possoa to use absolute links. Also can update relative links. I Cannot fix it with per-repo config.
June 11th
- See June 4th item about pulp-operator GA
- We agreed to release, but there's some final changes/checks 1st
- Python versions other than el9 defaul (3.9) not meeting pulp-rpm
- pulp-services will use 3.9
- Mike will reach out to Colin Walters (ostree) or PyGObject about this
- libmodulemd definitely not needed by services, will remove also in pulp-oci-images (pulp-rpm 3.19 stopped using it)
June 4th
- One potential Pulp user asked about pulp operator going GA in the comment section: https://www.youtube.com/watch?v=LpgcdBd9xZI
- We'll follow up with hyagi when he gets back
- docs fix needs review
- CI failing for plugins due to CentOS 8 Stream ("pulp_ci_centos" image) going EOL
- Submitted PR, and backports, to use CentOS Vault in the base image that it is based on
Apr 23
- No agenda items, just did triage
Apr 16
- No agenda items, just did triage
Apr 9
- No agenda items, just did triage
Apr 2
- Fix for configurable py version
- Need to create 3.49 branch
Mar 26
Mar 19
Mar 12
- CI Change / Python versions PR
- galaxy-operator now builds their own smaller images instead of using ours
- TODO: Gerrod to find out if we can drop building galaxy images (altogether or just branches)
Mar 5, 2024
- still need to write the blog post
- In the middle of reviewing the big CI change / Python versions PR
Feb 27, 2024
- still need to write the blog post
Feb 20, 2024
- Outline for blog post on EL9 upgrade
- Why we did it
- Newer python versions
- Other benefits?
- We're developing upstream - latest and greatest
- Python 3.8 EOL
- image naming
- image tags for python versions
- plugin-template updated
- remaining work:
- Fix NSS output issue that breaks oci_env (Daniel working on this FYI)
- EL8 images again
- Make multiple EL9 python versions available
- Old branches for pulpcore releases have not been created for the past ~20 releases
Feb 13, 2024
- S6 upgrade seems to have fixed the issue with the fix-perms feature.
- After fixing the postgres version detection properly, and some minor changes to the commits, we merged the EL9 upgrade!
- Remaining work
- EL8 builds also (Python 3.8)
- Multiple EL9 python versions
- Need to show off how I created the repos on dockerhub & quay
Feb 7, 2024
- Fixed 2 Ci issues
- Pulp images are now publishing
- Helping Daniel with EL9 upgrade and the logic for fixing the permissions on the mounted folder
- Upgraded S6 (worth seeing if it fixes it)
- Design for python versions vs os versions
- We previously agreed on tags
- We previously agreed to create a new image, pulp_ci_centos9
- Old el8 pulp_ci_centos image will be kept around for both the old image branches and the old plugin template
- Python 3.8 is only on el8
- Python 3.9 is on both el8 and el9
- Python 3.11 is only on el9 (el9 needed for other reasons too)
Jan 30, 2024
- pulp_installer bug reports and PRs
- Agreed: Close them
- Agreed: Archive the repository
- User is experiencing a problem with pulp-worker pods getting restarted when the DB connection drops
- The behavior is different from pulp-api pods
- Changing the behavior of pulp-worker would not fix the underlying problem that the DB is becoming unavailable
Jan 23, 2024
- I merged a feature request to pulp-oci-images
Jan 16, 2024
Jan 10, 2024
- User requests for docker-compose with https
- Point them to nginx container docs
- Docs issue with navigation missing for a few pages
Dec 19, 2023
- 1 PR to review:
- We will not need to take turns to release the operator anymore because now we have a script to make the process easier
Dec 12, 2023
- Multi-arch builds
- Seems to be reliable now!
- Mike will probably test on a Raspberry Pi 4 soon (forget if 2GB or 4GB)
- Upgrading image to EL9 / python 3.9
- We will check that no downstream products use python 3.8
Dec 5, 2023
- Status of multi-arch builds
- podman-compose up issue
- [dkliban] will try to reproduce again
- About the operator release cycle
Nov 28, 2023
- [mikedep333] 2 things to review for pulp-operator
- Status of multi-arch builds
- [decko] had to revert this last week
- mikedep333 to research multi-arch registries and review PR inbetween services work
Nov 14, 2023
- [ttereshc] ansible branch for pulp-operator can be removed
- a discourse post would be nice to have
- [hyagi] We can replace it with a tag
- [mikedep333] This will preserve git history
- [mikedep333] I recommend renaming the branch to "deprecated-ansible" or "ansible-deprecated"
- Agreed: Renamed branch or tag
- [hyagi] ansible team is planning to use the galaxy-ng and galaxy-ui images in galaxy-operator, so, if they manage to make it work with these images, we can stop maintaining galaxy-web and galaxy-minimal
- Agreed: Have discussions in #pulp-deployments matrix channel or in the #pulp slack channel
- [dkliban] services team is putting on hold the request for the "watch all namespaces" feature
- Looking into using pulp resources directly managed by clowder framework rather than clowder calling pulp-operator
- [Ina] This would preclude the Open Operations Initiative
- [dkliban] These services can be open, the example of console.fedoraproject.org (upstream for insights)
- [dkliban] Clowder is specific to Red Hat currently, but could be used by other orgs.
- [dkliban] As an example of stuff pulp-operator does do that we'd have to reimplement, is watching the config objects and then regenerating pulp settings (and applying them)
- [Ina] There are people who test pulp-operator. Are they doing things to supplement it like how RH uses clowder?
- [dkliban] services team is exploring switching the deployments on console.redhat.com to using Clowder https://docs.google.com/document/d/1I-ZI90_xEQpEl5GQdUt-Mcgs6PAwhjy88AHNHAQ_9T4/edit#heading=h.60zkfjl40xzp
- [mikedep333] Status of fixing the compose data loss bug (PR 468)
- probable issue with old podman & new podman-compose breaking the CI (the container is up)
- So I tried upgrading podman, and now it's trying and failing to install old pulp-file for some weird pip dependency reason
October 31, 2023
- Status of nested containers
- Fixed an issue whereby the nested container needed 64K + 1 UIDs from the outer container (because UID 0 is already mapped in Podman but not Docker)
- Hopefully the CI passes, we won't need to run "podman system migrate" in the inner container, and I can move onto the pulp-container PR.
- weird CI breakage for pulp-ansible / pulpcore version
- pulp-operator dual Pulp deployment check
- Update on testing 2 instances of pulp-operator with kind (K8s on container instead of VM)
- we are maxing the limits of GHA CI
- We got the rudimentary check done
October 25, 2023
- Status of nested containers
- Explored alternatives thoroughly, but approach is require outer containers to support UIDs up to 75535 (65535 + 10000)
- This avoids need for migration of existing containers
- This avoids security issues and potential compatibility of supporting UIDs inside the nested container of upto 65534.
- OpenShift is compatible for now because they do not run containers in user namespaces and therefore have 4G UIDs available
- [tanya/hyagi] need clarification on impact on community because of "[HMS] Singleton operator - can watch all the namespaces"
- what is the use case
- at some point we moved from cluster-scoped to namespace-scoped operator, any concern in watching all namespaces now?
- will we provide one option or all the combinaitons possible?
- [hyagi] CVE for operator framework
- [hyagi] Issue #1121
- Does it affect pulp?
- Should we warn users about "galaxy-operator" migration?
- What is the correct approach to fix the error?
- [hyagi] In a galaxy thread on slack the following question arose:
- "Isn't it required to stop or scale down services before running migrations to prevent locks or conflicts?"*
October 17, 2023
October 10, 2023
- Resuming work on nested latest podman (for pulp_container CI)
- 2 issues fixed, at least 1 remaining
- Going to finally merge the data loss bug
- 1st need to rebase
- was delayed due to CI being red
- pulp-operator testing
- Modify an existing test to deploy 2 instances of the operator side-by-side, for image-builder + content-sources
- We are considering making our operator global, watching all namespaces
- (Another project) image is registered as a catalog source, as a bundle image. So they push these in CI, and their cluster-wide OLM operator knows to deploy a new version.
- Releasing to the catalog from the main branch?
October 3, 2023
September 12, 2023
Sept 5, 2023
- backup of /var/lib/pulp/tmp content
- can we ignore this folder during backup? does it have anything that is important?
Aug 29, 2023
- pulp_container CI is failing with latest pulp-ci-centos image
- There was a regression in the capabilities of newuidmap / newgidmap - easy to fix
- I noticed that we never set VOLUMES for /var/lib/containers & /var/lib/pulp/.local/share/containers
- Ask the pulp_container team about this.
- Lots of other prescribed changes too that we never did
- Agreed: Do not focus on all the other prescribed changes
- Agreed: File an issue
Aug 22, 2023
- Compose data loss bug
- Fixing the migration of config files from multi-process container to compose
- Helping jhutar with performance testing cluster
- pulpcore-selinux 2.0.0 released for katello / satellite
Aug 15, 2023
- oci_env issues that jtanner had
s6-rc: fatal: unable to take locks: Resource busy
- Hard to research online
- [mikedep333] Will look into if I run int it myself.
the next issue is that because of the chained restarts and config updates, we don't have the social auth postgres tables ... so something needs to run migrations again
Aug 8, 2023
- Idea (not proposal) for pulpcon: demo HA pulp in k8s
- Services team will be doing HA soon (part of production plan)
- Let's do a joint presentation: mikedep333 & hyagi
-
- [mikedep333] Please re-explain the pulp-operator deployment modified function check
- Any changes with decko on leave?
- dkliban and mikedep333 to maintain pulp-oci-images
- [mikedep333] [services] pulp-smash config issue
Aug 1, 2023
- Enabling multiple pulp operators to run in a namespace
- Needed because content-sources and image-builder will each have a pulp operator running in a single namespace
- Except for a few singletons, almost all k8s objects will have names like "content-sources-pulp-api-abcdef" and "image-builder-pulp-api-abcdef"
Jul 25, 2023
- mikedep333 to continue following up with user support requests / bug reports for pulp_installer
Jul 19, 2023
- CI dashboard will be helpful
- for hosted-content: CI smoke tests with CJI IQE (requires writing plugin) or a custom bash test script that calls pytest?
- Agreed: Write custom bash test script for now
Jul 12, 2023
- We fixed pulp-oci-images CI last week
- Did the release for fixed images for prior pulp versions
- oci_env CI has the Ubuntu podman-compose fix now (from pulp-oci-images)
Jun 27, 2023
Jun 20, 2023
- Haven't done the quick fix / release for AAP-11911 yet
- Hosted Content: Deploying to staging is making slow but steady progress
- Re-review please: https://github.com/pulp/pulp-oci-images/pull/508
- We need to publish a pulp-web image to be used on consoledot.
- Agreed: Create another dockerfile in pulp-clowder-deployments repo. Build in same script.
Jun 13, 2023
- Was asked to fix AAP-11911
- Hosted Content: Deploying to staging is making steady progress
- 3.26 fixed release:
- Not released yet, we just merged the PR to update the list of branches to build
- Manually triggered a 3.26 CI run (pulp-oci-images CI) to build and push the 3.26 image
Jun 6, 2023
- Hosted services: Mike working on deployment method
- Plan to merge PR to pulp-clowder-deployments, and submit PR to app-interface today
- Did CI fixes so that 3.25 image is built and pushed
May 30, 2023
- Hosted services: Mike working on deployment method
- Working on the dummy script, trying things out
- A couple of open PRs / issues
- Mike providing reviews, answering questions
- Mike helped Decko with UBI image limitations
- Some packages exist in repos like "RHEL base" but not "UBI base"
- Adding CentOS Stream Repos for now
- If there's breakage, we can try the rockylinux / almalinux repos. Not using them now because their 8.9 release will lag behind RHEL 8.9 release by a few weeks.
May 23, 2023
- AAP-11911 is actually on hold (bmbouter relayed this on 5/17)
- Work with the consulting team (Matt Dorn)?
- No issues have come up
- I think we wanted their review on this: https://github.com/pulp/pulp-operator/pull/898
- Agreed: This is such a big review, we will focus as a team (with or without their help) on reviewing it when hyagi gets back, and establishes who needs this, and when.
May 17, 2023
Prior meetings: