changed 4 years ago
Linked with GitHub

Velero Community Meeting and Open Discussion Notes - Archive June - December 2020

This doc is meant to serve as the "one doc to rule them all" for Velero community meetings and open discussions.

Join our Google Group to get updates on the project and invites to community meetings.
Community meetings are held every Tuesday at 12PM ET / 9AM PT (Convert to your time zone) - Zoom link

You can find recordings of these meeting on YouTube.

Archive of older meeting notes
_
Add your request, question or suggestion to our issue list

If you have anything you'd like to put on the agenda, please do so below for the next meeting:

Jan 19, 2021

Status Updates

Discussion Topics

Community Shoutouts

Jan 12, 2021

Status Updates

  • dave
    - e2e tests running in Azure
    - Working on e2e tests running against all clouds as install target
  • nrb
  • bridget
    • Community support last week
    • Working on multiple credentials support
      • Need to make a decision on approach to take
    • Need to follow up on some PRs that have been updated
  • carlisia
    - This week will be PR reviews and addressing code reviews on my PRs
    - Sharing community support with Nolan
    - PTO this Thu and Fri

Discussion Topics

  • dave
    • Azure issues with memory limit and plug-in failure. Hard to debug some plug-in failures.
      • nrb - There is an issue w/ Azure plugin where object storage chunk uploading uses the maximum memory of the Velero pod.
      • nrb - Adding ~10 log messages into the Backup CR seems useful, especially when it can't upload to object store
  • Dylan
    • Ability to selectively patch resources on per restore basis.
      • Use case is Secrets & CRs
        • User wants to continually patch Secrets vs replace it
        • Operators watching CRs, they wanted to make sure the operators were going to keep working
      • Current design of plugins is very much all or nothing.
        • Plugins should be able to allow mutations or patching
      • Give post hooks the backup manifest to help
      • When patching, this is a merge operation. That may result in a merged, incorrect object at the end
      • Can we add a way to download individual JSON from a backup to then re-submit with Kubectl?
        • This exists with the velero backup download command, but you need to take it out of a tarball
      • Can we modify the restoreitemaction plugin API to return an action?
        • The plugin could identify the relevant fields and return a patch object & the patch action
      • Action - Alay and Dylan to work on design doc
  • Phuong: (phuong.n.hoang@dell.com) invite to design meeting for Velero plugin timeout 9am PST Friday Jan 15th. This is the Zoom meeting: https://Dell.zoom.us/j/91078486282?pwd=Yk9sU1UwT0NScXE2Q2oyRGV3K002QT09
  • carlisia
  • nrb
  • bridget
    • Can we do a v1.5.3 release in the meantime before 1.6?
      • nrb:
        Image Not Showing Possible Reasons
        • The image file may be corrupted
        • The server hosting the image is unavailable
        • The image path is incorrect
        • The image format is not supported
        Learn More →
        • We need to include some patches like properly deleting when DeleteItemActions are missing, and fixing include/exclude plugin logic (I don't have a full list)

Contributor Shoutouts

Jan 5, 2021

Status Updates

  • carlisia
    - Back from 2 weeks PTO
    - This week will be PR reviews and addressing code reviews on my PRs
  • bridget
    • Catching up after PTO
    • On community support this week
    • Will pick up work on multiple credentials support again
  • nrb
    • Also catching up after PTO - need to catch up on opened issues and outstanding PRs

Discussion Topics

Contributor Shoutouts

Dec 15, 2020

The Velero maintainer team at VMware will be on holiday between December 19, 2020 to January 4, 2021.

Status Updates

  • nrb
  • carlisia
  • bridget
    • Community support first half of this week
    • Starting vacation Wed/Thurs
    • Continuing to work on multiple credentials proposal
    • PR reviews
    • About to submit a PR for delve debugging in Tilt
  • dsmithuchida
    • Submitted talk proposal for Kubecon EU on Backup/restore of operator driven apps
    • Started on design for upload progress and data movement tracking

Discussion Topics

  • carlisia
  • nrb
    • New GitHub Processes
    • Using Ginkgo
      • Frankie:
        • Uses BDD descrptions, which isn't necessarily used
        • Doesn't feel "Go-native"
      • Carlisia
        • controller-runtime uses Ginkgo's BeforeTest entrypoints to do setup.
      • Dave
        • Used e2e tests cause we were using it elsewhere
        • Thought there were benefits to it that haven't manifested

Contributor Shoutouts

Dec 8, 2020

Status Updates

  • carlisia
  • bridget
    • Spent quite a bit of time on PR reviews
    • Experimenting with Delve/Tilt debugging
    • Working on initial steps for supporting multiple credentials
      • Mostly experimental work based on design doc
  • dsmithuchida
    • Working on Distributed Data Generator on Azure
  • nrb
    • PR reviews
    • Working on board processes

Discussion Topics

Community Shoutouts

  • Swanand Shende for helping folks out in our upstream Slack

Dec 1, 2020

Status Updates

  • carlisia

  • bridget

    • Updated design for multiple credentials support
      • Needs further thought on the impact on upgrades and Helm
    • Updated the docker actions used in CI
    • Spent some time on community support last week over Thanksgiving
  • nrb

    • Bereavement & holiday last week
    • PR reviews yesterday

Discussion Topics

  • carlisia: Tilt setup demo

Community Shoutouts

Nov 17, 2020

Status Updates

  • carlisia
    • Was on community support last week
    • Had a day-long hiring event (for VMW new grads)
    • Looking around on how to get a GH Projects automation of issues working for multiple repos. Testing this one: https://github.com/philschatz/project-bot
    • PR reviews pending
  • bridget
  • ashish
    • Community support this week
    • Kubecon NA 2020.
    • Made progress on E2E tests.
      • Using ginkgo as the testing framework
      • Setting up a testing scaffolding. PR 3060 with Dave
    • Bug fix: BSLs with validation disabled should be validated at least once
      • What?
        • IsReadyToValidate is expected to return true for all BSLs for the first validation, even if repeated validation is disabled by setting the location.Spec.ValidationFrequency to 0
        • Unit test case to demonstrate the failure
          ​​​​​​​​​​​​​​​​{
          ​​​​​​​​​​​​​​​​    name: "should return true when validation frequency is zero and lastValidationTime is nil",
          ​​​​​​​​​​​​​​​​    bslValidationFrequency: &metav1.Duration{Duration: 0},
          ​​​​​​​​​​​​​​​​    defaultLocationInfo: DefaultBackupLocationInfo{
          ​​​​​​​​​​​​​​​​        StoreValidationFrequency: 0,
          ​​​​​​​​​​​​​​​​    },
          ​​​​​​​​​​​​​​​​    ready: true,
          ​​​​​​​​​​​​​​​​},
          
        • Fix for this is available in PR #3084
    • CI jobs validating that Velero CRDs can be applied in different versions of kubernetes cluster started failing and engineerd/setup-kind had to be updated from v0.4.0 to v0.5.0 as a fix.
    • In 1.6 we want to migrate a subset of controllers to kubebuilder. I had started work on migrating the BackupController at the end of the v1.5 release. OK to prioritize it for v1.6?
  • nolan
    • Working on process flow; will catch up on PR backlog this week along with KubeCon
    • Blocker: getting design PRs more attention
  • david
    • Conflict

Community Shoutouts

Nov 10, 2020

Status Updates

Discussion Topics

Community Shoutouts

Oct 27, 2020

Status Updates

  • carlisia
  • ashish
    • Support-a-thon last week
    • Community support this week
    • Have you heard about our Velero Office hours?
      this opens another hackmd page
  • dsmithuchida
    • Support-a-thon last week
    • Working on basic e2e test, mostly working
  • nrb
    • Support-a-thon last week
    • Revisting our bug triage process
    • Released v1.4.3 and v1.5.2 last week with a security fix - please upgrade when you can.
    • Blocker - several design PRs are waiting on other maintainers to review

Discussion Topics

  • Rafael @brito-rafa , Frankie @codegold79 and Jenting @jenting - Migration doc: https://github.com/vmware-tanzu/velero/issues/2551 . About K8s API Groups: http://www.rafaelbrito.com/2020/04/kubernetes-api-groups.html
  • Alay: Provide more visibility on reasons for PartialFailure by reporting the errors on velero CR
    • The CRs don't provide errors directly, just he number
    • CRs are limited to 1.5mb usually due to etcd
    • There's a mismatch between restore & backup log formats, making just parsing difficult
    • This could be a subresource backed by a web hook, not a CRD

Community Shoutouts

Oct 20, 2020

Status Updates

  • carlisia
    • Community support status:
      • done: triaged all recent (couple weeks) issues
      • wip: triaging older issues and starting on the other repos
    • PR reviews
    • tackle my broken test for https://github.com/vmware-tanzu/velero/pull/3004 (DownloadRequest migration to kubebuilder)
  • nrb
  • bridget
    • Made some changes to #2993 based on comments
      • Discovered a problem with approach in newest changes so need to revisit that approach and modify tests
    • Helped Ashish with internal release processes last week
    • Reviewing design docs
  • dsmithuchida
    • PTO last week
    • Working on e2e tests

Discussion Topics

Community Shoutouts

Oct 13, 2020

Status Updates

Discussion Topics

  • Phuong
  • sseago
  • dymurray
    • Supporting restore progress reporting https://github.com/vmware-tanzu/velero/issues/21
    • Tackle restores similar to the backups right now as a first phase
    • Current issue is on restore we don't have a fully accurate number of items. For example, when using --include-namespaces, we don't know the number of to-be-restored resources ahead of time.
    • Getting a 2 pass restore system would help immensely here, since we would pre-process and match resources before sending them to the API server. This would give us an overall count to report for progress.
  • Antony
    • https://github.com/vmware-tanzu/velero/issues/2991:
      • BSL controller initializes backup storage locations almost every second
        • Significant issue for object stores where init has expensive calls (e.g API calls)
      • Any chance we could get a hotfix? When does v1.5.2 ship?

Contributor Shoutouts

Oct 6, 2020

Status Updates

  • dsmithuchida
    • Working on e2e testing and ginkgo framework
  • bridget
    • Couple of outstanding PRs that I'd like reviewed (#2951, #2969)
    • Been working on some bugs:
      • Backup deletion fails if cloud resources already deleted #2980
        This is a regression in behaviour from v1.4.2 due to the introduction of DeleteItemActions. We can introduce a fix, but how should we handle the case where there are DeleteItemActions to perform but the resources are deleted?
      • Unable to install Velero using go get #2966
        We can fix this by upgrading the version of CSI external-snapshotter but it requires updating the version of Kubernetes modules we're using to 0.19.0. What is the impact of upgrading these libraries?
  • nrb
    • Not a whole lot of updates; was on semi-PTO last week.
    • Intending to do a "support-a-thon"/bug smash session when the whole team is back

Discussion Topics

  • Waiting for Additional Items to restore
  • Issue #2980 - deleteitemactions regression
    • Happens all the time, but what do we do when there's DeleteItemActions but no backup tarball?
    • Since there's no way to really know what to match the plugins against, log that there was no tarball and continue, skipping the plugins.

Community Contributions

Sept 22, 2020

Status Updates

  • nrb
  • carlisia
    • Last week worked on v1.5.1 release
    • Reviews
    • Piling up: more reviews, GH issues + Helm chart
    • working on the CLI side of the download request migration to kubebuilder/controller-runtime.
  • bridget
    • Worked on some improvements to our release scripts following the 1.5.1 release
    • Now looking at internal build processes
  • dsmithuchida
    • Resource blocking in vSphere plug-in for Project Pacific internal resources
    • OpenSource project approvals for GVDDK (currently part of Astrolabe) and Data Generator (Kibishii) test tool

Discussion Topics

  • nrb: Defining prerelease to GA timeframes
    • From last week:
      • Prereleases ~1 week between each?
      • RC -> GA, ~2 weeks?
    • Phuong - 3 month release cycle is acceptable for them, waiting for features on that timeframe is reasonable. Longer is too much.
      • The RC was helpful to integrate and test with their product. It meant updating to the actual release was just removing some characters.
      • 1 week seemed reasonable to them, but if they hit bugs it may not be enough time.
    • Dylan - Red Hat's Konveyor lags a little. They want to support older Kubernetes releases like v1.7 due to supporting OpenShift 3. (this isn't determined for upstream Velero)
      • Konveyor does OpenShift 3->4 and 4->4 migration.
      • Red Hat also has OADP for OpenShift data protection, and their Velero fork is used there, too.
      • Red Hat likes the 3 month release cycle, especially for OADP. Backwards compatibility in Konveyor is trickier.
      • For OADP, they tried the RC for basic tasks.
  • poojita: Recover OpenShift's native resource: DeploymentConfig
    • Velero restore error seen has been captured below:
      • Velero failed to restore namespace frank3. {"namespaces":{"frank3":["error restoring imagetags.image.openshift.io/frank3/httpd-example:latest: ImageTag.image.openshift.io \"httpd-example:latest\" is invalid: spec: Required value: spec is a required field during creation"]}}. Velero restore '51685651-5369-51e1-88df-8977874919ca-2020-09-16-08-53-48-frank3' failed: {\n "phase": "PartiallyFailed",\n "warnings": 6,\n "errors": 1\n}."
      • Dylan - There's an OpenShift plugin that can help restore these CRDs on vanilla Velero installs. https://github.com/konveyor/openshift-velero-plugin
        • For ImageTag, it skips restore of these
        • Plugin recreates ImageStream, which then recreates the ImageTag.
        • ImageTag is a new, undocumented resource in OpenShift v4.4.
        • The plugin is used in the context of the migration product (Konveyor) and data protection (OADP).
        • Can file GitHub issues on that repo if you have issues/questions.
        • OADP bundles Velero and the AWS plugin on OpenShift. The benefit here is that it backs up the images to the S3 bucket.
          • Plugin can't do it by itself right now. OADP sets up a image repo that the plugin doesn't orchestrate right now
  • Alay - Plugin dependencies are challenging
    • If you deploy Velero by itself, without a wrapper, there are challenges in connecting plugins to external systems such as StatefulSets or Deployments.
    • Dave: Difference between the vSphere plugin and OADP is that OADP is at what point in the lifecycle they're active.
    • May be able to extend plugins via the Velero app operator that VMware is working on.
    • RH's requirements:
      • Before the plugin runs, ensure the dependency is healthy
      • During/after backup, ensure the dependency is healthy.
        • If it's not, short circuit the operation instead of trying the full backup operation and failing.
  • Narashima - Took a backup w/ Velero and tried to restore it to another cluster. ReplicaSets/Deployments are getting duplicated in the new cluster.
    • kubectl get shows multiple entries.

Shoutouts

Slow week for contributions, but thanks to mikkael for the PR to allow users to change the container's timezone

Sept 15, 2020

Status Updates

  • ashish
    • v1.5.0-rc.1 released on 9/10
      • Getting exec restore hooks merged. 🎉 Thank you Andrew Reed!
      • Testing the RC release.
      • Helping out with hardening the release procedure.
      • Getting PRs merged and fixing some documentation bugs.
    • Migration to kubebuilder:
      • BackpController- work started and made some progress.
  • nrb
    • Was on PTO last week, not a whole lot to discuss there
    • Would like to get v1.5.0 out for sure this week
    • Following up on some PR reviews that were opened while I was still out
    • Looking at design docs for the v1.6 timeframe
    • Welcome Dave to the team officially!
  • carlisia
  • bridget
    • Still getting up to speed with things
    • Want to get #2881 reviewed/merged
    • Looking for something new to start on
  • stephanie
    • 1.6 stack rank design and implementation
      • some delays getting this published and we have a deadline for getting that up and published for community viewing this week
      • we encourage comments and feedback to the stack rank
      • any roadmap updates will also be published following the finalized 1.6 stack rank
  • dave
    • Vsphere plugin testing on 1.5 rc looking good
    • E2E testing:
      • something developers can run
        • Nolan: expect this to be open source
      • something that goes into CI

Discussion Topics

  • nrb: Concurrent backup/restore job support
    • This is at the job level, not within a job (for example, grabbing all PVs in parallel)
    • There was a proposal for implementing this as worker pods, but I'd like to look at different design approaches, not just one
    • Would like to start this design work, but it's going to take a while. Will initially get a design doc with goals/non-goals up.
    • Things to address:
      • quiescing at backup
      • contention at restore time
      • Locking
    • Become more distributed
    • [Carlisia] Sounds more like product requirements
    • Can we implement concurrency piecemeal?
      • For example, is it acceptable to be concurrent only on a namespace level?
    • Can we make PoC(s)?
    • Action item - get a doc with goals/non-goals that set up guard rails, then have a sessions to discuss it w/ whiteboarding
  • carlisia:
  • ashish:
    • What is the general timeline for a release:
      • Pre -> RC -> GA
      • Nolan: RC -> GA: ~2 weeks
      • Nolan: Release cadence is up to debate, though
      • Ashish: ~1 week between pre-releases.
  • dave:
    • Passing feature flags to init containers
      • The init containers just copy the plugin to a well-known location.
        • vSphere plugin has a bunch of ancillary stuff and we install from the init container
        • The --features flag is passed from the core velero to the plugin. I spoke with someone on the vpshere plugin to suggest this.

Contributor Shoutouts

Sept 8, 2020

Status Updates

  • carlisia
    • RC this week
    • On community support
    • Nolan is on PTO
    • Working on the download request migration
  • ashish
    • 1.5 release testing
    • restore hooks PR:
      • testing
      • PR 2804 reviews
      • getting that merged hopefully today.
  • bridget
    • Reviewing PRs for the 1.5 release

Discussion Topics

  • Phuong: Timeout for plugins. For app consistent backup, the pod will be quiesced while snapshot being taken. If the PVC plugin taking the snapshot and that plugin take a long time (say because it fails and retries), the application will be blocked for a long time which is unwanted behavior.

Contributor Shoutouts

Sept 1, 2020

Status Updates

  • carlisia

    • pto last Friday
    • addressing code reviews
    • going to review/test PRs for 1.5
  • nrb

    • Planning for Q4 w/ VMware
    • Some planning for v1.6
    • reviewing PRs
    • v1.5.0 delayed (in discussion topics)
    • Welcoming Dave!
    • Community support this week
    • On vacation next week, will transfer responsibilities over to folks prior
  • bridget

    • Finally transitioned over to the Velero project
    • Getting up to speed
    • Reviewing PRs for 1.5

Discussion Topics

  • nrb: v1.5.0 has been delayed. We've got a list of release blocker PRs that we'd like to get through, and will not add more code changes unless there's a critical bug/security fix. The PRs are in relative priority.
    • Timeframe - ~2 weeks
    • Testing will be more automated starting in v1.6, but we don't have a defined e2e test suite right now, just whatever is in make ci
  • nrb: VMware will be having a company holiday on Friday, September 4 and Monday, September 7 will be US Labor Day.
  • carlisia: do we want to halt the Velero server in this case: "Invalid value for disable-controllers flag provided" (https://github.com/vmware-tanzu/velero/blob/976484553062262324cc1304607ea00841262db8/pkg/cmd/server/server.go#L822)
    • Action Item: Carlisia to open an issue to discuss this and document the issue
  • phuong:
    • quick update on the backup cluster resources
      • Able to back up clustorerole and pv in the context of a namespace, don't need to modify the selection logic
    • backing up PVC in parallel
      • Taking backups of pods in logical group in a serial manner could lead to drift
      • This is parallelization w/in a backup job, not parallelization of all jobs
      • Currently this would be VolumeSnapshotter plugins - may not be a resource contention issue
        • This is the main issue here - use case is for Cassandra
        • How would this work with the CSI plugin? Since it's BackupItemAction, the interface is way different
      • At the Kubernetes API server level, would we want to specify parallelism per resource type?
        • CRDs may need this, too
        • Need to be careful to not overflow resource limits
      • Don't want to overwhelm writing to the backup tarball
        • Once we get the body, that could be funneled to the tarball serially
      • Action Item: Phuong to make an issue to continue discussion: https://github.com/vmware-tanzu/velero/issues/2888
  • sseago: posted design doc for RestoreItemAction "wait for additional item to be ready" (https://github.com/vmware-tanzu/velero/pull/2867)
  • brito-rafa: will work with jenting on supporting multiple API Groups (https://github.com/vmware-tanzu/velero/issues/2551)

Contributor Shoutouts

August 25, 2020

Status Updates

  • stephanie
    • building the priority/stack rank list for the 1.6 release
    • this will be published after we complete planning next week for the community to refer to and will include additional details that compliment our current product roadmap
  • ashish
    • Community support this week.
    • Misc work for the 1.5 release:
      • Pre-release is out. Try it out in your non-prod environments
      • Bug fix for server version not being correctly displayed as output of velero version.
      • Updating some docs
      • Getting PRs reviewed and merged.
      • Testing out the release
    • Will be taking another look at the exec restore hooks PR.
  • carlisia
  • nrb
    • released v1.5.0-beta.1
    • testing the beta - please try it out in your environments, too!
    • planning for v1.6
    • Owe a review on the restore hooks exec work

Discussion Topics

  • nrb: We aim to have v1.5.0 out by September 1
  • Let us know what Velero code you depend on
  • Defining printer columns
    • Do we need to get the exact same values as Velero's CLI in kubectl printer columns?
      • Not necessarily - they can be raw values, but we can use webhooks for calculations.
  • Phuong updating go modules
    • 1.5.0-beta.1 generating CRs having issues
    • The error has an issue with missing a context argument - related to upgrading to Kubernetes v1.18
      ​​​​​​​​ERROR] generated/clientset/versioned/typed/applicationtemplate/v1beta1/applicationtemplate.go:73:5: not enough arguments in call to c.client.Get().Namespace(c.ns).Resource("applicationtemplates").Name(name).VersionedParams(&options, scheme.ParameterCodec).Do
      ​​​​​​​​[ERROR]         have ()
      ​​​​​​​​[ERROR]         want (context.Context)
      

Contributor Shoutouts

August 18, 2020

Status Updates

Discussion Topics

  • Phuong Hoang: avoid backup PersistentVolumes not referenced by included namespaces when enable include-cluster-resources

    • Use exclude-resources=persistentvolumes?
      • We still want to include the PersistentVolumes being used by PVCs belong to the namespace being backed up but not including other PersistentVolumes
      • Plugin additional items may ignore this, but we can't remember
    • Also want to include other cluster-scoped resources like ClusterRoleBindings
    • Sounds like we want OR selector logic support
    • One idea - use include-resources and specify exactly what you want
      • Limitation: you have to specify exactly what you want
    • Action item: Phuong will investigate a proposal to enhance item selection.
  • stephanie

    • Velero 1.6 release planning (starting around v1.5 beta release)
      • We'll share this around Sept 1 in ZenHub & GitHub
      • Will update roadmap.md, too
      • Between now and then, please
        Image Not Showing Possible Reasons
        • The image file may be corrupted
        • The server hosting the image is unavailable
        • The image path is incorrect
        • The image format is not supported
        Learn More →
        any issues that you're interested in seeing in the v1.6 release
  • Let us know what Velero code you depend on

  • Ashish: Velero Office Hours are coming, exact dates & times to be announced

Contributor Shoutouts

August 11, 2020

Status Updates

  • carlisia

    • PTO last week
    • Reviewing PRs
    • Back to the last of migrating ServerStatusRequest to kubebuilder
  • ashish

    • Community support last week
      • Got to a bunch of PRs and GH issues.
    • Restore Hooks:
      • PR for Init container restore hooks PR 2787
      • PR to fix API tyes PR 2793
    • Fix to pass the --default-volumes-to-restic flag from schedules to the backup options. PR 2776
    • Setup CRD validation CI in GitHub actions PR 2805
      • Currently setup to run against kubernetes version 1.16.9, 1.17.0 and 1.18.4
      • AI: Switch to latest of each minor release
    • Restructuring build broke the EntryPoint for the restic-restore helper. PR 2802
    • PRs in the CSI plugin repo:
      • Updating the CI in the repo to push images with latest tag. PR 65
      • Better handling of restore when VolumeSnapshot resources exist but the associated workloads got deleted PR 64
  • nrb

  • stephanie

    • We are aiming to have 1.6 release planning start following the 1.5 beta at the end of August
      • refer to our roadmap for items currently planned for December 2020 which corresponds with the 1.6 release
    • Velero security policy published

Discussion Topics

  • Introducing Bridget!
  • Let us know what Velero code you depend on
  • Velero-plugin-for-CSI PR 1902 KEP to add validation hooks on the VolumeSnapshot API objects.
    • Took a look at it and these don't seem to violate our assumptions/expectations
    • Would love to have more 👀 on it
  • What versions of Kubernetes does Velero support?
    • Alay: Red Hat supports all the way back to Kube v1.7, will bump up to v1.9. Uses Velero v1.3
    • An LTS

Contributor Shoutouts

August 4, 2020

Status Updates

  • ashish
    • Community support this week
      • Have a list of PRs to review
      • New issues that need triage
    • Working on restore hooks
      • PR 2787 to implement the init container restore hooks.
      • Need to run this through tests before removing WIP.
  • nrb
    • Out sick late last week
    • Working on DeleteItemAction design. Please take a look!
    • Investigating options on project management and release cadence to get issues handled more quickly, as well as get fixes out more regularly.
  • carlisia
    • PTO!
    • Working on migrating ServerStatusRequest to kubebuilder

Discussion Topics

Contributor Shoutouts

July 28, 2020

Status Updates

  • carlisia
    • Community support this week
    • PTO next week
    • Bringing the Helm repo up to date with v1.4 + more Velero Helm TLC
    • Continue work on the ServerStatusRequest migration to kubebuilder
  • ashish
    • Restore hooks:
    • Backing up pod volumes using restic by default:
      • PR 2611
      • Excluding volumes mounting secrets and config maps
      • Opening Issue 2770. Support for label selector to choose pods to which the defaultVolumesToRestic should be applied.
        Image Not Showing Possible Reasons
        • The image file may be corrupted
        • The server hosting the image is unavailable
        • The image path is incorrect
        • The image format is not supported
        Learn More →
        /
        Image Not Showing Possible Reasons
        • The image file may be corrupted
        • The server hosting the image is unavailable
        • The image path is incorrect
        • The image format is not supported
        Learn More →
        in the issue
    • PR reviews
  • nrb
    • Working on some CI improvements around assignment/reviewers, but ran into issues with how GitHub Actions expose secrets
    • PR reviews
    • Intending to get DeleteItemAction design doc updated today or tomorrow
    • Was on community support last week

Discussion Topics

Contributor Shoutouts

Helm Chart

July 21, 2020

Status Updates

Discussion Topics

Community Shoutouts

July 14, 2020

Status Updates

  • ashish
    • Community support last week
    • Fix for capturing version of CRD before version remap plugin:
    • Fixed Issue 17 with PR 60 in the velero-plugin-for-csi repo
      • allows users of the CSI plugin to keep regular volumesnapshotclass separate from that used by Velero. Primarily to allow different DeletionPolicy.
  • [carlisia]
  • [nrb]
    • v1.4.2 is out!
      • Includes fixes from CRD
    • Going to try to help catch up on community support this week, particularly on GH issues/PRs.
    • Get master -> main transition done this week

Discussion topics

Contributor Shoutouts

July 7, 2020

Status Updates

  • nrb
    • Community support last week, still playing catch up on GH issues though
    • Review blitz yesterday, but did not get them all
    • End in sight for our internal build process work, hope to give more time to this open source work!
    • Restore hooks meeting yesterday, thanks everyone for attending!
  • carlisia
    • BSL controller PR: https://github.com/vmware-tanzu/velero/pull/2674
    • Worked on adding the BSL validation as webhooks; talked to @nrb and decided we won't do it
    • Did and doing PR reviews
    • Finished reviewing the volume backup/restore progress design doc
    • Next: converting ServerStatusRequest to controller-runtime
  • ashish
    • Community support this week:
    • PR to fix a CRD backup regression in 1.4. PR Issue
      • Tested against "v1.16.9" "v1.17.0" "v1.18.4"
      • Will be included in the release for 1.4.1
    • Working on internal build setup. Should mostly be done, need a few finishing touches
    • PR to fix the file permissions on the manifests in the backup tarball.

Discussion topics

Contributor Shoutouts

June 30, 2020

Status Updates

  • carlisia
    • BSL controller to handle validation + update BSL status phase - PR ready for review (this changes the behavior of the Velero server): https://github.com/vmware-tanzu/velero/pull/2674/
    • WIP: add webhook validation to BSL create/update
    • PR reviews owed
  • ashish
    • on community support last week
    • worked on some internal build setup and testing of the build artifacts
    • Opened this issue last week Issue 2673
    • Currently working on some load testing for restic
  • nrb
    • Was out for the end of last week for family health issues, didn't get everything I wanted done.
    • Reviews!
    • Updates on docs, getting Jason S up to speed for implementing new commands
    • On community support this week

Discussion topics

  • [nrb] Moving code from backup & restore controllers out into the pkg/{backup/restore} directories
    Helps with kubebuilder/concurrency
  • [stephanie] Scheduling a backup and restore focus meeting Monday July 6th, more info to come here

Contributor shoutouts

Vishnu Itta for the volume snapshotter progress design doc!! Let's get that merged this week (for real!)
Marc Campbell for the restore hooks design doc!
Phuong for the work on the ordering design doc!

June 23, 2020

Status Updates

Discussion Topics

  • nrb Restore hooks - let's have a seperate call for this to address some concerns about using init containers.
    • Main use case seems to be quiescing/unquiescing databases
      • Will need to use some sort of tool to send this to a backup PV, and a tool to take it from the backup PV to a runtime PV
  • nrb/carlisia - Public/private APIs
    • Let us know what you rely on!
    • This will be phased, not all at once
    • In 2.0, the current Go client for Velero will be replaced by a controller-runtime client. The informers/listers will be replaced by a client manager.
  • Vishnu (Enable restic by default - https://github.com/vmware-tanzu/velero/pull/2611) and (Progress on backup/restore by plugins - https://github.com/vmware-tanzu/velero/pull/2543)

Contributor Shoutouts

June 16, 2020

Status Updates

Discussion Topics

Contributor Shoutouts

Select a repo