owned this note
owned this note
Published
Linked with GitHub
OpenEBS e2e Discussions Notes
===
:::info
- **Location:** https://meet.google.com/bme-vjsj-qdp
- **Date:** Every week from 12:45am to 01:15pm on Monday (IST)
- **Agenda**
1. Walk through the status update
3. Discussion topic
*Add your request, question or suggestion to our [issue list](https://github.com/openebs/e2e-tests/issues)*
*If you have anything you’d like to put on the agenda, please do so below for the next meeting:*
:::
## December, 7th 2020
Attendees: Sathya
- [name=Sathya]
- Refactor the Backup and restore test case - segregate it into different test case [In progress]
- Modified the test case to cleanup the spc pool [done]
- refactor the deprovision logic for the application deprovison test case. [done]
## November, 30th 2020
Attendees: Sathya
- [name=Sathya]
- Refactor the Backup and restore test case - segregate it into different test case [pending]
- Modified the test case to cleanup the spc pool
- refactor the deprovision logic for the application deprovison test case.
## November, 23rd 2020
Attendees: Sathya
- [name=Sathya]
- Refactor the Backup and restore test case - segregate it into different test case [pending]
- Refactor the spc rothreshold limit testcase [flaky]
## November, 9th 2020
Attendees: Sathya
- [name=Sathya]
- Refactor the Backup and restore test case - segregate it into different test case [pending]
## November, 2Nd 2020
Attendees: Sathya
- [name=Sathya]
- Automate the test case for pool expansion when the bd is already in use by any pool [done]
- Automate the test case for blockdevice replcaement when the bd is already in use by any pool [done]
- Refactor the Backup and restore test case - segregate it into different test case [pending]
## October, 26th 2020
Attendees: Sathya
- [name=Sathya]
- Automate the test case to perform openebs provision other than the openebs namespace [Done]
- Automate the test case for pool expansion when the bd is already in use by any pool [In Review]
- Automate the test case for blockdevice replcaement when the bd is already in use by any pool [In Review]
- Refactor the Backup and restore test case - segregate it into different test case [pending]
## October, 19th 2020
Attendees: Sathya
- [name=Sathya]
- Automate the test case to perform openebs provision other than the openebs namespace
- Automate the test case for pool expansion when the bd is already in use by any pool
## October, 12th 2020
Attendees: Sathya, Giri
- [name=Aman Gupta]
- Automation of e2e-test for incremental backup & restore for zfs-localpv.
- Restore on different nodes.
- [name=Sathya]
- Pipeline scripts for SPC and cStor volume migration [Done]
- Include litmuschaos experiment for target resource limit test [Done]
- Gitlab transition from shashank
- backup non cstor csi volume migrate spc pool restore the backup
## October, 5th 2020
Attendees: Sathya, Giri
- [name=Sathya]
- Pipeline scripts for SPC and cStor volume migration [in review]
- Include litmuschaos experiment for target resource limit test [in progress]
- Gitlab transition from shashank
- [name=Aman Gupta]
- Completion of backup & restore experiment [e2e-test] and Add this into pipelines [in-progress]
## September, 28th 2020
Attendees: Sathya, Giri, Aman
- [name= Shashank]
- Resolve infra related issues.
- Montitor pipelines and fix failed cases. [in progress]
- [name=Sathya]
- Automate the test case for cstor spc and cstor volume migration [in review]
- Refactor the test case for cstor operator and csi provisioner [in review]
- Pipeline scripts for SPC and cStor volume migration [in progress]
## September, 21st 2020
Attendees: Sathya, Aman, Giri
- [name=Aman]
- Automate the e2e test for backup and restore for zfs-localpv [in-progress]
- Keep continue my task with e2e for backup restore with different scenraio and on different OS.
- [name=Sathya]
- Automate the test case for cstor spc and cstor volume migration [in progress]
- Refactor the test case for cstor operator and csi provisioner [in progress]
- Pipeline scripts for SPC and cStor volume migration
## September ,7th 2020
### Status Updates
Attendees: Sathya, Aman, Shashank, Giri, Kiran
- [name=shashank]
- Adding local csi based backup/restore in pipeline [in progress]
- [name=sathya]
- Refactoring the csi volume and cspc pool upgrade test case. [in review]
- Include the container run time failure test scenario for csi based cstor volume into pipeline [in review]
- Automate the test case for app target affinity for csi cstor volume [in progress]
- Pipeline for cspc/ci upgrade [ in progress ]
- [name=Aman Gupta]
- validation of the zfs-operator yaml (one for all OS) [https://github.com/openebs/zfs-localpv/pull/204] ubuntu- done by Pawan , centos 7-done, centos8- done by one user (pending on me), on k3s-pending
- After backup restart PR gets merge, will start manual testing of this feature.
## August 31st, 2020
### Status Updates
Attendees: Sathya, Aman, Shashank, Giri, Kiran
- [shashank]
- Adding local csi based backup/restore in pipeline [inprogress]
- Deploying lab-infra bot on GCP VM [Done]
- [name=Aman Gupta]
- Added pod restart (one which is deployed before upgrade zfs driver) test cases in upgrade pipeline. [done]
- Node failure scenario for zfs-localpv [automation-done, pipeline script-pending]
- [name=sathya]
- Automate the test case for pool creation with block device has filesystem.[done]
- Refactoring the csi volume and cspc pool upgrade test case. [in progress]
- Automate the test case for app target affinity for csi cstor volume [in progress]
- Refactor the target network loss failure in pipeline [done]
- Refactor the spc/cstor volume migration test case. [in progress]
- [name=giri]
- Review the Pull Requests
## August 24th, 2020
Attendees: Sathya, Aman, Shashank, Giri, Kiran
### Status Updates
- [name=shashank]
- Openshift cluster certificate rotation and cluster renewal [In progress]
- Refactorization and verification of Jiva internal snapshot deletion. [In progress]
- Automate local Backup/Restore for CSI volumes. [In progress]
- [name=sathya]
- Automate the test case for pool creation with block device has filesystem.[in review]
- Manually verify the complete cluster shutdown scenario [MV]
- Refactoring the csi volume and cspc pool upgrade test case. [ in progress ]
- Automate the test case for app target affinity for csi cstor volume [ in progress]
- Refactor the target network loss failure in pipeline [in review]
- refactor the spc/cstor volume migration test case. [in progress]
- [name=Aman Gupta]
- Node failure scenario via power off the VM's on which volume is provisioned. [manually tested, e2e-script testing pending ]
- Going through already present test cases for backup-restore and then create a test plan for zfs-localpv backup restore.
- blog on e2e for zfs-localpv [in bucket list]
## August 17th, 2020
Attendees: Sathya, Aman, Shashank, Giri, Kiran
### Status Updates
- [shashank]
- Gitlab licence renewal.
- Creation Openshift-4.5 pipeline cluster for next release.
- Refactor jiva autogen snapshot script.
- [name=sathya]
- Automate the test case for pool creation with block device has filesystem.[in review]
- Manually verify the complete cluster shutdown scenario [MV in progress]
- Refactoring the csi volume and cspc pool upgrade test case. [ in progress ]
- Refactor the target network loss failure in pipeline [incosistent in pipeline]
- refactor the spc/cstor volume migration test case. [in progress]
- [name=Aman]
- For zfs-localpv main focus will be on e2e for backup and restore feature.
- blog on e2e with zfs-localpv [14th sept 2020]
## August 10th, 2020
Attendees: Sathya, Aman, Shashank, Giri, Kiran
### Status Updates
- [name=shashank]
- Fix pipeline faiures on konvoy & Openshift.
- Fix and troubleshoot GitLab CI failures.
- [name=sathya]
- create a separate folder for cstor test cases and create readme [ create backlog]
- release activities for RC1 with openshift, konvoy pipelines [pending]
- triggering pipelines for RC tags [openeshift,konvoy]
- trigger pipeline for Upgrades [AWS]
- Manually verify spc and noncsi volume migration
- Manually verify the cstor csi upgrade.
- Automate the test case for pool creation with block device has filesystem.[in progress]
- Manually verify the complete cluster shutdown scenario [ in progress]
- Upgrade pipeline scripts for cspc pool and csi volumes in konvoy. [ done ]
- Refactor the test scripts for CSI provisioner and cStor operator deployment test case [done]
- Automate the reusablity of block devices test case [done]
- Include csi based test cases into openshift pipeline [done]
- refactor the flaky test cases [done]
- csi volume scale down
- node failure for csi based volumes
- cstor operator [included task to restart ndm componets]
- [name=aman]
- Release activities with native-k8s pipeline.
- Hands-on with latest changes re: volume capacity roundoff [zfs]
-
- [name=giri]
- Review the Pull Requests
- Upgrade openebs on director-onprem cluster
## August 03rd, 2020
Attendees: Sathya, Aman, Shashank, Giri, Kiran
### Status Updates
- [shashank]
- Issue regading Gitlab job getting stuck at end of build. (Done)
- Refactor Jiva internal snapshot deletion to restart random replica along with leader (Done)
- Include Jiva snapshot delete experiment in pipeline (in-progress)
- [Aman]
- issue raised, zfs-operator is not applying on rancher (centos 7) cluster. will follow the fix for that
- almost done with cosmetic fixes and readme for zfslocalpv in test repo. Modifying the pipeline script for accomodating the changes.
- shared mount test is running with busybox, will manualy test once with wordpress and then we can replace busybox with wordpress.
- [sathya]
- Upgrade pipeline scripts for cspc pool and csi volumes. [ In progress ]
- Refactor the test scripts for CSI provisioner and cStor operator deployment test case [in review]
- Automate the test case to validate pool deletion is failed if the pool has the volumes [ done]
- Validate the csi provisioner changes in openshift and konvoy platform [done]
- Refactor the flaky test case ro threshold limit failure into openshift pipelines [in progress]
- Verify and automate the node failure scenario for csi based volumes [done]
- Automate the reusablity of block devices test case [in review]
- create SPC pool and delete the spc pool and reuse the blockdevices to create CSPC
- create LocalPV device remove the provision and reuse the bd for creating cspc
- Use a blockdevice to create LocalPV in ext4 filesystem remove it then use it for creating PV for xfs filesystem
- Automate the test case to verify the pool creation is failed when in already claimed bd [done]
## July 27th, 2020
Attendees: Sathya, Aman, Shashank, Giri, Kiran
### Status Updates
- [shashank]
- Bring up Openshift-4.5 [Done]
- Adding above platform in pipeline(includes taking of cluster snapshot, revert-script modification etc..) [Done]
- Refactor Jiva internal snapshot deletion to restart random replica along with leader. [In-progress]
- [sathya]
- Refactor the test scripts for CSI provisioner and cStor operator deployment test case [in review]
- Automate the test case to validate pool deletion is failed if the pool has the volumes [ in review]
- Validate the csi provisioner changes in openshift and konvoy platform [manual validation is in progress]
- Refactor the flaky test case ro threshold limit failure into openshift pipelines [in progress]
- Verify and automate the node failure scenario for csi based volumes [manual validation is in progress]
- Automate the reusablity of block devices test case [in progress]
- create SPC pool and delete the spc pool and reuse the blockdevices to create CSPC [ in review]
- create LocalPV device remove the provision and reuse the bd for creating cspc [ in progress]
- Use a blockdevice to create LocalPV in ext4 filesystem remove it then use it for creating PV for xfs filesystem [ pending]
- Automate the test case to verify the pool creation is failed when in already claimed bd [pending]
- [Aman]
- From previous week task; readme update for zfs-localpv test cases are partially done. but from cosmetic fixes side (zfs-LocalPV=>zfs-localpv) task is paused, as changes in directory path will need modification in all the plateforms for pipline script.
- shared mount for zfspv [Automation done & merged ; pipeline script in progress]
- Negative test case for localpv provisioning on selected device [PR in review].
## July 20th, 2020
Attendees: Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [shashank]
- Use coreOS-4.4.3 as host OS for Openshift-4.5 (done)
- Bring up Openshift-4.5 (in-progress)
- [sathya]
- Refactor the test scripts for CSI provisioner and cStor operator deployment test case [ in progress]
- Refactor the flaky test case ro threshold limit failure into openshift pipelines [ in progress]
- Refactor the test case for snapshot creation - flaky test [done]
- Automate the reusablity of block devices test case [pending]
- create SPC pool and delete the spc pool and reuse the blockdevices to create CSPC
- create LocalPV device remove the provision and reuse the bd for creating cspc
- Use a blockdevice to create LocalPV in ext4 filesystem remove it then use it for creating PV for xfs filesystem
- [Aman]
- add zfspv shared mount e2e test into pipeline and make sure its validation.
- zfspv validation on CentOS 8 based cluster.
- Cosmetic fixes like(zfs-LocalPV ===> zfs-localpv) in e2e-test repo as discussed in zfs syncup last friday.
- Update the master branch for e2e-nativek8s pipeline repo with READme's
- [Giri]
- Bring up konvoy 1.17.7 cluster For pipeline
## July 13th, 2020
Attendees: Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [Shashank]
- Stablization of e2e-pipelines (includes runner fixes)
- Bring up stable cluster of Openshift 4.2 [Done]
- [Aman]
- Running the native-k8s pipelines with release tags as well upgrade pipelines.
- Shared-mount support for zfspv [Manually verified, will automate this]
- [Giri]
- Replace d2iq pipeline cluster with Konvoy v1.4.4
- [sathya]
- Release activities
- Upgrade pipelines for RC2 tag [done]
- Release pipelines for RC2 tag openshift [done], konvoy [pending]
- Verify the changes for migration removed version details form cvc [in progress]
- Manually reusability of blockdevice [ in progress ]
- create SPC pool and delete the spc pool and reuse the blockdevices to create CSPC
- create LocalPV device remove the provision and reuse the bd for creating cspc
- Use a blockdevice to create PV in ext4 filesystem remove it then use it for creating PV for xfs filesystem
## July 6th, 2020
Attendees: Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [ shashank ]
- Automation Jiva internal snapshot deletion [Done]
- Enable test jiva inerenal snapshot deletion test in pipeline [in-progress]
- Adding csi based backup/restore job in konvoy-pipleine [Done]
- [sathya]
- Automate the CSPC pool and CSI volume Migration [in review]
- Automate the Upgrade for CSPC pool and CSI volume [ done]
- Bring up Openshift pipeline cluster for release 1.12 [done]
- Include below test cases into konvoy pipeline [in progress]
- csi volume target network delay
- cspc pool pod container kill
- Manually reusability of blockdevice [ in progress ]
- create SPC pool and delete the spc pool and reuse the blockdevices to create CSPC
- create LocalPV device remove the provision and reuse the bd for creating cspc
- Use a blockdevice to create PV in ext4 filesystem remove it then use it for creating PV for xfs filesystem
- [ Aman ]
- E2e test for b-tree fstype for zfspv
- E2e test for shared mount for zfspv volume
## June 29th, 2020
Attendees: Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [shashank]
- Adding csi based backup/restore job in konvoy-pipleine [Under-review]
- Adding support for backup/restore for csi volume in experiment. [Done]
- Test cases compatibily check with velero-1.4.0 in pipelines [Done]
- Jiva internal snapshot deletion [In-progress]
- [sathya]
- Automate the CSPC pool and CSI volume migration [in progress]
- Automate the Upgrade for CSPC pool and CSI volume [in review]
- Include below test cases into konvoy pipeline [in progress]
- csi volume target network delay
- cspc pool pod container kill
- cspc poo pod failure when the pool expansion is in progress
- Refactor the below flaky test cases
- OpenEBS provision [refactor to run with the custom images also] [done]
- SPC ro-threshold limit [done]
- Bring up Openshift pipeline cluster for release 1.12
- [Aman]
- zfspv-localpv tests validation on centOS 7 (konvoy pipeline cluster) [Done]
- Include below test into konvoy [Done]
- csi volume scaleup
- csi volume scaledown
## June 22nd, 2020
Attendees: Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [Shashank]
- Backup/Restore for CSI volume into the pipeline
- Existing Backup/Restore expermients compatibility check with velero-V1.4.0 in pipelines.
- Experiment for Jiva internal snapshot deletion.
- [sathya]
- Automate the CSPC pool and CSI volume migration
- Automate the Upgrade for CSPC pool and CSI volume
- Refactor the below flaky test cases in pipeline
- OpenEBS provision [refactor to run with the custom images also]
- SPC ro threshold limit
- [Aman]
- Validation of zfs-localPV tests on CentOS based cluster (Konvoy-cluster with CentOS Linux 7)
## June 15th, 2020
Attendees: Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [sathya]
- Automate the CSPC and CSI volume upgrade (manual verifications done)
- Automate the SPC to CSPC and non-csi to csi volume migration (manuall verifications done)
- Automate the test case for container kill(cstor-pool and cstor-pool-mgmt) during disk replacement (in-progress)
## June 8th, 2020
Attendees: Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [shashank]
- Deletion of failed backup (in progress)
- Addition of new backup/restore experiments in pipeline (done)
- [sathya]
- Automate the pool pod failure when the pool expansion is in progress (in review)
- Automate the test case for container kill during disk replacement (in-progress)
- Manually verify and Automate the CSPC and CSI volume upgrade
- [Giri]
- Review the PRs.
- Revamp the clusters for pipelines after migrating them into different environment.
- [Aman]
- Complete the automation part for raw block volume support (work in progress)
## June 1st, 2020
Attendees: Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [shashank]
- Restore remote backup in different namespace (Done)
- Backup/Restore using different S3 profile (under review)
- Deletion of failed backup (in-progress)
- AWS infra automation for release pipeline (Done)
- Bring up Openshift pipeline cluster for release 1.11 (in-progress)
- [sathya]
- Automate the pool pod failure when the pool expansion is in progress (in progress)
- Automate the test case for container kill during disk replacement (in-progress)
- Automate the pool pod container failure and verify the application behaviour (in review)
- Included a upgrade test scripts for aws platform (migrte the upgrade pipeline to gke to aws) (in review)
- [Giri]
- Migrate the konvoy and kubeadm pipeline clusters into different hosts (Done)
- Explore rancher architecture for setting up feature-gates (Done)
- Litmus experiment to set resource limit and request for cstor target pods (in-progress)
- [Aman]
- Automation of zfs-custom-topology support is (done). Pipeline scripts PR is there, needs to be merge.
- Same status (^^^) for e2e of restarting docker/kubelet services with zfspv test.
- Raw block volume support. (Automation is pending, in-progress)
## May 18th, 2020
Attendees: Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [Giri]
- Litmus experiment to set resource limit and request for cstor target pods
- [Shashank]
- Backup/Restore experiment to restore the backup in different namespace from remote location. (DONE)
- Backup/Restore when CVR's are unhealthy.
- [Aman]
- Custom-topology support for zfspv (automation in progress)
- Raw block volume support (Need more exploration to automate)
- Find work-aroud and automate docker/kubelet restart infra-chaos for zfspv into pipeline (automation is stucked because of not having the root access to nodes in pipeline)
- [sathya]
- Automate the pool pod failure when the pool expansion is in progress
- Automate the pool pod container failure and verify the application behaviour
## May 11th, 2020
Attendees: Somesh, Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [Somesh]
- [Shashank]
- Docs regarding gitlab migration.(Done)
- Basic infra and pipeline fix.(Done)
- Redirection to new openshift cluster for openshift pipeline.(Done)
- [Sathya]
- Litmus for introducing choas (pod kill) during disk replacement (Done)
- Automated the provision of cstor operator (As part of v1 api versio) (Done)
- Modified the openebs provision, cspc pool provision, day2ops (pool expansion, pool scaleup) as the change in cstor operator api version (Done)
- Litmus for introducing choas (container kill) during disk replacement (in-progress)
- [Giri]
- Exploring the way of using pumba for chaos injection in rancher environment.(in-progress)
- Litmus book for creating cstor csi snapshot class.(Done)
- Litmus book for clone creation based on cstor snapshot(in-progress)
- [Aman]
- Upgrade-pipeline for zfs-locapv (Done & Running)
- Added zfs-controller in HA (done)
- Able to get grafana dashboard up & running (in-progress)
- Custom-topology support (in-progress)
- raw block volume support (manually done, automation in-progress)
## May 4th, 2020
Attendees: Somesh, Shashank, Sathya, Giri, Aman, Kiran
### Status Updates
- [Somesh]
- Litmus for Jiva logging ( done )
- Litmus for clean-up job policy ( done )
- Litmus for pre-load changes (started)
- Setup Jiva application on workload cluster with custom jiva pool (not started)
- Presentation on how jiva volumes work (not started)
- [Shashank]
- GitLab migration to native K8s. Fixed issues (done)
- Rebuild OpenShift cluster (in-progress)
- Review the OpenShift cluster documentation ( in-progress )
- [Sathya]
- Litmus for introducing choas (pod kill, container kill) during disk replacement (in-progress)
- Automated the bulk upgrade of cstor/jiva (done)
- [Giri]
- Litmus experiment for performing node failure where cstor volumes are mounted. Applying the workaround to delete the node CR and check its availability. ( done )
- Litmus experiment for provisioning local PV on selected/tagged block devices. ( done )
- Workload cluster was recreated with ubuntu 18.04 and k8s v1.18.2
- [Aman]
- Automating upgrade for ZFS Local PV ( in-progress )
- Litmus for ZFS CSI Driver HA (in-progress)
### Discussion Topics
- [Infra Updates]
- On-prem
- GitLab Server
- ZFS Local PV Native K8s Pipeline
- OpenShift stable pipeline
- Konvoy stable pipeline
- Director
- GCP
- GitLab Runners using MicroVMs (Pipeline Executor)
- GKE
- Upgrade pipelines
- Packet
- Dev pipelines on 3 different K8s versions