You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As a stateless service we expect the operator to be resilient to downtime, but we don't currently test for it. We should at least have a sanity test around correctness of what operator is doing when being randomly restarted. Somewhat related to general chaos testing.
The text was updated successfully, but these errors were encountered:
We should create a separate test pipeline where we would randomly delete the operator pod while the entire E2E test suite is running. This way we don't introduce noise into existing pipelines and we have a good coverage of different test scenarios. We should log and preserve timestamps when the operator pod is being removed for potential issues investigation.
As a stateless service we expect the operator to be resilient to downtime, but we don't currently test for it. We should at least have a sanity test around correctness of what operator is doing when being randomly restarted. Somewhat related to general chaos testing.
The text was updated successfully, but these errors were encountered: