-
Notifications
You must be signed in to change notification settings - Fork 498
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Initial Conformance Tests for Gateway API #969
Initial Conformance Tests for Gateway API #969
Conversation
Skipping CI for Draft Pull Request. |
c7f499c
to
3653c0e
Compare
f757859
to
98d4822
Compare
563e31c
to
529a279
Compare
529a279
to
92e5e07
Compare
At long last, I think these tests are ready for full review. They line up with the earlier conformance related GEPs and cover the following:
|
Adding a hold because this a big PR and want to have at least a couple LGTMs before merging: /hold |
return nil, nil, dumpErr | ||
} | ||
|
||
fmt.Printf("Received response:\n%s\n\n", formatDump(dump, "< ")) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: I'd be a bit happier if we could use a different prefix for requests and responses. Easier on the eyes.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think that's already the case? One is "Sending Request: ...", and this is "Received Response: ...", any changes you'd recommend?
conformance/manifests/base.yaml
Outdated
apiVersion: gateway.networking.k8s.io/v1alpha2 | ||
kind: Gateway | ||
metadata: | ||
name: gateway-same-namespace |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Having more specific names with some semantic meaning like "shared-infra-gateway" would be better, especially as more and more tests use this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've added some comments at the top of this file describing the key resources, but unfortunately don't have great ideas for how to make the names of the Gateways more descriptive right now. At this point, the only real variation between them is where they accept routes to bind from:
- gateway-same-namespace (only supports route in same ns)
- gateway-all-namespaces (supports routes in all ns)
- gateway-backend-namespaces (supports routes in ns with backend label)
In the future when we add resources with different protocols, multiple listeners, etc, naming may become a bit more clear. Open to ideas though.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I know it's more verbose, but does it make sense to repeat the YAML across test cases (i.e. make each case hermetic) rather than trying to deduplicate? I know it's kind of a pain, but sharing the setup can result in coupling between test cases and the creation of a complex "base" case that makes it harder to understand the relationship later on.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah it's going to be tough to find the right balance here. For any route to work, it needs the following:
- Namespace
- Gateway
- Service
- Pods
Spinning all of those up and down for every test would quickly become quite time consuming. Alternatively if we were to leave them running indefinitely, we'd need each test case to be sure that it wasn't overlapping with resources defined by other test cases. This would also result in rather verbose yaml definitions for each test case. A final argument in favor of this shared set of base resources is that it enables conformance tests to be meaningful for implementations that are not currently able to support dynamically provisioning Gateways (this generally means there's a 1:1 relationship between Gateway and controller).
With all that said, more isolated tests would enable us to run tests in parallel which could significantly speed up a test run when there were a large number of tests in play. It would also be easier to understand each individual test case than our current approach that requires an understanding of the base resources.
I think we'll inevitably want to move towards a test suite that has more isolated test definitions, but fortunately that should be a fairly straightforward transition from this current state. I think that the current proposal is an acceptable starting point given the constraints above, but that we should try to avoid expanding the base set of resources, and in the future add test cases that are not entirely dependent on them.
92e5e07
to
e6f52dc
Compare
bfe48f8
to
acd9597
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Once other people's comments are resolved this looks like a great place to start for me. We'll be adding these to our test suite soon, the only thing I was really concerned about going into this was making sure there wasn't any friction for testing "unmanaged mode" (existing gateways) but I don't think what's here will cause any problem with that.
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: robscott, shaneutt The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
847fbc5
to
bf64c67
Compare
// Setup ensures the base resources required for conformance tests are installed | ||
// in the cluster. It also ensures that all relevant resources are ready. | ||
func (suite *ConformanceTestSuite) Setup(t *testing.T) { | ||
suite.ControllerName = kubernetes.GWCMustBeAccepted(t, suite.Client, suite.GatewayClassName, 180) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This can fail and doesn't give any feedback on what it is checking for. Suggest that every function that takes a testing.T should internally do a t.Run to give contextual feedback.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I tried t.Run
first, but that didn't feel right because these items are not really tests as much as they are setup steps. (The GatewayClass status == accepted is relatively close to a test, but the rest are not). Instead I've added logging before each step, which does seem to make the output much clearer here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I didn't look at all the code but I think we can iterate here.
I'm comfortable merging.
@@ -0,0 +1,154 @@ | |||
/* | |||
Copyright 2022 The Kubernetes Authors. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd recommend starting with a single conformance/utils/http
package which includes this file.
Any reason to have so manage packages upfront?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since this is an interface that implementations can override, I've wanted to keep it as generic and possible and leave the door open for at least TCP. That's a ways out though, and I'm not really sure how we'd structure that. The idea was to separate the interface + default implementation from HTTP specific helper funcs.
if !apierrors.IsNotFound(err) { | ||
require.NoErrorf(t, err, "error getting resource") | ||
} | ||
t.Logf("Creating %s %s", uObj.GetName(), uObj.GetKind()) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Add a similar log line before c.Get() on line 62.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm a little hesitant to do that because our test output is already fairly verbose here. Here's the output I'm currently getting:
suite.go:56: Test Setup: Ensuring GatewayClass has been accepted
suite.go:59: Test Setup: Applying base manifests
apply.go:81: Updating gateway-conformance-infra Namespace
apply.go:81: Updating same-namespace Gateway
apply.go:81: Updating all-namespaces Gateway
apply.go:81: Updating backend-namespaces Gateway
apply.go:81: Updating infra-backend-v1 Service
apply.go:81: Updating infra-backend-v1 Deployment
apply.go:81: Updating infra-backend-v2 Service
apply.go:81: Updating infra-backend-v2 Deployment
apply.go:81: Updating gateway-conformance-app-backend Namespace
apply.go:81: Updating app-backend-v1 Service
apply.go:81: Updating app-backend-v1 Deployment
apply.go:81: Updating app-backend-v2 Service
apply.go:81: Updating app-backend-v2 Deployment
apply.go:81: Updating gateway-conformance-web-backend Namespace
apply.go:81: Updating web-backend Service
apply.go:81: Updating web-backend Deployment
suite.go:62: Test Setup: Ensuring Gateways and Pods from base manifests are ready
Sometimes there's a mix of Updating
and Creating
depending on if I've enabled cleanup or added new resources to the base manifests. In either case, adding a Getting
before each Updating
or Creating
line may add more confusion or noise than it's worth. I think it could be helpful to log before the first API call we're doing in case of a timeout or other connection issue, but that's actually happening earlier in the test flow. Do you think there's a better way to log this output as a whole?
bf64c67
to
4efae19
Compare
Thanks for all the great feedback on this @hbagdi, @howardjohn, @jpeach, and anyone else who's commented. I think I've resolved everything and this should be good to go now, PTAL. |
/lgtm |
4efae19
to
38e5f1d
Compare
@robscott Linter unhappy. |
38e5f1d
to
1c6c178
Compare
Woops, fixed the failing test, thanks for catching that @hbagdi! Removing the hold so next LGTM will result in a merge. /hold cancel |
/lgtm |
What type of PR is this?
/kind feature
What this PR does / why we need it:
This PR adds a very basic initial framework for conformance tests. Although there is still plenty of room for both improvement and expansion, I want to keep this initial starting point relatively small and straightforward. This PR is ready for full review, I'm especially interested in feedback on the general structure and concepts in this PR though.
Does this PR introduce a user-facing change?: