api: named filter config / filter config discovery service #7867

mattklein123 · 2019-08-08T15:18:47Z

I would like the ability to have individual filter configurations be named and fetchable from a discrete management server. This would allow federation of filter configuration from multiple sources, for example fault testing. Thought needs to be put into how this plays with FCDS for listener filters, draining, whether filters can swap config atomically via a new optional interface, etc.

If we do this right we should not need the TapDS IMO.

cc @envoyproxy/api-shepherds @htuch @yuval-k

lambdai · 2019-08-08T17:24:44Z

individual configurations is the filter chain, correct?

htuch · 2019-09-23T19:18:04Z

@mattklein123 do you think this is needed for 1.12.0 or can we do this as a v3 add-on once shipped?

mattklein123 · 2019-09-23T19:58:01Z

@mattklein123 do you think this is needed for 1.12.0 or can we do this as a v3 add-on once shipped?

We can do it as an add on, but can we do it without any structural changes? I suppose we probably can by just having an optional config source on a filter config? I'm mostly just hoping to avoid deprecation. We might want to look at it briefly and at least put the existing static config in a oneof?

htuch · 2019-09-24T21:26:46Z

Sure, I think this aspect of the issue is manageable in the v3 release cycle.

rshriram · 2019-09-26T02:39:11Z

adding another use case that I came across

Certain filters like RBAC etc. have the access control rules [something that changes dynamically] embedded inside the filter as static configuration. When access control rules change, we end up sending an update to the listeners with the updated filter config. This causes all existing connections to drain. A more efficient thing to do would be to reload the rbac filter config alone, and force the rbac filter instances to run some event handler [e.g., onConfigUpdate], wherein they can reevaluate existing connections to check if they still pass the rbac rules.

mattklein123 · 2019-09-26T16:02:01Z

@rshriram do you really need an event? I think this is possible but can't RBAC just run on new connections? If you want an event can we track as a separate issue?

rshriram · 2019-09-26T16:07:01Z

So, think about HTTP connections.. in this case, new rbac would have to become effective on new requests on the same connection right? Otherwise, people will end up forcing listener drains for the newer rules to take effect. I suggested the event thing as just one idea but as long as there is an ability to apply the newer config on existing connections, I am good.

This would apply to faults as well wont it? like you could pick one h2 stream, introduce some faults and then remove those faults, to see how the client withstands these intermittent failures. Wont this be helpful in mobile use cases?

mattklein123 · 2019-09-26T16:12:19Z

This would apply to faults as well wont it? like you could pick one h2 stream, introduce some faults and then remove those faults, to see how the client withstands these intermittent failures. Wont this be helpful in mobile use cases?

We explicitly want this for fault testing. :)

Basically, at the network level, any new connection will get the new config. At the HTTP level, any new stream will get the new configuration. I think this is sufficient?

lambdai · 2019-10-02T20:55:13Z

It's fairly easy for the new connection to get the new config.

The goal of new http stream get new configuration can be achieve by above if we drain listener/filter. But it's not smooth to achieve it without using new connection. I will start to think about it when the first step is done.

mattklein123 · 2019-10-03T14:24:49Z

The goal of new http stream get new configuration can be achieve by above if we drain listener/filter. But it's not smooth to achieve it without using new connection. I will start to think about it when the first step is done.

I think this is actually not that hard and we can do this without any draining. Essentially we just need to have a layer of indirection between the filter chain and the actual filter config, such that the filter config itself can be snapped by the connection when it is created. I'm very happy to work with whoever wants to implement this on the design. Note also that we need this at Lyft so if no one gets to it first @wgallagher is likely to implement.

lambdai · 2019-10-07T21:49:39Z

Essentially we just need to have a layer of indirection between the filter chain and the actual filter config
Are you saying the filter chain from LDS doesn't mutate? Just do some extra calculation when applying to newly created http stream? This seems easier. We already support similar calculation when applying network filter at ActiveTcpListener::newConnection

What is the missing part?
Should the similar delayed calculation apply to http_conn_manager?
Support async applier by giving it a provider and provider is supposed to fetch the latest change?

mattklein123 · 2019-10-08T02:57:18Z

Are you saying the filter chain from LDS doesn't mutate? Just do some extra calculation when applying to newly created http stream?

Correct that's what I'm thinking.

What is the missing part? Should the similar delayed calculation apply to http_conn_manager? Support async applier by giving it a provider and provider is supposed to fetch the latest change?

I haven't thought it through fully (we will need to iterate on a design) but I think we move the the filter configuration into a oneof that either is defined inline statically or comes from a config source (cc @htuch because we want to move the static filter config into a oneof for v3 to future proof it). Once we have a config source, I think we can have the owner of the filter chain starting fetching for dynamic updates with warming, etc. and then when a new connection (for L4) or a new stream (for L7) is created, we can snap the correct config for the filter.

htuch · 2019-10-08T17:55:28Z

Yep, moving to a ConfigSource is aligned with what we want in UDPA. Let's promote to a oneof for v3.

yxue · 2019-11-02T01:04:48Z

@PiotrSikora @jplevyak I am wondering if we can use the filter config discovery service for the WASM. WASM can fetch the code from some management server instead of using inline code.

kyessenov · 2019-11-15T23:31:32Z

Envoy position is that config is trusted and user supplied. Shipping code through config server may compromise the security of the data plane. There is also a high-risk of head-of-line blocking for large (>10MB Wasm modules) in ADS.

htuch · 2019-11-17T22:55:54Z

@kyessenov I think we don't want to be too prescriptive here; folks can use some of these features with caveats around not pushing large updates inline. It's no different shipping 10MB+ WASM or 10k+ filter chains, each with a 1KB certificate inline.

kyessenov · 2019-11-26T23:50:30Z

If we do end-up using this for Wasm, it would be great to be able to add filters, not just update existing filter configs. Maybe using some sort of stubs if Wasm filter is not yet needed (e.g. empty filter config should be allowed).

kyessenov · 2019-12-04T21:44:05Z

Just copying thoughts from the linked issue:

Can we please make filter config payload be a list of filters? It's more flexible, doesn't restrict to a single filter type, and allows empty lists.
Consider changing how filters are ordered. Maybe it's better to have a weighted map of filters that are sorted before application? That way we can insert the filter config at any place.

htuch · 2019-12-05T00:20:42Z

Why a list when the set of resources return by xDS is already a list? I worry that a list of lists is a bit confusing to reconcile.

kyessenov · 2019-12-05T00:40:20Z

List is used to 1) enforce ordering of filters in a "patch", 2) allow flexible number of filters to be returned. Think of Wasm filters. We can either add 10 dynamic filter configs with some unset, or have one filter config group with 0..10 Wasm filters in it.

htuch · 2019-12-05T16:31:52Z

I think this is a conflation of concerns. We have one concern that is managing which filters are in the filter chain and their relative order, and another that is how individual filters are configured. Ideally we can provide config independently for each of these, and potentially federate the responsibilities to distinct parties.

I would like to see a filter config discover service that just supplies config for a named filter. The existing Filter Chain Discovery Service is providing the list of filters and relative order. If that needs to be further decomposed, I think a distinct design proposal is needed.

mandarjog · 2019-12-20T01:57:37Z

+1 for fail-open and fail-closed, we have been discussing this in the WASM context. For example if a telemetry WASM filter fails to load you would still want to declare success and move on (and increment a metric for bad configs).

Using a URL fetcher to load WASM code has the same issue. The listener will go in "warming" state, with no way to notify the control plane.
So the ask here specifically is to be able send back a NACK after a listener enters warming state, or after config is accepted from FCDS. Is this possible ?

mattklein123 · 2019-12-20T16:09:48Z

So the ask here specifically is to be able send back a NACK after a listener enters warming state, or after config is accepted from FCDS. Is this possible ?

This is not possible today and will require a bunch of thinking to figure out the right way to do it, but it's certainly possible and something we should do. In the near term, the hacky way of dealing with this is to just delete the listener with a stat, and then have it re-added in the next go around. Though, this won't work with incremental.

mandarjog · 2019-12-21T05:31:07Z

Ok, since ETA on async NACK is unclear I think it is better to use “local file” load when using WASM so that bad wasm module or “module not found” can be rejected synchronously with a NACK. Wdyt @mattklein123 ?
I think this is generally true for any other interaction of this type.

mattklein123 · 2019-12-23T16:26:53Z

Wdyt @mattklein123 ? I think this is generally true for any other interaction of this type.

I think it really depends on the deployment. IMO fail open with a default config will work in many cases, especially since with eventually consistency the config should eventually get loaded later when things are fixed.

mattklein123 · 2019-12-26T23:29:20Z

I'm moving this out to 1.14.0, but we still need to move the static filter config into a oneof for v3. cc @lizan @htuch to track the automation around this.

htuch · 2019-12-27T21:10:00Z

@mattklein123 looking at filter config, today, we have:

  // Filter specific configuration which depends on the filter being
  // instantiated. See the supported filters for further documentation.
  oneof config_type {
    google.protobuf.Struct config = 2 [deprecated = true];

    google.protobuf.Any typed_config = 4;
  }

Do we need a new oneof, or could the dynamic ConfigSource just live under config_type?

mattklein123 · 2019-12-29T18:17:01Z

Do we need a new oneof, or could the dynamic ConfigSource just live under config_type?

Oops yeah sorry I forget we already have a oneof here from the config -> typed_config conversion. Yeah this oneof should be fine. Sorry for the runaround.

Define filter config discovery. Add FDS for HTTP filters (HTTP extensions is where the pain is felt the most). Modelled after RDS with a twist of config override for re-use. Risk Level: low (not implemented) Testing: Docs Changes: Release Notes: Issue: #7867 Signed-off-by: Kuat Yessenov <kuat@google.com>

Define filter config discovery. Add FDS for HTTP filters (HTTP extensions is where the pain is felt the most). Modelled after RDS with a twist of config override for re-use. Risk Level: low (not implemented) Testing: Docs Changes: Release Notes: Issue: envoyproxy#7867 Signed-off-by: Kuat Yessenov <kuat@google.com>

Define filter config discovery. Add FDS for HTTP filters (HTTP extensions is where the pain is felt the most). Modelled after RDS with a twist of config override for re-use. Risk Level: low (not implemented) Testing: Docs Changes: Release Notes: Issue: envoyproxy#7867 Signed-off-by: Kuat Yessenov <kuat@google.com> Signed-off-by: yashwant121 <yadavyashwant36@gmail.com>

mattklein123 added api/v3 Major version release @ end of Q3 2019 no stalebot Disables stalebot from closing an issue labels Aug 8, 2019

rshriram mentioned this issue Aug 8, 2019

named filter chains #7870

Closed

lambdai mentioned this issue Aug 20, 2019

api: add name into filter chain #7966

Merged

mattklein123 added this to the 1.12.0 milestone Sep 4, 2019

mattklein123 mentioned this issue Sep 25, 2019

Dynamic reload of filter config without listener drain #8364

Closed

htuch modified the milestones: 1.12.0, 1.13.0 Oct 16, 2019

mattklein123 mentioned this issue Dec 4, 2019

xDS resource patching #8400

Open

mandarjog mentioned this issue Dec 21, 2019

API request: xds async NACK #9447

Open

mattklein123 modified the milestones: 1.13.0, 1.14.0 Dec 26, 2019

htuch mentioned this issue Dec 26, 2019

oneof upgrade for filter config #9500

Closed

mattklein123 modified the milestones: 1.14.0, 1.15.0 Mar 18, 2020

mattklein123 mentioned this issue Apr 15, 2020

Allow reading filter config from disk #5237

Closed

mattklein123 mentioned this issue Apr 23, 2020

config: update protos for TTL #10898

Closed

This was referenced Jun 4, 2020

Apply Lua filter will close existed http/2 long live connection #11436

Closed

Proposal: add config blob discovery #11547

Open

kyessenov mentioned this issue Jun 12, 2020

api: add filter config discovery #11571

Merged

mattklein123 modified the milestones: 1.15.0, 1.16.0 Jun 24, 2020

kyessenov mentioned this issue Jun 30, 2020

xds: implement extension config discovery for HCM #11826

Merged

3 tasks

yangminzhu mentioned this issue Jul 13, 2020

rbac: utilize ECDS to optimize listener draining istio/istio#25484

Closed

mattklein123 closed this as completed in #11826 Jul 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

api: named filter config / filter config discovery service #7867

api: named filter config / filter config discovery service #7867

mattklein123 commented Aug 8, 2019

lambdai commented Aug 8, 2019

htuch commented Sep 23, 2019

mattklein123 commented Sep 23, 2019

htuch commented Sep 24, 2019

rshriram commented Sep 26, 2019

mattklein123 commented Sep 26, 2019

rshriram commented Sep 26, 2019

mattklein123 commented Sep 26, 2019

lambdai commented Oct 2, 2019

mattklein123 commented Oct 3, 2019

lambdai commented Oct 7, 2019

mattklein123 commented Oct 8, 2019

htuch commented Oct 8, 2019

yxue commented Nov 2, 2019

kyessenov commented Nov 15, 2019

htuch commented Nov 17, 2019

kyessenov commented Nov 26, 2019

kyessenov commented Dec 4, 2019

htuch commented Dec 5, 2019

kyessenov commented Dec 5, 2019

htuch commented Dec 5, 2019

mandarjog commented Dec 20, 2019

mattklein123 commented Dec 20, 2019

mandarjog commented Dec 21, 2019

mattklein123 commented Dec 23, 2019

mattklein123 commented Dec 26, 2019

htuch commented Dec 27, 2019

mattklein123 commented Dec 29, 2019

api: named filter config / filter config discovery service #7867

api: named filter config / filter config discovery service #7867

Comments

mattklein123 commented Aug 8, 2019

lambdai commented Aug 8, 2019

htuch commented Sep 23, 2019

mattklein123 commented Sep 23, 2019

htuch commented Sep 24, 2019

rshriram commented Sep 26, 2019

mattklein123 commented Sep 26, 2019

rshriram commented Sep 26, 2019

mattklein123 commented Sep 26, 2019

lambdai commented Oct 2, 2019

mattklein123 commented Oct 3, 2019

lambdai commented Oct 7, 2019

mattklein123 commented Oct 8, 2019

htuch commented Oct 8, 2019

yxue commented Nov 2, 2019

kyessenov commented Nov 15, 2019

htuch commented Nov 17, 2019

kyessenov commented Nov 26, 2019

kyessenov commented Dec 4, 2019

htuch commented Dec 5, 2019

kyessenov commented Dec 5, 2019

htuch commented Dec 5, 2019

mandarjog commented Dec 20, 2019

mattklein123 commented Dec 20, 2019

mandarjog commented Dec 21, 2019

mattklein123 commented Dec 23, 2019

mattklein123 commented Dec 26, 2019

htuch commented Dec 27, 2019

mattklein123 commented Dec 29, 2019