feat: rate limiting based on upstream replicas #4447

Open

s0uky opened this issue Oct 14, 2024 · 9 comments

s0uky commented Oct 14, 2024

Description:
It would be incredibly beneficial if rate limits could be dynamically adjusted based on the number of replicas of the upstream service/application. This would be a powerful approach for applications that combine autoscaling through KEDA (https://keda.sh/) with rate limiting.

E.g.:
Deployment of 3 replicas of the App.
Rate limits for the App are 6 req/sec (e.g., per client).
I change (or KEDA changes) the number of replicas of the App from 3 to 6, and it would be great if the rate limiting pod reflected this change and adjusted the rate limit to 12 req/sec (default rate limit × deployment replicas).
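
A minimal sketch of the scaling rule described in this example, assuming the configured 6 req/sec was sized for the 3-replica baseline; the constants and function name are purely illustrative:

```python
# Hypothetical scaling rule: treat the configured limit (6 req/sec at
# 3 replicas) as 2 req/sec per replica and scale linearly with the
# current replica count.
BASE_LIMIT = 6       # req/sec configured for the baseline deployment
BASE_REPLICAS = 3    # replica count the base limit was sized for

def effective_rate_limit(current_replicas: int) -> int:
    """Total req/sec the rate limiter should allow at the current scale."""
    per_replica = BASE_LIMIT / BASE_REPLICAS
    return int(per_replica * current_replicas)

assert effective_rate_limit(3) == 6
assert effective_rate_limit(6) == 12
```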

zhaohuabing (Member) commented Oct 15, 2024

@s0uky The relationship between the number of application replicas and the rate limit bucket seems more like application-level logic. Would it be possible to create a "controller" that modifies the limit in the BackendTrafficPolicy to reflect the application scaling?

s0uky (Author) commented Oct 15, 2024

@zhaohuabing I'm having difficulty determining the appropriate location for this controller. It seems likely that this controller could be a component of the Gateway itself; in that case, it would be responsible for defining the name of the Deployment to monitor for replicas and for modifying the limit dynamically in the BackendTrafficPolicy.

arkodg (Contributor) commented Oct 15, 2024

The use case sounds like Local Rate Limit: https://gateway.envoyproxy.io/docs/tasks/traffic/local-rate-limit/

zirain (Contributor) commented Oct 16, 2024

I don't think so. Local rate limit is based on the replicas of the gateway; what @s0uky wants is basing the limit on the upstream's replicas.

arkodg (Contributor) commented Oct 16, 2024

Ah, thanks for the clarification.

zhaohuabing (Member) commented Oct 16, 2024

> @zhaohuabing I'm having difficulty determining the appropriate location for this controller.

Hi @s0uky, you can deploy your controller alongside EG to handle this.

s0uky (Author) commented Oct 18, 2024

@zhaohuabing We have to decide whether to create this controller. If we do, could we submit a pull request to the upstream Envoy Gateway project, or does that not make sense to you?

zhaohuabing (Member) commented

@s0uky While I agree that dynamically adjusting the rate limit based on the number of replicas of the upstream service/application is a valuable feature, I'm not entirely sure it should be implemented directly in Envoy Gateway. Beyond the replica count, there are also scenarios where rate limiting could be driven by other application-level metrics, such as:

  • Resource scaling up (CPU, Memory, etc.)
  • Request latency
  • Failed request ratio
  • ....

I would suggest creating a controller to run alongside EG to adjust the limit in the BTP based on those application metrics. @envoyproxy/gateway-maintainers please chime in.
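
A minimal sketch of such a sidecar controller, in Python with the official kubernetes client, is shown below. The Deployment and BackendTrafficPolicy names and the per-replica limit are hypothetical, and the exact rate-limit field layout in the BTP should be verified against the gateway.envoyproxy.io/v1alpha1 API in use:

```python
# Sketch: watch the upstream Deployment's replica count and patch the
# rate limit in the matching BackendTrafficPolicy (BTP) accordingly.
from kubernetes import client, config, watch

NAMESPACE = "default"
DEPLOYMENT = "my-app"             # upstream Deployment to track (hypothetical)
BTP_NAME = "my-app-ratelimit"     # BackendTrafficPolicy to patch (hypothetical)
PER_REPLICA_LIMIT = 2             # req/sec granted per upstream replica

def patch_rate_limit(custom_api: client.CustomObjectsApi, replicas: int) -> None:
    """Merge-patch the BTP so the global limit scales with the replica count.

    Note: a merge patch replaces the whole rules list, so a real controller
    would read the existing policy and rewrite only the relevant rule.
    """
    body = {
        "spec": {
            "rateLimit": {
                "type": "Global",
                "global": {
                    "rules": [
                        {
                            "limit": {
                                "requests": PER_REPLICA_LIMIT * replicas,
                                "unit": "Second",
                            }
                        }
                    ]
                },
            }
        }
    }
    custom_api.patch_namespaced_custom_object(
        group="gateway.envoyproxy.io",
        version="v1alpha1",
        namespace=NAMESPACE,
        plural="backendtrafficpolicies",
        name=BTP_NAME,
        body=body,
    )

def main() -> None:
    config.load_incluster_config()  # or config.load_kube_config() outside the cluster
    apps = client.AppsV1Api()
    custom = client.CustomObjectsApi()
    last_replicas = None
    # Watch only the Deployment we care about and react to replica changes.
    for event in watch.Watch().stream(
        apps.list_namespaced_deployment,
        namespace=NAMESPACE,
        field_selector=f"metadata.name={DEPLOYMENT}",
    ):
        replicas = event["object"].spec.replicas or 0
        if replicas != last_replicas:
            patch_rate_limit(custom, replicas)
            last_replicas = replicas

if __name__ == "__main__":
    main()
```

The same loop could watch a KEDA ScaledObject or any other metric source instead of the Deployment; the key point is that the reconciliation logic lives outside Envoy Gateway, as suggested above.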

arkodg (Contributor) commented Oct 21, 2024

Agree with @zhaohuabing; this is hard to generalize within EG, since it's a use-case-specific combination of dynamic rate limiting and load-balancing settings based on upstream replicas.
