enhancement(vector source): implement client connection limits for grpc server #21072
Conversation
Thanks for opening this to start a conversation @fpytloun ! It's an impressive attempt given you mentioned you have little Rust experience.
Unfortunately, I'm not sure if this is how we'd want to go about solving this because it applies the limits globally rather than per client.
I'm also not sure that the client is guaranteed to close the connection if it sees a resource exhausted error.
I think what we really want is for hyperium/tonic#1428 to be implemented (there is a PR as of last week: hyperium/tonic#1865). If that were added to tonic, we could expose it in Vector.
Granted, that only covers connection age and not number of requests, but I imagine that might suffice for forcing clients to rebalance. What do you think? Should we just wait for that to be added to tonic?
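For context, a rough sketch of what exposing that could look like if hyperium/tonic#1865 lands; the `max_connection_age` builder method is assumed from that PR and is not part of tonic at the time of this discussion:

```rust
use std::time::Duration;
use tonic::transport::Server;

// Sketch only: `max_connection_age` is assumed from hyperium/tonic#1865;
// Vector could map a source config option onto this builder setting.
fn builder_with_age_limit() -> Server {
    Server::builder().max_connection_age(Duration::from_secs(300))
}
```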
```rust
#[derive(Clone)]
pub struct ConnectionLimit<S> {
    inner: S,
    request_count: Arc<Mutex<usize>>,
```
I think an atomic could be used here instead: https://doc.rust-lang.org/std/sync/atomic/struct.AtomicUsize.html
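A minimal sketch of that suggestion, assuming the counter only ever increments (the `check_and_count` helper is hypothetical, not from this PR):

```rust
use std::sync::{
    atomic::{AtomicUsize, Ordering},
    Arc,
};

// Hypothetical sketch: a lock-free counter in place of Arc<Mutex<usize>>.
struct ConnectionLimit {
    request_count: Arc<AtomicUsize>,
    max_requests: usize,
}

impl ConnectionLimit {
    // Counts this request and reports whether it is still within the limit.
    fn check_and_count(&self) -> bool {
        // fetch_add returns the previous value; Relaxed ordering suffices for
        // a plain counter with no ordering dependency on other data.
        self.request_count.fetch_add(1, Ordering::Relaxed) < self.max_requests
    }
}

fn main() {
    let limit = ConnectionLimit {
        request_count: Arc::new(AtomicUsize::new(0)),
        max_requests: 2,
    };
    assert!(limit.check_and_count()); // 1st request: allowed
    assert!(limit.check_and_count()); // 2nd request: allowed
    assert!(!limit.check_and_count()); // 3rd request: over the limit
}
```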
```rust
request_count: Arc::new(Mutex::new(0)),
max_requests: max_requests,
max_duration: max_duration,
start_time: Instant::now(),
```
Nit: I think it might be better to initialize the `start_time` in the `call` method since this is closer to when it would be "running".
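A minimal sketch of that nit, assuming `start_time` is changed to an `Option<Instant>` filled in on the first request (a hypothetical restructuring of the PR's field):

```rust
use std::sync::{Arc, Mutex};
use std::time::{Duration, Instant};

// Hypothetical sketch: the clock starts on the first call, not at construction.
struct ConnectionLimit {
    start_time: Arc<Mutex<Option<Instant>>>,
}

impl ConnectionLimit {
    // Would be invoked from `call`; sets start_time on the first request.
    fn elapsed(&self) -> Duration {
        let start = *self
            .start_time
            .lock()
            .unwrap()
            .get_or_insert_with(Instant::now);
        start.elapsed()
    }
}
```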
```rust
    }
}

impl<S> Service<Request<Body>> for ConnectionLimit<S>
```
I think this would end up applying one duration and request limit across all clients. In an ideal world, I think we would apply the limits per-client (that is per socket).
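For illustration, a rough sketch of one way to get per-socket limits with hyper 0.14's `make_service_fn`, constructing a fresh limiter for each accepted connection (this reuses the PR's type names, but the wiring is hypothetical):

```rust
// Hypothetical sketch: each accepted connection gets its own ConnectionLimit,
// so the request count and start time are tracked per client socket.
let make_service = hyper::service::make_service_fn(move |_conn| {
    let inner = inner_service.clone();
    async move {
        Ok::<_, std::convert::Infallible>(
            ConnectionLimitLayer::new(max_requests, max_duration).layer(inner),
        )
    }
});
```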
```rust
// Conditionally apply the ConnectionLimitLayer if any limits are set
let service = ConnectionLimitLayer::new(max_requests, max_duration).layer(service);
```
I think this could be added to the list above as another layer, after (or before) the `DecompressionAndMetricsLayer`.
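Something like the following placement, sketched with tower's `ServiceBuilder` (the exact position relative to the existing layers is the open question above, and this assumes `DecompressionAndMetricsLayer` is a unit-struct layer):

```rust
// Hypothetical sketch: slot the new layer next to the existing one.
let service = tower::ServiceBuilder::new()
    .layer(DecompressionAndMetricsLayer)
    .layer(ConnectionLimitLayer::new(max_requests, max_duration))
    .service(service);
```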
```diff
@@ -22,14 +23,19 @@ use tracing::Span;
 mod decompression;
 pub use self::decompression::{DecompressionAndMetrics, DecompressionAndMetricsLayer};

+mod connectionlimit;
```
Nit: module names usually use `_` for delimiting multiple words, so this would be `connection_limit`.
```rust
return Err(Status::resource_exhausted(
    "Connection closed after reaching the limit.",
));
```
Will the server close the connection if this error is returned? Or are we depending on the client closing it when it sees this error?
Thank you @jszwedko for reviewing this. It seems this approach is a dead end, so it might be better to close this PR and instead try to implement a keepalive limit on the client side (vector sink), as that might be easier and more versatile, with the added possibility of load balancing between multiple upstream hosts 🤔
Related: #19457
Related: #10728
Adds two new configuration options for the `vector` source to mitigate the linked issues:

- `max_duration` - maximum time for a client connection
- `max_requests` - maximum number of requests per client connection

None, either, or both of these parameters can be set.
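A minimal sketch of how these options might look in a Vector TOML config (the source name, address, and values are illustrative, and the exact units and formats are assumptions not confirmed by this PR):

```toml
[sources.upstream_vector]
type = "vector"
address = "0.0.0.0:6000"
# Close each client connection once it has been open this long (assumed seconds)
max_duration = 300
# Close each client connection after it has served this many requests
max_requests = 10000
```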