Make splunkhec exporter http idle timeout lower/configurable #20543
Comments
This is not a proper response, but some context here: Our default timeout is set to 10s: https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/main/exporter/splunkhecexporter/factory.go#L69
It looks like the fix is to change the default idle timeout to 10s (or at least make it configurable). We can look into that.
Ah! Good find! In fact, all these settings likely play a role here: max_idle_conns. Will read up on them. Can I override these settings as part of the "advanced configs"?
yes, here is how:
I have opened a PR to make the default 10s instead of 30s.
Amazing, will try with a customer.
I don't see these documented anywhere outside of this issue and the PRs. Was that intentional?
Component(s)
exporter/splunkhec
Is your feature request related to a problem? Please describe.
The default HTTP connection idle timeout is apparently hard-coded at 30 seconds, which is higher than Splunk's, which is set by busyKeepAliveIdleTimeout and defaults to 12 seconds. Splunk therefore closes idle connections that the collector still considers reusable, which causes "EOF" errors on the collector side and, in turn, unnecessary retries and buffering on the OTel collector.
Describe the solution you'd like
I would like the HTTP idle connection timeout to be configurable, or set much lower, as we did in fluentd-hec, where the idle connection timeout is 5 seconds, explicitly to avoid this situation.
https://github.com/splunk/fluent-plugin-splunk-hec#idle_timeout-integer
Describe alternatives you've considered
We have tried the workaround of raising the Splunk busyKeepAliveIdleTimeout to a value higher than that of OTel or of any load balancer that may sit between the OTel agent and Splunk. While it does help reduce the instances of EOF, it requires users to customize the default Splunk deployment and is not as easy as tuning the collector in many cases.
Additional context
This has been tracked in Splunk's distro here