Add forceFlush to API #1287

anuraaga · 2020-12-13T01:36:42Z

Currently forceFlush is only defined for the SDK. However, it's needed mostly in instrumentation, it's not a configuration or initialization concern like other SDK methods. In particular, it's generally needed when instrumenting a serverless runtime that automatically freezes, such as AWS lambda. But by being in the SDK, it breaks our convention that instrumentation only depends on the API. Indeed the Java instrumentation currently depends on the SDK, and if the user's app doesn't include it, will crash.

https://github.com/open-telemetry/opentelemetry-java-instrumentation/blob/master/instrumentation/aws-lambda-1.0/library/aws-lambda-1.0-library.gradle#L19

Since forceFlush is needed in logic, not initialization / configuration, it seems to also make sense in the API conceptually. We don't want logic writers to be touching the SDK as much as possible, e.g., randomly shutting it down during a request.

The text was updated successfully, but these errors were encountered:

carlosalberto · 2020-12-14T15:18:18Z

cc @tedsuo

jkwatson · 2020-12-14T16:01:34Z

I'm super hesitant to add this sort of call to the API. I understand the use-case, but this will encourage people who don't need it to call it as well (much like people invoke the Java garbage collector explicitly in their code when, at best, it doesn't actually do anything).

Is there a way that the instrumentation can guard against the SDK not being loaded at runtime, rather than crashing the application?

anuraaga · 2020-12-15T00:58:18Z

@jkwatson It's possible, but it seems complex and also ties the instrumentation to the SDK against our normal designs. I also don't know if such a trick would be possible in all languages. We would need to add a note to our wording that "instrumentation only depends on API" with a caveat that it can depend on it in a compile-only type of way to provide forceFlush. Instead, we could have a heavily documented forceFlush in the API, and if even then a user called it, presumably they're also calling System.gc() and have other things to worry about? A rename to forceFlushBeforeRuntimeFreeze to clarify the use case is obtuse but also an option :) But I'm ok with just adding the caveat and not adding to the API if that seems better to folks.

Oberon00 · 2020-12-15T12:27:13Z

FaaS are peculiar environments and you will often need hacks and special configurations to support them. So it seems OK to have that particular instrumentation depend on the SDK. Alternatively, such instrumentations could have a configurable hook that is called whenever a possible suspension point is reached (i.e. after a request is finished).

Note that the API vs SDK question was briefly touched in #351 (comment) but everyone seemed to agree that it should be in the SDK so there was not much discussion on that particular point.

andrewhsu · 2020-12-15T16:33:58Z

talked about this at the spec sig mtg this morning, would like to discuss with @anuraaga at the spec sig mtg later today

tedsuo · 2020-12-16T01:08:57Z

Discussed further with @anuraaga on the spec call.

API should not get a Flush method, as its presence encourages the wrong behavior in instrumentation authors, who will incorrectly believe they need to manage flushing themselves. We saw this issue in prior tracing implementations and do not want to revisit that scenario with OpenTelemetry.

In this special case, Lambda performs a freeze, but does not provide the user with an onFreeze hook. So ,to a certain extent lambda needs to provide an SDK manager of some kind, or a flush callback, etc. Rather than having flush baked into the instrumentation itself, this should be separate. There's no issue with framework code that manages the lifecycle of the SDK (Spring Sleuth is an example), but this should not be mixed up with instrumentation in a way that creates a scenario where users cannot escape from flush or shutdown being called if they don't want it to happen.

carlosalberto · 2020-12-16T20:13:31Z

@tedsuo Just to confirm: can we close this issue?

anuraaga · 2020-12-17T00:11:25Z

Yeah I think we can close and maybe revisit if there happen to be more use cases in the future.

…cification#1287

anuraaga added the spec:trace Related to the specification/trace directory label Dec 13, 2020

andrewhsu added the needs discussion Need more information before all suitable labels can be applied label Dec 15, 2020

anuraaga closed this as completed Dec 17, 2020

anuraaga mentioned this issue Feb 18, 2021

Inject OpenTelemetrySdk into lambda library instrumentation instead o… open-telemetry/opentelemetry-java-instrumentation#2328

Merged

pcolazurdo added a commit to pcolazurdo/opentelemetry-lambda that referenced this issue Jun 10, 2022

Added delay after force_flush due to open-telemetry/opentelemetry-spe…

094b2be

…cification#1287

legendecas mentioned this issue Nov 9, 2022

feat(API): add ForceFlush and Shutdown to API open-telemetry/opentelemetry-js#3323

Closed

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add forceFlush to API #1287

Add forceFlush to API #1287

anuraaga commented Dec 13, 2020 •

edited

Loading

carlosalberto commented Dec 14, 2020

jkwatson commented Dec 14, 2020

anuraaga commented Dec 15, 2020

Oberon00 commented Dec 15, 2020

andrewhsu commented Dec 15, 2020

tedsuo commented Dec 16, 2020

carlosalberto commented Dec 16, 2020

anuraaga commented Dec 17, 2020

Add forceFlush to API #1287

Add forceFlush to API #1287

Comments

anuraaga commented Dec 13, 2020 • edited Loading

carlosalberto commented Dec 14, 2020

jkwatson commented Dec 14, 2020

anuraaga commented Dec 15, 2020

Oberon00 commented Dec 15, 2020

andrewhsu commented Dec 15, 2020

tedsuo commented Dec 16, 2020

carlosalberto commented Dec 16, 2020

anuraaga commented Dec 17, 2020

anuraaga commented Dec 13, 2020 •

edited

Loading