Initial POC version of url-based sampler #66

iNikem · 2021-08-05T12:55:10Z

Closes #65

Asking for the following feedback.

First, which semantic attribute should we match against: http.url or http.route? The latter is not always available; the former has some problems, see below.

Second, should we match the whole URL in http.url or only the path portion? If I configure the SDK manually inside my application's code, then I don't know the hostname that my application will respond to. Thus it seems that we cannot match the full URL, only the path portion. But this means that on every call to shouldSample we have to parse the http.url attribute and extract the path. This may be very expensive.

Third, should we narrow this sampler to SERVER spans only? The original request described in the issue seems to assume this.

Fourth, if we want to provide something more than just an exact match, which pattern language should we use? Always assume startsWith? Ant-style path matching? Regex?

url-sampler/src/test/java/io/opentelemetry/contrib/samplers/UrlSamplerTest.java

jkwatson · 2021-08-05T19:54:45Z

First, which semantic attribute should we match against: http.url or http.route? The latter is not always available; the former has some problems, see below.

I think we have to use http.url, at least as a fallback. It might be faster to check the route first, then fall back to the URL. If something has already parsed out that route, we can skip re-parsing it.

Second, should we match the whole URL in http.url or only the path portion? If I configure the SDK manually inside my application's code, then I don't know the hostname that my application will respond to. Thus it seems that we cannot match the full URL, only the path portion. But this means that on every call to shouldSample we have to parse the http.url attribute and extract the path. This may be very expensive.

Has to be just the path portion, because the host/port will be all over the place. It might be worth benchmarking parsing of the URL vs. just using a regex on the full URL, to see which is faster and allocates less.

Third, should we narrow this sampler to SERVER spans only? The original request described in the issue seems to assume this.

I think it should, yes. It will be a very fast enum comparison to check against it. Digging into attributes will be slower.
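To illustrate the ordering argued for here, the following is a minimal sketch that compares the span kind before touching any attribute. `KindFirstCheck` and its nested `Kind` enum are hypothetical stand-ins (the real SDK types are `SpanKind` and `Attributes`), and attributes are modeled as a plain `Map` so the example stays self-contained.

```java
import java.util.Map;
import java.util.regex.Pattern;

/** Hypothetical sketch: check the cheap span-kind condition before any attribute lookup. */
final class KindFirstCheck {
  /** Stand-in for the SDK's SpanKind enum. */
  enum Kind { SERVER, CLIENT, INTERNAL }

  private final Kind kind;
  private final Pattern pattern; // compiled once, not on every call

  KindFirstCheck(Kind kind, String regex) {
    this.kind = kind;
    this.pattern = Pattern.compile(regex);
  }

  /** Returns true when the span is of the configured kind and its http.url matches. */
  boolean shouldDrop(Kind spanKind, Map<String, String> attributes) {
    if (spanKind != kind) {
      return false; // enum reference comparison, no attribute access needed
    }
    String url = attributes.get("http.url");
    return url != null && pattern.matcher(url).matches();
  }
}
```

With this ordering, spans of other kinds never pay for the map lookup or the regex match.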

Fourth, if we want to provide something more than just an exact match, which pattern language should we use? Always assume startsWith? Ant-style path matching? Regex?

My gut tells me regex, but I'd like to know how fast we can make the ant-style path matching, compared to precompiled regexes.

anuraaga · 2021-08-06T06:01:25Z

First, which semantic attribute should we match against: http.url or http.route? The latter is not always available; the former has some problems, see below.

I feel that we can avoid route entirely given we're configuring regexes in the rules anyway. We already have enough trouble with the URL vs. target question without adding the sometimes-but-not-always-available route to the mix :P

Second, should we match the whole URL in http.url or only the path portion? If I configure the SDK manually inside my application's code, then I don't know the hostname that my application will respond to. Thus it seems that we cannot match the full URL, only the path portion. But this means that on every call to shouldSample we have to parse the http.url attribute and extract the path. This may be very expensive.

Parsing out an entire URL is definitely expensive. I think just finding the target can be made very cheap; we basically just need to count slashes.
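As a rough illustration of the "count slashes" idea, here is a minimal sketch that locates the request target with two `indexOf` calls instead of a full URL parse. `TargetExtractor` is a hypothetical helper, not code from this PR, and it deliberately ignores edge cases such as a query string directly after the host with no path.

```java
/** Hypothetical helper: finds the request target in an absolute URL without parsing it fully. */
final class TargetExtractor {
  private TargetExtractor() {}

  /**
   * Returns everything from the first slash after "scheme://host[:port]" onward,
   * "/" when the URL has no such slash, or the input itself when it has no scheme.
   */
  static String target(String url) {
    int schemeEnd = url.indexOf("://");
    if (schemeEnd < 0) {
      return url; // already a target such as "/healthcheck"
    }
    // The first slash after the two scheme slashes starts the path.
    int pathStart = url.indexOf('/', schemeEnd + 3);
    return pathStart < 0 ? "/" : url.substring(pathStart);
  }
}
```

The only allocation is the final `substring`; whether this actually beats a regex over the full URL would still need the benchmark suggested earlier in the thread.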

As for URL vs path, URL is more useful I guess, since someone may need to match against the host (sampling all requests to some external API in a particular way would need the host). In practice it's rare though, so path only could be OK (it solves the highly demanded health check sampling). I guess we could either

1. Support full URL. Reconstruct the URL from the triplet if the triplet is fully available and url isn't. If the triplet is only partially filled, ignore it.
2. Support path. Check for target; if not available, check for url.

Option 2 is simpler logic but not as flexible as allowing a URL to match against URL.

We have a similar discussion in the instrumentation repo, and it really seems like supporting both URL and triplet causes a lot of complexity. Perhaps this sampler is also a motivation to revisit the spec and consider dropping one of these. It's basically spaghetti within the spec; we may have anticipated it only extending to spaghetti in backends, but I'm starting to see it affect instrumentation and SDK plugins like this one too. The potential performance wins from having the choice of a triplet may not be worth it.

My gut tells me regex, but I'd like to know how fast we can make the ant-style path matching, compared to precompiled regexes.

I've seen ant-style path matching in libraries like Spring implemented by translating to regex - so that's what I did too since X-Ray uses glob patterns

https://github.com/open-telemetry/opentelemetry-java-contrib/blob/main/aws-xray/src/main/java/io/opentelemetry/contrib/awsxray/SamplingRuleApplier.java#L344

From what I understand, some version of Java improved the regex implementation such that the regex equivalent of a glob pattern performs about the same as dynamic programming instead of potentially going exponential. The advantage of the glob pattern is probably more that it prevents defining patterns with arbitrary regex syntax like captures / backreferences, which can hurt.
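For readers unfamiliar with the translation approach, the following is a minimal sketch of turning a glob into a precompiled regex. It is an illustration of the general technique only, not a copy of the linked SamplingRuleApplier code; `GlobPatterns` is a hypothetical name, and the sketch assumes only `*` (any run of characters) and `?` (exactly one character) are special.

```java
import java.util.regex.Pattern;

/** Hypothetical sketch: translate a glob with '*' and '?' into an equivalent regex. */
final class GlobPatterns {
  private GlobPatterns() {}

  static Pattern toRegex(String glob) {
    StringBuilder regex = new StringBuilder();
    StringBuilder literal = new StringBuilder();
    for (int i = 0; i < glob.length(); i++) {
      char c = glob.charAt(i);
      if (c == '*' || c == '?') {
        flushLiteral(literal, regex);
        regex.append(c == '*' ? ".*" : ".");
      } else {
        literal.append(c);
      }
    }
    flushLiteral(literal, regex);
    return Pattern.compile(regex.toString());
  }

  /** Quotes the pending literal text so regex metacharacters in it lose their meaning. */
  private static void flushLiteral(StringBuilder literal, StringBuilder regex) {
    if (literal.length() > 0) {
      regex.append(Pattern.quote(literal.toString()));
      literal.setLength(0);
    }
  }
}
```

Because every non-wildcard run goes through `Pattern.quote`, a user-supplied glob cannot introduce captures, backreferences, or other regex syntax, which is the safety property described above.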

Third, should we narrow this sampler to SERVER spans only? The original request described in the issue seems to assume this.

While having a configuration to limit to kinds would be fine, I don't think the default should be to limit to any kind. There are enough use cases for filtering on client (external API requests, client-side-load-balancing clients which issue health checks).

iNikem · 2021-08-06T11:38:59Z

While having a configuration to limit to kinds would be fine, I don't think the default should be to limit to any kind. There are enough use cases for filtering on client (external API requests, client-side-load-balancing clients which issue health checks).

Even in the case when any given deployment wants to have both server- and client-side calls sampled I would expect them to have different patterns. Thus every given instance of url-based sampler will probably work with only one kind.

anuraaga · 2021-08-06T12:11:44Z

Thus every given instance of url-based sampler will probably work with only one kind.

I don't know how common it is, but there is a pattern of using client side health checks with load balancing. It means that filtering on the standard path like /healthcheck will work great across all servers and clients. I think semantics-wise applying a path filter on both client and server is correct - we see server side filtering of health check being common because k8s isn't traced with Otel but if it was, it would also filter the same. All kinds are "supposed to" behave the same for this special path. So I'd generally expect our default to go broad and if users want it as a performance optimization they could specify a kind to filter on.

iNikem · 2021-08-10T07:59:17Z

@anuraaga I am still thinking about filtering on span kind. But your examples for client-side sampling are still about path, not host+path. So to be sure: do you want to match the full URL or just the path part of it?

anuraaga · 2021-08-10T08:19:55Z

@iNikem I think path is much more common - but I still think URL is more useful in general since I can think of plenty of URLs where the host part does have meaning since the host often indicates a type of API.

Though I realized, why do we make it a url-based sampler anyway - how about an AttributeSampler which accepts a map from attribute key to patterns? Even for the path, or url, use case, it doesn't seem much harder to use than one specialized to URLs. The X-Ray sampler supports this if it's a useful reference point (it also natively supports some of the attributes only because the X-Ray data model has native fields for them and does not populate them to attributes - but I don't think ours needs that).

https://github.com/open-telemetry/opentelemetry-java-contrib/blob/main/aws-xray/src/main/java/io/opentelemetry/contrib/awsxray/SamplingRuleApplier.java#L97

iNikem · 2021-08-10T08:38:24Z

Though I realized, why do we make it a url-based sampler anyway - how about an AttributeSampler which accepts a map from attribute key to patterns?

This does not change my question in any way. What I am trying to understand now is this: do we expect an end-user to provide "/healthcheck" or "http://example.com/healthcheck" or ".*/healthcheck" as the configuration for this sampler? Path or full URL?

anuraaga · 2021-08-10T10:25:40Z

It means that both something like .deny(SemanticAttributes.TARGET, "/healthcheck") and .deny(SemanticAttributes.URL, ".*/healthcheck") would work fine. To add to our documentation in the FAQ of "how to disable health checks", we would need to resolve open-telemetry/opentelemetry-java-instrumentation#3700 but it becomes decoupled from the sampler implementation, which is able to satisfy this use case, or any other attribute based use case. Does that work?

iNikem · 2021-08-11T08:38:50Z

It means that both something like .deny(SemanticAttributes.TARGET, "/healthcheck") and .deny(SemanticAttributes.URL, ".*/healthcheck") would work fine. To add to our documentation in the FAQ of "how to disable health checks", we would need to resolve open-telemetry/opentelemetry-java-instrumentation#3700 but it becomes decoupled from the sampler implementation, which is able to satisfy this use case, or any other attribute based use case. Does that work?

I will try this out, thanks for the suggestion :)

iNikem · 2021-08-11T14:12:15Z

@anuraaga PTAL

Several open questions:

Should we do something to ignore query string and fragment? Or should we match the whole value of HTTP_URL?
In case of HTTP_URL should .*/healthcheck match http://healthcheck?
In case of HTTP_URL should .*/actuator match http://example.com/context/actuator?

anuraaga · 2021-08-11T15:04:16Z

I would suggest the same for all of them - don't do anything special, so the pattern is always just applied to the value of the key. It's easy to add .* where something like the query should be skipped, and I think it's natural. And while I do see the trickiness of accidentally matching http://healthcheck in the second example if the intention is path, at the same time it still seems to be the most straightforward behavior, I think.
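To make the consequences of "no special handling" concrete, here is a small sketch that applies a pattern to the entire attribute value. It assumes whole-value matching via `Matcher.matches()`; whether the PR uses `matches()` or `find()` is not stated in this thread, so treat that choice, and the helper name `WholeValueMatch`, as assumptions.

```java
import java.util.regex.Pattern;

/** Hypothetical sketch: the pattern must match the entire attribute value. */
final class WholeValueMatch {
  private WholeValueMatch() {}

  static boolean matches(String regex, String attributeValue) {
    // A real sampler would compile the pattern once up front, not on every call.
    return Pattern.compile(regex).matcher(attributeValue).matches();
  }
}
```

Under that assumption, `.*/healthcheck` matches both `http://example.com/healthcheck` and `http://healthcheck`, `.*/actuator` matches `http://example.com/context/actuator`, and a URL with a query string such as `http://example.com/healthcheck?probe=1` only matches once the pattern is extended to `.*/healthcheck.*`.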

iNikem · 2021-08-11T16:21:11Z

The only remaining concern of mine is that I really would like to preserve p -> p.endsWith("$") ? p : (p + ".*"), so that users don't always have to add ".*" at the end of their patterns. WDYT @anuraaga @jkwatson
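The effect of that lambda can be sketched as follows. `PatternDefaults` is a hypothetical wrapper around the expression quoted above, and the sketch again assumes whole-value matching with `Matcher.matches()`.

```java
import java.util.function.UnaryOperator;
import java.util.regex.Pattern;

/** Hypothetical sketch: append ".*" unless the user anchored the pattern with "$". */
final class PatternDefaults {
  private PatternDefaults() {}

  static final UnaryOperator<String> WITH_TRAILING_WILDCARD =
      p -> p.endsWith("$") ? p : (p + ".*");

  static Pattern compile(String userPattern) {
    return Pattern.compile(WITH_TRAILING_WILDCARD.apply(userPattern));
  }
}
```

A pattern written as `/healthcheck` then behaves like a prefix match and also covers `/healthcheck?probe=1`, while `/healthcheck$` keeps exact-match behavior.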

url-sampler/src/main/java/io/opentelemetry/contrib/samplers/StringAttributeSampler.java

iNikem · 2021-08-11T16:22:14Z

Also would love to hear suggestions for module name.

anuraaga · 2021-08-12T02:11:12Z

url-sampler/src/main/java/io/opentelemetry/contrib/samplers/StringAttributeSampler.java

+  private final SpanKind kind;
+  private final Sampler delegate;
+
+  public StringAttributeSampler(Map<AttributeKey<String>, ? extends Collection<String>> patterns, SpanKind kind, Sampler delegate) {


I think we may as well keep the delegate sampler together with the pattern. UX should be straightforward with a builder: StringAttributeSamplerBuilder.addRule(key, pattern, delegate). It is easy and opens up setting different sampling rates for different endpoints.

Also be careful with Map, since order is important, both to determine precedence and to allow users to optimize performance (if they want) by having common rules up front. List<SamplingRule>, for example, could be better.
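The point about ordering can be illustrated with a minimal sketch in which the first matching rule wins. `OrderedRules` and its nested `Rule` are hypothetical stand-ins for the PR's types; decisions are modeled as plain strings and attributes as a `Map` to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

/** Hypothetical sketch: rules are evaluated in insertion order and the first match wins. */
final class OrderedRules {
  private static final class Rule {
    final String key;
    final Pattern pattern;
    final String decision;

    Rule(String key, Pattern pattern, String decision) {
      this.key = key;
      this.pattern = pattern;
      this.decision = decision;
    }
  }

  private final List<Rule> rules = new ArrayList<>();
  private final String fallback;

  OrderedRules(String fallback) {
    this.fallback = fallback;
  }

  OrderedRules add(String key, String regex, String decision) {
    rules.add(new Rule(key, Pattern.compile(regex), decision));
    return this;
  }

  String decide(Map<String, String> attributes) {
    for (Rule rule : rules) {
      String value = attributes.get(rule.key);
      if (value != null && rule.pattern.matcher(value).matches()) {
        return rule.decision; // earlier rules take precedence over later ones
      }
    }
    return fallback;
  }
}
```

If a broad rule such as `/api/.*` is added before a narrower `/api/admin/.*`, the narrower rule is never reached, which is why an explicitly ordered list is easier to reason about than a map with unspecified iteration order.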

I think we may as well keep the delegate sampler together with the pattern. UX should be straightforward with a builder: StringAttributeSamplerBuilder.addRule(key, pattern, delegate). It is easy and opens up setting different sampling rates for different endpoints.

You are making it more and more generic/complex :) I usually prefer to start with as simple as possible and then iterate based on the user feedback.

It is more generic, but is it really more complex? I think "setting a delegate Sampler for a pattern" is about as simple as it gets conceptually, it's almost like just populating a key/value set at that point. And while healthcheck is probably the #1 ask, it is common for systems to have APIs with vastly different QPS, so we can expect the new #1 ask to immediately want to be able to tweak their sampling rate :) I suspect it is worth working on this expectation, it doesn't seem to me to make things overly complex.

Wait. I realised that your suggestion is totally different from my original idea. I wanted to have "deny list": if attribute matches one of the patterns, then drop, otherwise delegate to the "main" sampler. E.g. my main sampler for the whole application is ParentBased, but I want to drop "/healthcheck". Deny list.

Your idea is more like a "router": match some patterns to samplers. Which means that I have to provide an AlwaysOff sampler for the "/healthcheck" pattern and I have to remember to provide a "catchAll" pattern to direct to the ParentBased sampler. Router.

We probably may want both samplers, one simpler (including simpler to think about), one more powerful.

In that case, doesn't it sort of make sense in this case to start with the more powerful one? I feel it will be a more productive discussion when bringing it to the spec, to talk about the generic sampling router vs the specific use case of denying URLs.

This is also because methods on SamplerBuilder seem like they can make up the difference between the two, e.g. deny(key, pattern) { addRule(key, pattern, Sampler.alwaysOff()) } is a simple shortcut that wouldn't require a totally separate Sampler implementation.
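A minimal sketch of how such a builder could cover both the "router" and the "deny list" shapes is shown below. All names here (`SketchSampler`, `RoutingSamplerSketch`) are hypothetical; the SDK's `Sampler` interface is reduced to a boolean decision over a `Map` of attributes so that the example runs without OpenTelemetry on the classpath.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

/** Stand-in for the SDK's Sampler, reduced to a yes/no decision. */
interface SketchSampler {
  boolean shouldSample(Map<String, String> attributes);

  static SketchSampler alwaysOn() { return attributes -> true; }

  static SketchSampler alwaysOff() { return attributes -> false; }
}

/** Hypothetical router: the first rule whose pattern matches picks the delegate. */
final class RoutingSamplerSketch implements SketchSampler {
  private static final class Rule {
    final String key;
    final Pattern pattern;
    final SketchSampler delegate;

    Rule(String key, Pattern pattern, SketchSampler delegate) {
      this.key = key;
      this.pattern = pattern;
      this.delegate = delegate;
    }
  }

  private final List<Rule> rules;
  private final SketchSampler fallback;

  // Private on purpose: instances are only created through the builder.
  private RoutingSamplerSketch(List<Rule> rules, SketchSampler fallback) {
    this.rules = new ArrayList<>(rules);
    this.fallback = fallback;
  }

  static Builder builder(SketchSampler fallback) {
    return new Builder(fallback);
  }

  @Override
  public boolean shouldSample(Map<String, String> attributes) {
    for (Rule rule : rules) {
      String value = attributes.get(rule.key);
      if (value != null && rule.pattern.matcher(value).matches()) {
        return rule.delegate.shouldSample(attributes);
      }
    }
    return fallback.shouldSample(attributes); // no explicit catch-all rule needed
  }

  static final class Builder {
    private final List<Rule> rules = new ArrayList<>();
    private final SketchSampler fallback;

    private Builder(SketchSampler fallback) {
      this.fallback = fallback;
    }

    Builder addRule(String key, String regex, SketchSampler delegate) {
      rules.add(new Rule(key, Pattern.compile(regex), delegate));
      return this;
    }

    /** Deny-list shortcut: a rule that routes matches to an always-off delegate. */
    Builder deny(String key, String regex) {
      return addRule(key, regex, SketchSampler.alwaysOff());
    }

    RoutingSamplerSketch build() {
      return new RoutingSamplerSketch(rules, fallback);
    }
  }
}
```

In this shape, `RoutingSamplerSketch.builder(mainSampler).deny("http.target", "/healthcheck").build()` reads like the simple deny list, while `addRule` remains available for per-endpoint sampling rates, and the fallback sampler removes the need to remember a catch-all pattern.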

url-sampler/src/main/java/io/opentelemetry/contrib/samplers/StringAttributeSampler.java

url-sampler/src/main/java/io/opentelemetry/contrib/samplers/StringMatcher.java

anuraaga · 2021-08-12T02:16:54Z

settings.gradle.kts

@@ -22,3 +22,4 @@ include(":aws-xray")
 include(":dependencyManagement")
 include(":example")
 include(":jmx-metrics")
+include(":url-sampler")


Since package is samplers this should also be samplers I guess

My initial idea was to have every single contrib sampler in a separate module. They then can have separate lifecycles and maintainers.

Hmm - need to imagine more what other samplers there could be. Presumably you mean different modules even for samplers that aren't vendor-specific. Perhaps a rate-limiting sampler. Realistically, would these sampler primitives really have different maintainers? I figured this is prototyping with the intention of moving into the SDK samplers package eventually as these seem pretty "core" for lack of a better term for me. Though then I guess it brings the idea of just using the tracing-incubator too - https://github.com/open-telemetry/opentelemetry-java/tree/main/sdk-extensions/tracing-incubator.

I'm -1 to the idea of this or similar important Samplers being in contrib long term - they seem like tier 1 components of the SDK to me.

@jkwatson WDYT?

We discussed this at the SIG and agreed that it's very likely this sort of sampler will end up in the core SDK, after this PoC and then a spec discussion. So tracing-incubator is also a fine place to put this sampler and a great use case for that package.

To the initial point of the thread, though, I think we can assume the maintainers for this sampler are the Java maintainers, and that this artifact is temporary as a PoC, so we wouldn't need to generalize our approach to "every single contrib sampler", as we should think of this not as a "contrib sampler" but as a PoC for a "core sampler". @iNikem I guess you can pick whether you think it's easier to work on this in tracing-incubator or a new artifact in this contrib repo.

Once this is officially in the spec, then it should absolutely be in the core repo. Until then, tracing-incubator is acceptable if contrib doesn't work for you.

I'd like to think about having a formal decision-making process on this kind of thing.

Perhaps:

If we think a new component should be a core component, create a spec issue for it, and do prototyping in the tracing-incubator (eventually metrics-incubator, or some other clearly -alpha component in the core repo). If/when the spec issue gets approved, then it can be moved to a normal "extension" or "sdk-extension" artifact, as appropriate.

If we don't think it's going to be a core component, then put it into contrib and make sure it has an owner.

The other question is...how do we prune out failed incubator experiments, or unloved contrib components?

url-sampler/build.gradle.kts

anuraaga · 2021-08-16T07:37:14Z

url-sampler/build.gradle.kts

+}
+
+tasks {
+  shadowJar {


We don't need to shadow at least at this layer

Why not? If I want to grab this sampler and add it as an extension to my agent, then I need a full shadowed jar, right?

Well we definitely don't want to shade in the SDK as the agent distro will be bringing it in. This sampler doesn't seem to have any dependencies that we want to shade.

Anyways in your issue you also mention "At this moment I don't propose nor plan any auto-configuration support for this sampler." :P Let's just implement this as we would a normal Java library for now.

url-sampler/build.gradle.kts

anuraaga · 2021-08-16T07:38:46Z

url-sampler/src/main/java/io/opentelemetry/contrib/samplers/RuleBasedRoutingSampler.java

+  private final SpanKind kind;
+  private final Sampler delegate;
+
+  public RuleBasedRoutingSampler(List<SamplingRule> rules, SpanKind kind, Sampler defaultDelegate) {


Let's make this package private and add builder(SpanKind, Sampler) static method

Why do we want to hide this constructor? There is no actual requirement to use the builder, is there?

Sorry, missed commenting on this. For libraries we expose as little API as possible to reduce surface. If there is a useful shortcut we could expose it, but this doesn't seem like a shortcut compared to the builder. Instead, we can even hide SamplingRule itself from the API too if we only have the builder.

url-sampler/src/main/java/io/opentelemetry/contrib/samplers/RuleBasedRoutingSampler.java

url-sampler/src/main/java/io/opentelemetry/contrib/samplers/RuleBasedRoutingSamplerBuilder.java

anuraaga · 2021-08-16T07:39:44Z

url-sampler/src/main/java/io/opentelemetry/contrib/samplers/RuleBasedRoutingSamplerBuilder.java

+  private final SpanKind kind;
+  private final Sampler defaultDelegate;
+
+  public RuleBasedRoutingSamplerBuilder(SpanKind kind, Sampler defaultDelegate) {


Make package private

anuraaga · 2021-08-16T07:40:05Z

url-sampler/src/main/java/io/opentelemetry/contrib/samplers/SamplingRule.java

+/**
+ * @see RuleBasedRoutingSampler
+ */
+public class SamplingRule {


Looks like this doesn't need to be public, at least for now.

iNikem · 2021-08-26T08:35:36Z

Replaced by #70

Initial POC version of url-based sampler

3eed30e

jkwatson reviewed Aug 5, 2021

url-sampler/src/test/java/io/opentelemetry/contrib/samplers/UrlSamplerTest.java

iNikem added 3 commits August 10, 2021 10:45

Extract UrlMatcher class

237ee83

Pre-compile patterns

4fd8fa3

Use UrlMatcher in UrlSampler

6fe69e1

Accept several attributes to check

a991dd6

Will not extract path for matching

c440c53

iNikem marked this pull request as ready for review August 11, 2021 16:21

iNikem requested a review from a team August 11, 2021 16:21

iNikem commented Aug 11, 2021

url-sampler/src/main/java/io/opentelemetry/contrib/samplers/StringAttributeSampler.java

anuraaga reviewed Aug 12, 2021

Converted to rule-based routing implementation

d13a572

anuraaga reviewed Aug 16, 2021

iNikem mentioned this pull request Aug 26, 2021

Attributes rule based sampler #70

Merged

iNikem closed this Aug 26, 2021

anuraaga deleted the url-sampler branch January 17, 2022 07:28
