[processor/tailsampling] fix `InvertNotSampled` decision precedence when inside and sub policy #33671

jamesrwhite · 2024-06-20T10:54:26Z

Description:

This fixes the handling of AND policies that contain a sub-policy with invert_match=true. Previously, if a sub-policy evaluation returned NotSampled or InvertNotSampled, the AND policy returned a NotSampled decision regardless, effectively downgrading an InvertNotSampled result.

This was breaking the documented behaviour that inverted decisions should take precedence over all others.

This is related to the changes made in #9768 that introduced support for using invert_match within and sub policies.
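
To illustrate the change described above, here is a minimal sketch in Go. It is a simplified model, not the processor's actual code: the `Decision` type and the `andBefore` / `andAfter` helpers are hypothetical names introduced for illustration, and only the decision names come from this PR.

```go
package main

import "fmt"

// Decision is a simplified stand-in for the processor's sampling decision
// (hypothetical type; only the decision names are taken from this PR).
type Decision int

const (
	Sampled Decision = iota
	NotSampled
	InvertNotSampled
)

// String makes the decisions readable when printed.
func (d Decision) String() string {
	switch d {
	case Sampled:
		return "Sampled"
	case NotSampled:
		return "NotSampled"
	case InvertNotSampled:
		return "InvertNotSampled"
	default:
		return "Unknown"
	}
}

// andBefore models the old behaviour: any negative sub-policy decision
// is collapsed to NotSampled, so InvertNotSampled is downgraded.
func andBefore(subDecisions []Decision) Decision {
	for _, d := range subDecisions {
		if d == NotSampled || d == InvertNotSampled {
			return NotSampled
		}
	}
	return Sampled
}

// andAfter models the fixed behaviour: the negative sub-policy decision
// is returned unchanged, so InvertNotSampled keeps its precedence.
func andAfter(subDecisions []Decision) Decision {
	for _, d := range subDecisions {
		if d == NotSampled || d == InvertNotSampled {
			return d
		}
	}
	return Sampled
}

func main() {
	// One sub-policy with invert_match=true rejected the trace,
	// the other sub-policy accepted it.
	subDecisions := []Decision{InvertNotSampled, Sampled}
	fmt.Println("before:", andBefore(subDecisions))
	fmt.Println("after: ", andAfter(subDecisions))
}
```

In this model the only difference is which value is returned on the negative branch; the surrounding processor then applies its documented precedence rules to whatever the AND policy returns.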

Link to tracking Issue: #33656

Testing:

I manually tested that this fixes the issue described in #33656 and also updated the tests. If you have any suggestions for more tests we could add, let me know.

Documentation:


jpkrohling · 2024-06-20T11:03:23Z

Thank you for the PR! This might need a changelog entry:
https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/main/CONTRIBUTING.md#adding-a-changelog-entry

jamesrwhite · 2024-06-20T13:55:42Z

@jpkrohling I've added a changelog entry, but I wasn't sure whether to classify this as a breaking change. The behaviour now matches the documentation, but it could be a breaking change for people who were relying on the old behaviour.

jpkrohling · 2024-06-20T15:06:21Z

I believe this is a bug fix.


karinhawk · 2024-11-05T14:59:29Z

Update: I didn't notice this was already covered in this bug.

We implemented tail sampling using v0.100.0 of the collector and want to upgrade to the latest version, but since upgrading we have been unable to replicate behaviour we took from the documentation examples in this repo (namely, the backwards compatibility policy).

It seems that since v0.104.0, when this bug fix was merged, that behaviour is no longer possible. I believe the documentation examples no longer represent the sampling decisions that are actually possible, which is misleading.

We liked the pattern of sampling everything except a list of services, and then creating policies for which traces to keep from those services, as it made for a lean set of policies.

Our sampling used to work with the following, but now does not, because the "inverted not sample" decision takes precedence over every subsequent "sample" or "not sample" decision made by policies for those services, meaning those services will always NOT be sampled:

      {
        name: 'backwards-compatibility-policy',
        type: 'and',
        and: {
          and_sub_policy: [
            {
              name: 'services-using-tail_sampling-policy',
              type: 'string_attribute',
              string_attribute: {
                key: 'service.name',
                values: [
                  'otel-demo*',
                ],
                invert_match: true,
                enabled_regex_matching: true,
              },
            },
            {
              name: 'sample-all-policy',
              type: 'always_sample',
            },
          ],
        },
      },
       ...subsequent policies sampling the 'otel-demo' services
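
The effect described above can be sketched in Go as follows. This is a simplified model under stated assumptions, not the processor's implementation: `Decision` and `keepTrace` are hypothetical names, the "inverted sample" decision is left out, and the precedence shown (an inverted not sample decision wins, otherwise any sample decision keeps the trace) is only the part of the documented rules discussed in this thread.

```go
package main

import "fmt"

// Decision is a simplified stand-in for a per-policy sampling decision
// (hypothetical type used only for this illustration).
type Decision int

const (
	Sampled Decision = iota
	NotSampled
	InvertNotSampled
)

// keepTrace combines per-policy decisions into a final keep/drop result.
// Assumed precedence: any InvertNotSampled drops the trace outright;
// otherwise a single Sampled decision is enough to keep it.
func keepTrace(perPolicy []Decision) bool {
	sawSampled := false
	for _, d := range perPolicy {
		switch d {
		case InvertNotSampled:
			return false
		case Sampled:
			sawSampled = true
		}
	}
	return sawSampled
}

func main() {
	// Decisions for a trace from an 'otel-demo' service. The first entry
	// comes from the backwards-compatibility AND policy, the second from
	// a later policy that wants to keep the trace.
	beforeFix := []Decision{NotSampled, Sampled}
	afterFix := []Decision{InvertNotSampled, Sampled}

	fmt.Println("kept before the fix:", keepTrace(beforeFix))
	fmt.Println("kept after the fix: ", keepTrace(afterFix))
}
```

Under these assumptions, the AND policy's NotSampled result used to leave room for a later policy to sample the trace, whereas its InvertNotSampled result now overrides every later decision.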

Keeping backwards compatibility, by sampling at 100% the services that are not ready to implement tail sampling, is extremely important to us as we have such a large estate.

I understand this bug fix was needed, but I would love some help working out whether this behaviour can still be replicated after this change. It was working so well for us!

jpkrohling · 2024-11-27T11:48:27Z

@karinhawk, we have another issue covering this, right?

jamesrwhite added 2 commits June 18, 2024 17:33

Merge branch 'open-telemetry:main' into fix-and-policy-invert-not-sampled

3184487

jamesrwhite requested a review from jpkrohling as a code owner June 20, 2024 10:54

jamesrwhite requested a review from a team June 20, 2024 10:54

github-actions bot assigned crobert-1 Jun 20, 2024

github-actions bot added the processor/tailsampling Tail sampling processor label Jun 20, 2024

jamesrwhite changed the title ~~[processor/tailsampling] fix inverted not sampled AND policy~~ [processor/tailsampling] fix InvertNotSampled decision precedence when inside and sub policy Jun 20, 2024

jpkrohling assigned jpkrohling and unassigned crobert-1 Jun 20, 2024

jpkrohling approved these changes Jun 20, 2024


Add changelog entry

118da55

Merge branch 'main' into fix-and-policy-invert-not-sampled

401853a

jpkrohling approved these changes Jun 20, 2024


jpkrohling merged commit e2fda02 into open-telemetry:main Jun 20, 2024
154 checks passed

github-actions bot added this to the next release milestone Jun 20, 2024

crobert-1 mentioned this pull request Jun 20, 2024

[processor/tailsampling] invert_match not given precedence when inside and policy #33656

Open

jamesrwhite deleted the fix-and-policy-invert-not-sampled branch July 2, 2024 17:37

hyang023 pushed a commit to hyang023/opentelemetry-collector-contrib that referenced this pull request Jul 16, 2024

Revert "[processor/tailsampling] fix InvertNotSampled decision precedence when inside and sub policy (open-telemetry#33671)"

This reverts commit e2fda02.

fe7acca

jpkrohling mentioned this pull request Dec 4, 2024

[processor/tailsampling] Allow invert matches in composite policy to continue processing #36673

Merged
