
Metrics SDK: Filtering metrics attributes #1191

Merged · 19 commits merged into open-telemetry:main on Feb 5, 2022

Conversation

@lalitb (Member) commented Feb 2, 2022

Fixes

#1190

Changes

As per the specs, Views should be able to configure the filtering of attributes reported on metrics, including removing all the attributes. This PR adds a filter attribute-processor for this. The list of attributes to be included in the metrics can be specified through this filter.
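The mechanism can be sketched as follows. The class and type names below are simplified stand-ins modeled on the PR description (the SDK's real `AttributeMap` and `AttributesProcessor` types are richer than these), so treat this as an illustration of the idea, not the actual API:

```cpp
#include <map>
#include <set>
#include <string>

// Simplified stand-in for the SDK's attribute map type.
using AttributeMap = std::map<std::string, std::string>;

class AttributesProcessor
{
public:
  virtual ~AttributesProcessor() = default;
  // Returns the attributes that should be reported on the metric.
  virtual AttributeMap process(const AttributeMap &attributes) const = 0;
};

// Keeps only the attribute keys listed in the view configuration;
// an empty allow-list removes all attributes.
class FilteringAttributesProcessor : public AttributesProcessor
{
public:
  explicit FilteringAttributesProcessor(std::set<std::string> allowed)
      : allowed_(std::move(allowed))
  {}

  AttributeMap process(const AttributeMap &attributes) const override
  {
    AttributeMap result;
    for (const auto &kv : attributes)
    {
      if (allowed_.count(kv.first) != 0)
      {
        result.insert(kv);
      }
    }
    return result;
  }

private:
  std::set<std::string> allowed_;
};
```

A View configured with `FilteringAttributesProcessor({"http.method"})` would then report only the `http.method` attribute on matched instruments, and an empty allow-list would drop all attributes.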


For significant contributions please make sure you have completed the following items:

  • CHANGELOG.md updated for non-trivial changes
  • Unit tests have been added
  • Changes in public API reviewed

@lalitb lalitb requested a review from a team February 2, 2022 02:33
@lalitb lalitb changed the title Filter attributes Filtering metrics attributes Feb 2, 2022
@lalitb lalitb changed the title Filtering metrics attributes Metrics SDK: Filtering metrics attributes Feb 2, 2022
codecov bot commented Feb 2, 2022

Codecov Report

Merging #1191 (fa0b7e2) into main (e1b4a49) will increase coverage by 0.06%.
The diff coverage is 97.06%.


@@            Coverage Diff             @@
##             main    #1191      +/-   ##
==========================================
+ Coverage   93.29%   93.35%   +0.06%     
==========================================
  Files         174      177       +3     
  Lines        6404     6502      +98     
==========================================
+ Hits         5974     6069      +95     
- Misses        430      433       +3     
Impacted Files Coverage Δ
...clude/opentelemetry/sdk/common/attributemap_hash.h 78.58% <78.58%> (ø)
...include/opentelemetry/sdk/common/attribute_utils.h 98.08% <100.00%> (+0.86%) ⬆️
...ntelemetry/sdk/metrics/view/attributes_processor.h 81.25% <100.00%> (+81.25%) ⬆️
sdk/test/common/attribute_utils_test.cc 100.00% <100.00%> (ø)
sdk/test/common/attributemap_hash_test.cc 100.00% <100.00%> (ø)
sdk/test/metrics/attributes_processor_test.cc 100.00% <100.00%> (ø)

@@ -11,13 +11,25 @@ namespace metrics
{
using MetricAttributes = opentelemetry::sdk::common::AttributeMap;

/**
* The AttributesProcessor is responsible for customizing which
A Member commented on this diff:
I wonder if this can be made internal/private to the SDK for now.

The specification only allows very limited manipulation on attributes (e.g. remove certain attributes from the measurement), this generic solution could work but it comes with perf cost (e.g. the virtual AttributesProcessor::process call).

Doesn't have to be a blocker though.

lalitb (Member Author) replied:

Yes, I do suspect some perf cost associated with the virtual call and the attributes copy. I have added a benchmark test to collect cost statistics, which will be helpful if we need to work on performance improvements.

Also, I realized that we need to calculate a hash of the attribute map in order to store metrics data. I have added functionality to calculate the hash based on `std::hash` and `boost::hash_combine`. Since the hash should be the same irrespective of the order of key/values in the AttributeMap, I have changed the internal storage for AttributeMap from `unordered_map` to `map`. This way we can avoid sorting the keys for every measurement.
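A minimal illustration of that order-independence, using the well-known `boost::hash_combine` mixing recipe (this is a sketch, not the SDK's actual implementation):

```cpp
#include <cstddef>
#include <functional>
#include <map>
#include <string>

// The classic boost::hash_combine mixing step.
template <typename T>
void hash_combine(std::size_t &seed, const T &value)
{
  seed ^= std::hash<T>{}(value) + 0x9e3779b9 + (seed << 6) + (seed >> 2);
}

// Because std::map iterates keys in sorted order, two maps holding the same
// key/value pairs produce the same hash regardless of insertion order --
// no per-measurement sort is needed.
std::size_t hash_attribute_map(const std::map<std::string, std::string> &map)
{
  std::size_t seed = 0;
  for (const auto &kv : map)
  {
    hash_combine(seed, kv.first);
    hash_combine(seed, kv.second);
  }
  return seed;
}
```

With an `unordered_map`, the same two maps could iterate in different orders, so the keys would have to be sorted (or the hash made commutative) on every measurement.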

Let me see how we can make this class internal/private to the SDK, as C++ doesn't have a direct way of doing this. We could make `AttributesProcessor::process()` private and declare the other SDK classes as friends of it, but I would like to find a better approach if possible.
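For reference, the friend-based restriction described above can be sketched like this; `MetricStorage` and `NoopAttributesProcessor` are hypothetical stand-ins for whichever SDK-internal classes would be involved, and the types are simplified:

```cpp
#include <map>
#include <string>

using AttributeMap = std::map<std::string, std::string>;

class MetricStorage;  // hypothetical SDK-internal caller

class AttributesProcessor
{
public:
  virtual ~AttributesProcessor() = default;

private:
  // process() is private; only the friend class below may call it from the
  // outside. Derived classes can still override a private virtual.
  friend class MetricStorage;
  virtual AttributeMap process(const AttributeMap &attributes) const = 0;
};

// A pass-through processor, for illustration only.
class NoopAttributesProcessor : public AttributesProcessor
{
private:
  AttributeMap process(const AttributeMap &attributes) const override
  {
    return attributes;
  }
};

class MetricStorage
{
public:
  explicit MetricStorage(const AttributesProcessor &processor)
      : processor_(processor)
  {}

  // Allowed: MetricStorage is a friend of AttributesProcessor.
  // User code outside the SDK cannot call process() directly.
  AttributeMap record(const AttributeMap &attributes) const
  {
    return processor_.process(attributes);
  }

private:
  const AttributesProcessor &processor_;
};
```

The downside of this pattern is exactly what the comment anticipates: the friend list must name every SDK class that needs access, which couples the header to SDK internals.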

@reyang (Member) commented Feb 2, 2022

Doesn't have to be in this PR, but we need to understand the performance characteristics (e.g. heap allocation, memory fragmentation, contention if there is any) in order to guide the SDK design/implementation.

Here are some general guidelines regarding metrics memory usage: https://github.com/open-telemetry/opentelemetry-specification/blob/main/specification/metrics/supplementary-guidelines.md#memory-management

Consider two types of performance tests:

  1. benchmark - when we call instrument.Add/Record, what is the heap allocation in bytes and CPU cycles for each call, and what is the contribution from each component (provider, processor/reader, exporter)?
  2. stress test - while running the SDK and utilizing all the CPU cores to emit as many measurements as possible to the SDK (and probably scraping from the pull exporter at the same time), what is the total memory usage, do we see a stable calls/second rate, and how does scalability on SMP behave as the number of cores grows?
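As a rough illustration of the first item, a timing loop of the following shape can approximate the per-call cost. This uses only `<chrono>`; a real suite would typically use a benchmark framework such as Google Benchmark, and `process_copy` here is a hypothetical stand-in for the processor call, not an SDK function:

```cpp
#include <chrono>
#include <cstddef>
#include <map>
#include <string>

// Stand-in for processor->process(attributes): the per-call attribute copy.
static std::map<std::string, std::string> process_copy(
    const std::map<std::string, std::string> &attributes)
{
  return attributes;
}

// Times `iterations` calls and returns the average cost in nanoseconds.
// entries_touched reports total map entries seen, to keep the call observable.
static double benchmark_process(int iterations, std::size_t *entries_touched)
{
  const std::map<std::string, std::string> attrs = {
      {"http.method", "GET"}, {"http.status_code", "200"}};

  std::size_t total = 0;
  auto start = std::chrono::steady_clock::now();
  for (int i = 0; i < iterations; ++i)
  {
    total += process_copy(attrs).size();
  }
  auto elapsed = std::chrono::steady_clock::now() - start;

  if (entries_touched)
  {
    *entries_touched = total;
  }
  return std::chrono::duration<double, std::nano>(elapsed).count() / iterations;
}
```

A proper benchmark would additionally pin down heap allocations per call and break the cost out by component, which a plain timing loop cannot do.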

@reyang (Member) left a review comment:

I think it's fine to unblock this so we can get a working prototype end-to-end.

@lalitb (Member Author) commented Feb 4, 2022

> Doesn't have to be in this PR, but we need to understand the performance characteristics (e.g. heap allocation, memory fragmentation, contention if there is any) in order to guide the SDK design/implementation.
>
> Here are some general guidelines regarding metrics memory usage: https://github.com/open-telemetry/opentelemetry-specification/blob/main/specification/metrics/supplementary-guidelines.md#memory-management
>
> Consider two types of performance tests:
>
>   1. benchmark - when we call instrument.Add/Record, what is the heap allocation in bytes and CPU cycles for each call, and what is the contribution from each component (provider, processor/reader, exporter)?
>   2. stress test - while running the SDK and utilizing all the CPU cores to emit as many measurements as possible to the SDK (and probably scraping from the pull exporter at the same time), what is the total memory usage, do we see a stable calls/second rate, and how does scalability on SMP behave as the number of cores grows?

Thanks, this is useful. We will discuss these performance attributes in our community meeting and come up with a plan to measure them properly.

@lalitb lalitb merged commit b6a28df into open-telemetry:main Feb 5, 2022