feat: introduce metrics based on system diagnostics #84

aygalinc · 2024-11-03T14:49:35Z

Hello !
Here is the start of a PR for : #58
Here is the try to implement the metrics reporting scope on message publishing counter to see if it cope to the thing you have planned.

I have base the impl of system diagnostic metric reporter on the one available on Npgsql : https://github.com/npgsql/npgsql/blob/main/src/Npgsql/MetricsReporter.cs

I m not sure on why you need the NoOpImplem and where the reporter should be instanciated.

Signed-off-by: Gabriele Santomaggio <G.santomaggio@gmail.com>

Gsantomaggio · 2024-11-04T09:00:57Z

Thank you for the PR. Please format the code based on the project's rule

Gsantomaggio

Thank you for the PR. There are some changes to make.
May I ask to add:

some test
an example inside the directory examples with a project ?

RabbitMQ.AMQP.Client/Impl/IMetricsReporter.cs

RabbitMQ.AMQP.Client/Impl/MetricsReporter.cs

RabbitMQ.AMQP.Client/Impl/AmqpConnection.cs

RabbitMQ.AMQP.Client/Impl/AmqpPublisherBuilder.cs

aygalinc · 2024-11-04T12:15:21Z

Yup I will add tests when I get some bandwidth

aygalinc · 2024-11-04T13:15:42Z

@Gsantomaggio : About the test stack you use, is there any entry point that i can read to make it work loccaly ? Or there is just the code ?

Do you have consider to use things like TestContainer that reduce the burden to have a local manual setup and keep the thing under test fixture control ?

Gsantomaggio · 2024-11-04T13:21:49Z

@aygalinc, to run the tests, please refer to the following:
https://github.com/rabbitmq/rabbitmq-amqp-dotnet-client?tab=readme-ov-file#how-to-run
You need a RabbitMQ running.

Do you have consider to use things like TestContainer

We use the same way to test all the clients. We have yet to consider TestContainer.

Thank you

aygalinc · 2024-11-04T15:46:56Z

@Gsantomaggio : i have had one test to check the test stack you use. Testing consumer is pretty cumbersome because metrics records happened after consumer is invoked and their is not mean to directly wait that the consumer register metrics before going to the assert part.

I need to check the doc to be able to run this test in parallel, I think i cannot use the static factory pattern of the metrics because it impacts all the test suite so result will not be very consistent.

=> I work on macos and the sh script fails (but i make it work by commenting some part).

Gsantomaggio · 2024-11-05T08:48:02Z

=> I work on macos and the sh script fails (but i make it work by commenting some part).

There is some problem with the TLS test on mac. So you can skip them. Don't worry.

Testing consumer is pretty cumbersome

Feel free to add some dependency on the Test project like Prometheus or everything you need to read the metrics externally.

thank you

aygalinc · 2024-11-05T21:56:31Z

@Gsantomaggio : i have add some tests for publisher and consumer metrics.
feel free to check if its good for you.

Tests/Tests.csproj

Signed-off-by: Gabriele Santomaggio <G.santomaggio@gmail.com>

Gsantomaggio · 2024-11-07T10:15:17Z

@aygalinc CI is finally green. :) Thank you! I will do some tests.
Before merging this PR, we (probably) need to merge this #81.
cc @lukebakken

aygalinc · 2024-11-07T10:23:45Z

@Gsantomaggio : I want to add the MetricsReporter available in the public API for dotnet 8 in order to have one impleme that impelment OTEL guidelines and so a MetricReporter that throw not implemented exception for lower version or that extend the NoOp implem.
Does this make sens to you ?

Gsantomaggio · 2024-11-07T11:02:43Z

I want to add the MetricsReporter available in the public API for dotnet 8

It is ok with me.

Gsantomaggio · 2024-11-08T09:32:22Z

@aygalinc I am working to resolve the conflicts.

Signed-off-by: Gabriele Santomaggio <G.santomaggio@gmail.com>

Gsantomaggio · 2024-11-08T13:48:50Z

@aygalinc Ok, done. The branch is merged with main

Notes:

With this merge, you can run all the tests locally. We fixed the TLS problem.
The docker image used is now: pivotalrabbitmq/rabbitmq.
If you have an M* series, run ./.ci/ubuntu/one-node/gha-setup.sh start pull arm to run the correct docker image.
If you have an Intel series, run ./.ci/ubuntu/one-node/gha-setup.sh start pull to run the correct docker image.

Request:

May I ask you to add an example (doc/examples/) of how we can integrate the metrics with OpenTelemetry and/or Prometheus?

Thank you for your effort on this PR.

…sions.Diagnostics` supports it.

aygalinc · 2024-11-14T15:28:59Z

@lukebakken thx.
The semantic convention specify the label : db.client.connection.pool.name for db but it s really that they do not the same with messaging

I have add an issue yesterday in sem conv repo to have an explanation on this : open-telemetry/semantic-conventions#1575

lukebakken · 2024-11-14T17:10:02Z

Yep, I plan on using the same metrics and naming as the Java client (https://github.com/rabbitmq/rabbitmq-amqp-java-client/blob/main/src/main/java/com/rabbitmq/client/amqp/metrics/MicrometerMetricsCollector.java)

aygalinc · 2024-11-14T18:50:57Z

The only issue with the design in npgsql is the fact that all is static in the MetricReporter so poorly testable. I think this is due to the fact that they release it in .net version 7 where IMeterFactory was not available.

lukebakken · 2024-11-14T19:05:12Z

OK I'll take that into consideration. I don't expect users of this library to have to configure an IMeterFactory implementation, however.

lukebakken · 2024-11-14T22:51:44Z

@aygalinc thank you for the sample application - it makes configuring all of this much more obvious.

Would it be possible to add a metrics summary output? If you'd like to point me in the right direction, that would be great.

UPDATE - I will try this tomorrow: https://learn.microsoft.com/en-us/dotnet/core/diagnostics/metrics-collection#view-metrics-with-dotnet-counters

* Update deps

aygalinc · 2024-11-15T17:32:41Z

@lukebakken Where do you want have the summary ?

lukebakken · 2024-11-15T17:34:42Z

Interesting, how did you get that summary? I was thinking of having that information (or similar) printed out at the end of the demo program's run. I haven't had time yet to try out the dotnet-counters tool (I'm busy working on customer support today).

aygalinc · 2024-11-15T17:35:35Z

If you use Rider you can use the monitorng view, add the desidered MeterName t otrack in the advance settings (link the one we indicate in the otel example) and see the counter live. (Note that they do not print label) :

aygalinc · 2024-11-15T17:38:02Z

And on the example i set up the console exporter so periodically it print on stdout some info on measurement (like this one where you can see actual label which help to contextualize the metrics in observability framework):

lukebakken · 2024-11-15T18:12:04Z

Ah! I just didn't let it run long enough 🤦

* Fix metrics tests

…flaky test (rabbitmq#84 (comment))

lukebakken · 2024-11-18T19:24:16Z

For reference: https://opentelemetry.io/docs/specs/semconv/messaging/messaging-spans/

@aygalinc I'm not exactly sure if OTel attributes should be added in this PR, or in a similar manner to what the AMQP 0.9.1 .NET client does (https://github.com/rabbitmq/rabbitmq-dotnet-client/pull/1717/files, https://github.com/rabbitmq/rabbitmq-dotnet-client/blob/main/projects/RabbitMQ.Client/Impl/RabbitMQActivitySource.cs).

I think I'll wrap up this PR without any tags or OTel attributes, and we can add those in another PR. How about that?

aygalinc · 2024-11-18T19:30:25Z

Seems more than reasonable

aygalinc · 2024-11-18T19:52:18Z

@lukebakken I think it is the usage in your histogram to use seconds instead of ms.
There has been lot of discussion like here on this topic : open-telemetry/opentelemetry-specification#2977 (comment) and its recommanded in the doc : https://opentelemetry.io/docs/specs/semconv/general/metrics/#instrument-units .

Do you want to stay on ms for histogram ?
I think the main driver of this choice it s the power of sameness, if everyone implement duration unit as second you never think of the unit of the instrument.

lukebakken · 2024-11-18T19:56:52Z

Thanks for pointing that out. Seems like you'd want to use ms, but I'll change the PR to follow the guidelines.

lukebakken

@aygalinc thank you very much!

Gsantomaggio

Great job @aygalinc @lukebakken!

Thanks a lot for your effort @aygalinc in this PR

aygalinc added 2 commits October 31, 2024 21:46

feat: add message publishing metrics

ea97eb6

chore: add more metrics

a4638b5

aygalinc changed the title ~~Feat/introduce metrics based on system diagnostics~~ feat: introduce metrics based on system diagnostics Nov 3, 2024

formatting

8f51146

Signed-off-by: Gabriele Santomaggio <G.santomaggio@gmail.com>

Gsantomaggio requested changes Nov 4, 2024

View reviewed changes

aygalinc force-pushed the feat/introduce_metrics_based_on_system_diagnostics branch from 27244f7 to 1ca625e Compare November 5, 2024 21:08

fix: use only dotnet 8 ImeterFactory for implementation

bb4e334

aygalinc force-pushed the feat/introduce_metrics_based_on_system_diagnostics branch from 6f82dc7 to bb4e334 Compare November 5, 2024 21:54

Gsantomaggio reviewed Nov 6, 2024

View reviewed changes

Tests/Tests.csproj Outdated Show resolved Hide resolved

aygalinc and others added 3 commits November 7, 2024 09:15

chore: format

ea925ab

use rabbitmq:4.0.2-management

40e0ae6

Signed-off-by: Gabriele Santomaggio <G.santomaggio@gmail.com>

use rabbitmq:4.0.2-management

653b1b2

Signed-off-by: Gabriele Santomaggio <G.santomaggio@gmail.com>

Gsantomaggio added 2 commits November 8, 2024 11:56

merge

136aa04

merge

0b4b623

Signed-off-by: Gabriele Santomaggio <G.santomaggio@gmail.com>

lukebakken added 4 commits November 11, 2024 09:40

Modify Tests.csproj to use net462 since (supposedly) `Microsoft.Exten…

19e834f

…sions.Diagnostics` supports it.

* dotnet format fixes

2e91b4f

* Suppress TFM support build warnings in Tests.csproj

3723713

* Use Stopwatch in a netstandard2.0 compatible way.

9dc565a

lukebakken added 2 commits November 14, 2024 15:46

* Move some stuff around

e24ca87

* Update deps

* Combine metrics context data classes into the same class.

abec929

lukebakken added 6 commits November 16, 2024 10:03

* Start bringing the metrics in-line with the Java AMQP 1.0 client.

330ce0c

*Combine metrics tests into a single test suite

c39559a

* Fix metrics tests

* Add elapsed timespan to publish measurements

7fdf08a

fixup

1206875

Thanks @aygalinc for noticing that a call to Pause is missing in a …

925c6ae

…flaky test (rabbitmq#84 (comment))

* Collect consume elapsed time duration.

94edb0d

lukebakken added 3 commits November 18, 2024 11:59

* Use seconds instead of milliseconds.

1e566fd

* Ensure all of the new metrics are tested.

c594cff

* Misc fixes, add InternalBugException

75ba9a8

lukebakken requested a review from Gsantomaggio November 18, 2024 22:04

lukebakken approved these changes Nov 18, 2024

View reviewed changes

Gsantomaggio approved these changes Nov 19, 2024

View reviewed changes

Gsantomaggio merged commit f47744a into rabbitmq:main Nov 19, 2024
2 checks passed

aygalinc deleted the feat/introduce_metrics_based_on_system_diagnostics branch November 25, 2024 14:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: introduce metrics based on system diagnostics #84

feat: introduce metrics based on system diagnostics #84

aygalinc commented Nov 3, 2024 •

edited

Loading

Gsantomaggio commented Nov 4, 2024

Gsantomaggio left a comment

aygalinc commented Nov 4, 2024

aygalinc commented Nov 4, 2024

Gsantomaggio commented Nov 4, 2024

aygalinc commented Nov 4, 2024

Gsantomaggio commented Nov 5, 2024

aygalinc commented Nov 5, 2024

Gsantomaggio commented Nov 7, 2024 •

edited

Loading

aygalinc commented Nov 7, 2024

Gsantomaggio commented Nov 7, 2024

Gsantomaggio commented Nov 8, 2024

Gsantomaggio commented Nov 8, 2024 •

edited

Loading

aygalinc commented Nov 14, 2024

lukebakken commented Nov 14, 2024

aygalinc commented Nov 14, 2024

lukebakken commented Nov 14, 2024

lukebakken commented Nov 14, 2024 •

edited

Loading

aygalinc commented Nov 15, 2024

lukebakken commented Nov 15, 2024

aygalinc commented Nov 15, 2024

aygalinc commented Nov 15, 2024

lukebakken commented Nov 15, 2024

lukebakken commented Nov 18, 2024 •

edited

Loading

aygalinc commented Nov 18, 2024

aygalinc commented Nov 18, 2024

lukebakken commented Nov 18, 2024

lukebakken left a comment

Gsantomaggio left a comment

feat: introduce metrics based on system diagnostics #84

feat: introduce metrics based on system diagnostics #84

Conversation

aygalinc commented Nov 3, 2024 • edited Loading

Gsantomaggio commented Nov 4, 2024

Gsantomaggio left a comment

Choose a reason for hiding this comment

aygalinc commented Nov 4, 2024

aygalinc commented Nov 4, 2024

Gsantomaggio commented Nov 4, 2024

aygalinc commented Nov 4, 2024

Gsantomaggio commented Nov 5, 2024

aygalinc commented Nov 5, 2024

Gsantomaggio commented Nov 7, 2024 • edited Loading

aygalinc commented Nov 7, 2024

Gsantomaggio commented Nov 7, 2024

Gsantomaggio commented Nov 8, 2024

Gsantomaggio commented Nov 8, 2024 • edited Loading

aygalinc commented Nov 14, 2024

lukebakken commented Nov 14, 2024

aygalinc commented Nov 14, 2024

lukebakken commented Nov 14, 2024

lukebakken commented Nov 14, 2024 • edited Loading

aygalinc commented Nov 15, 2024

lukebakken commented Nov 15, 2024

aygalinc commented Nov 15, 2024

aygalinc commented Nov 15, 2024

lukebakken commented Nov 15, 2024

lukebakken commented Nov 18, 2024 • edited Loading

aygalinc commented Nov 18, 2024

aygalinc commented Nov 18, 2024

lukebakken commented Nov 18, 2024

lukebakken left a comment

Choose a reason for hiding this comment

Gsantomaggio left a comment

Choose a reason for hiding this comment

aygalinc commented Nov 3, 2024 •

edited

Loading

Gsantomaggio commented Nov 7, 2024 •

edited

Loading

Gsantomaggio commented Nov 8, 2024 •

edited

Loading

lukebakken commented Nov 14, 2024 •

edited

Loading

lukebakken commented Nov 18, 2024 •

edited

Loading