Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[fix][broker] PIP-399: Fix Metric Name for Delayed Queue #23712

Merged
merged 2 commits into from
Jan 14, 2025

Conversation

thetumbled
Copy link
Member

@thetumbled thetumbled commented Dec 11, 2024

PIP: #23789

Motivation

There is already one metric called pulsar_delayed_message_index_size_bytes for the total memory occupation used by delayed queue of one topic.

writeMetric(stream, "pulsar_delayed_message_index_size_bytes", stats.delayedTrackerMemoryUsage,
                cluster, namespace, topic, splitTopicAndPartitionIndexLabel);

Whereas, the metric for one sub also called pulsar_delayed_message_index_size_bytes, which do not comform the metric name norm and is confusing.

writeSubscriptionMetric(stream, "pulsar_delayed_message_index_size_bytes",
                    subsStats.delayedTrackerMemoryUsage, cluster, namespace, topic, sub, splitTopicAndPartitionIndexLabel);

Currently, it can export metric like:

# TYPE pulsar_delayed_message_index_size_bytes gauge
pulsar_delayed_message_index_size_bytes{cluster="MyPulsar",namespace="public/default",topic="persistent://public/default/testNack-partition-0"} 0
pulsar_delayed_message_index_size_bytes{cluster="MyPulsar",namespace="public/default",topic="persistent://public/default/testNack-partition-0",subscription="sub2"} 0

The metric of topic and subscription mix together. If we want to filter out the metric of sub to pick out the metric of topic, we need to use promsql like:
pulsar_delayed_message_index_size_bytes{subscription=""}
It is quite weird and i believe that if any user use it already, he/she will contribute a pr to fix this, as these label names can only be access by researching the code, which means he/she is an advanced user and is involved in the community probably.

Modifications

Rename the metric for one sub to pulsar_subscription_delayed_message_index_size_bytes.

Verifying this change

  • Make sure that the change passes the CI checks.

(Please pick either of the following options)

This change is a trivial rework / code cleanup without any test coverage.

Does this pull request potentially affect one of the following parts:

If the box was checked, please highlight the changes

  • Dependencies (add or upgrade a dependency)
  • The public API
  • The schema
  • The default values of configurations
  • The threading model
  • The binary protocol
  • The REST endpoints
  • The admin CLI options
  • The metrics
  • Anything that affects deployment

Documentation

  • doc
  • doc-required
  • doc-not-needed
  • doc-complete

Matching PR in forked repository

PR in forked repository:

@thetumbled thetumbled requested a review from coderzc December 11, 2024 08:45
@github-actions github-actions bot added the doc-not-needed Your PR changes do not impact docs label Dec 11, 2024
@dao-jun
Copy link
Member

dao-jun commented Dec 11, 2024

LGTM. However, since it changes the metrics name, I'm not sure if it is OK to merge the PR without a discussion.

@thetumbled
Copy link
Member Author

WDYT, @lhotari @coderzc

@lhotari
Copy link
Member

lhotari commented Dec 12, 2024

Whereas, the metric for one sub also called pulsar_delayed_message_index_size_bytes, which do not comform the metric name norm and is confusing.

@thetumbled Does this cause any problem other than it's not consistent with other subscription level metric name?

According to our PIP guidelines, changes like this require a PIP since it would be a breaking change in metrics. However, if this would be a bug where the metric wouldn't be usable at all, then a PIP wouldn't be required.

@thetumbled
Copy link
Member Author

thetumbled commented Dec 12, 2024

Whereas, the metric for one sub also called pulsar_delayed_message_index_size_bytes, which do not comform the metric name norm and is confusing.

@thetumbled Does this cause any problem other than it's not consistent with other subscription level metric name?

According to our PIP guidelines, changes like this require a PIP since it would be a breaking change in metrics. However, if this would be a bug where the metric wouldn't be usable at all, then a PIP wouldn't be required.

Currently, it can export metric like:

# TYPE pulsar_delayed_message_index_size_bytes gauge
pulsar_delayed_message_index_size_bytes{cluster="MyPulsar",namespace="public/default",topic="persistent://public/default/testNack-partition-0"} 0
pulsar_delayed_message_index_size_bytes{cluster="MyPulsar",namespace="public/default",topic="persistent://public/default/testNack-partition-0",subscription="sub2"} 0

The metric of topic and subscription mix together. If we want to filter out the metric of sub to pick out the metric of topic, we need to use promsql like:
pulsar_delayed_message_index_size_bytes{subscription=""}
It is quite weird and i believe that if any user use it already, he/she will contribute a pr to fix this, as these label names can only be access by researching the code, which means he/she is an advanced user and is involved in the community probably.

@lhotari
Copy link
Member

lhotari commented Dec 12, 2024

Currently, it can export metric like:

# TYPE pulsar_delayed_message_index_size_bytes gauge
pulsar_delayed_message_index_size_bytes{cluster="MyPulsar",namespace="public/default",topic="persistent://public/default/testNack-partition-0"} 0
pulsar_delayed_message_index_size_bytes{cluster="MyPulsar",namespace="public/default",topic="persistent://public/default/testNack-partition-0",subscription="sub2"} 0

The metric of topic and subscription mix together. If we want to filter out the metric of sub to pick out the metric of topic, we need to use promsql like: pulsar_delayed_message_index_size_bytes{subscription=""} It is quite weird and i believe that if any user use it already, he/she will contribute a pr to fix this, as these label names can only be access by researching the code, which means he/she is an advanced user and is involved in the community probably.

@thetumbled Please add this directly to the description of this PR since it clarifies the issue. I'd suggest to start a discussion on that mailing list with these details. It's simplest to create a minimal PIP in any case to get this change documented.

@thetumbled
Copy link
Member Author

Currently, it can export metric like:

# TYPE pulsar_delayed_message_index_size_bytes gauge
pulsar_delayed_message_index_size_bytes{cluster="MyPulsar",namespace="public/default",topic="persistent://public/default/testNack-partition-0"} 0
pulsar_delayed_message_index_size_bytes{cluster="MyPulsar",namespace="public/default",topic="persistent://public/default/testNack-partition-0",subscription="sub2"} 0

The metric of topic and subscription mix together. If we want to filter out the metric of sub to pick out the metric of topic, we need to use promsql like: pulsar_delayed_message_index_size_bytes{subscription=""} It is quite weird and i believe that if any user use it already, he/she will contribute a pr to fix this, as these label names can only be access by researching the code, which means he/she is an advanced user and is involved in the community probably.

@thetumbled Please add this directly to the description of this PR since it clarifies the issue. I'd suggest to start a discussion on that mailing list with these details. It's simplest to create a minimal PIP in any case to get this change documented.

Sure, i will start one to discuss on this change.

@thetumbled thetumbled changed the title [fix][broker] fix metric for delayed memory occupation of sub. [fix][broker] PIP-399: Fix Metric Name for Delayed Queue Dec 27, 2024
@thetumbled
Copy link
Member Author

Currently, it can export metric like:

# TYPE pulsar_delayed_message_index_size_bytes gauge
pulsar_delayed_message_index_size_bytes{cluster="MyPulsar",namespace="public/default",topic="persistent://public/default/testNack-partition-0"} 0
pulsar_delayed_message_index_size_bytes{cluster="MyPulsar",namespace="public/default",topic="persistent://public/default/testNack-partition-0",subscription="sub2"} 0

The metric of topic and subscription mix together. If we want to filter out the metric of sub to pick out the metric of topic, we need to use promsql like: pulsar_delayed_message_index_size_bytes{subscription=""} It is quite weird and i believe that if any user use it already, he/she will contribute a pr to fix this, as these label names can only be access by researching the code, which means he/she is an advanced user and is involved in the community probably.

@thetumbled Please add this directly to the description of this PR since it clarifies the issue. I'd suggest to start a discussion on that mailing list with these details. It's simplest to create a minimal PIP in any case to get this change documented.

I propose a pip to discuss on this change: #23789.

@thetumbled
Copy link
Member Author

The vote for pip passed , please review this pr again, thanks. @lhotari @dao-jun @BewareMyPower @poorbarcode @Shawyeok @nodece

Copy link
Contributor

@Shawyeok Shawyeok left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM for the production code. However, you need to update the unit test PersistentTopicTest#testDelayedDeliveryTrackerMemoryUsageMetric to align with these changes.

@thetumbled
Copy link
Member Author

LGTM for the production code. However, you need to update the unit test PersistentTopicTest#testDelayedDeliveryTrackerMemoryUsageMetric to align with these changes.

Fixed.

Copy link
Contributor

@Shawyeok Shawyeok left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@thetumbled thetumbled added release/3.3.5 type/bug The PR fixed a bug or issue reported a bug labels Jan 14, 2025
@thetumbled thetumbled merged commit 5be922b into apache:master Jan 14, 2025
55 of 56 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/metrics doc-not-needed Your PR changes do not impact docs ready-to-test release/3.0.10 release/3.3.5 release/4.0.3 type/bug The PR fixed a bug or issue reported a bug
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants