Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[fix][client] Fix reader message filtering issue during blue-green cluster switch #23693

Merged
merged 2 commits into from
Dec 20, 2024

Conversation

hrzzzz
Copy link
Contributor

@hrzzzz hrzzzz commented Dec 9, 2024

Motivation

When the delayed acknowledgment feature is enabled, the method org.apache.pulsar.client.impl.PersistentAcknowledgmentsGroupingTracker#isDuplicate filters out messages that have already been acknowledged on the client side.

When using the blue-green cluster feature, after switching from the blue cluster to the green cluster, the message IDs generated by the green cluster may be smaller than those from the blue cluster. In this case, if a reader is used for consumption, the method org.apache.pulsar.client.impl.PersistentAcknowledgmentsGroupingTracker#isDuplicate will filter out all messages with IDs less than or equal to lastCumulativeAck, which can result in the loss of messages with IDs less than or equal to lastCumulativeAck after switching to the green cluster.

Modifications

Before switching from the blue cluster to the green cluster, first clear the contents of org.apache.pulsar.client.impl.AcknowledgmentsGroupingTracker

Verifying this change

  • Make sure that the change passes the CI checks.

Added test in: org.apache.pulsar.broker.service.ClusterMigrationTest#testMigrationWithReader

Does this pull request potentially affect one of the following parts:

If the box was checked, please highlight the changes

  • Dependencies (add or upgrade a dependency)
  • The public API
  • The schema
  • The default values of configurations
  • The threading model
  • The binary protocol
  • The REST endpoints
  • The admin CLI options
  • The metrics
  • Anything that affects deployment

Documentation

  • doc
  • doc-required
  • doc-not-needed
  • doc-complete

Matching PR in forked repository

PR in forked repository: hrzzzz#11

@github-actions github-actions bot added the doc-not-needed Your PR changes do not impact docs label Dec 9, 2024
@thetumbled
Copy link
Member

I am curious about how pulsar ensure the two message ids from two clusters corresponding to the same message are equal, or they are just not equal? Not equal may introduce lots of problem.

@lhotari
Copy link
Member

lhotari commented Dec 9, 2024

I am curious about how pulsar ensure the two message ids from two clusters corresponding to the same message are equal, or they are just not equal? Not equal may introduce lots of problem.

@thetumbled Good question. In the blue-green case the message id comparison most likely doesn't apply.
In general there isn't a way to compare message ids between clusters.

The Shadow Topic feature added a field to CommandSend to send the source cluster message id. It would be useful for translations,

// Message id of this message, currently is used in replicator for shadow topic.
optional MessageIdData message_id = 9;
. Taking this further and changing geo-replication to include the original message id and original source cluster would be useful for adding individual ack replication support to replicated subscriptions. Replicated subscriptions don't currently support individual acks and being able to compare message ids would be necessary for adding that feature.

@hrzzzz hrzzzz requested a review from lhotari December 11, 2024 02:31
Copy link
Member

@lhotari lhotari left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@codecov-commenter
Copy link

codecov-commenter commented Dec 20, 2024

Codecov Report

Attention: Patch coverage is 66.66667% with 1 line in your changes missing coverage. Please review.

Project coverage is 74.30%. Comparing base (bbc6224) to head (6c0c792).
Report is 809 commits behind head on master.

Files with missing lines Patch % Lines
...va/org/apache/pulsar/client/impl/ConsumerImpl.java 66.66% 0 Missing and 1 partial ⚠️
Additional details and impacted files

Impacted file tree graph

@@             Coverage Diff              @@
##             master   #23693      +/-   ##
============================================
+ Coverage     73.57%   74.30%   +0.73%     
+ Complexity    32624    32126     -498     
============================================
  Files          1877     1838      -39     
  Lines        139502   143056    +3554     
  Branches      15299    16232     +933     
============================================
+ Hits         102638   106303    +3665     
+ Misses        28908    28379     -529     
- Partials       7956     8374     +418     
Flag Coverage Δ
inttests 26.73% <33.33%> (+2.14%) ⬆️
systests 23.69% <33.33%> (-0.63%) ⬇️
unittests 73.67% <66.66%> (+0.83%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines Coverage Δ
...va/org/apache/pulsar/client/impl/ConsumerImpl.java 80.40% <66.66%> (+2.82%) ⬆️

... and 1005 files with indirect coverage changes

@lhotari lhotari merged commit 34c2f30 into apache:master Dec 20, 2024
54 of 55 checks passed
lhotari pushed a commit that referenced this pull request Dec 21, 2024
…uster switch (#23693)

Co-authored-by: ruihongzhou <ruihongzhou@tencent.com>
(cherry picked from commit 34c2f30)
lhotari pushed a commit that referenced this pull request Dec 21, 2024
…uster switch (#23693)

Co-authored-by: ruihongzhou <ruihongzhou@tencent.com>
(cherry picked from commit 34c2f30)
@hrzzzz hrzzzz deleted the fix-blue-green branch December 23, 2024 06:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants