feat repair CSL on mismatch with Domain config #596

Open: 3 commits to merge into base branch main

dorjesinpo
Collaborator

We rely on the Primary to pick up the new domain config, but if the Primary fails, the new config does not make it to the CSL on the existing Replica.
As a result, the new Primary may crash because the domain config does not match the CSL:

  • "#QUEUE_STORAGE_NOTFOUND Virtual storage does not exist for AppId "
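
To illustrate the kind of mismatch being repaired, here is a minimal sketch of computing which App IDs are present in the domain config but missing from the CSL, and vice versa. It uses `std::` containers rather than `bsl::` so that it compiles standalone. `AppIdDiff` and `diffAppIds` are hypothetical names and not part of this change; only `addedIds` and `removedIds` mirror the vectors in the diff under review.

```cpp
#include <algorithm>
#include <cassert>
#include <iterator>
#include <set>
#include <string>
#include <vector>

// Hypothetical result type: not part of the BlazingMQ code base.
struct AppIdDiff {
    std::vector<std::string> addedIds;    // in the domain config, not in the CSL
    std::vector<std::string> removedIds;  // in the CSL, not in the domain config
};

// Hypothetical helper: compare the App IDs declared in the domain config
// with the App IDs recorded in the CSL. Both inputs are sorted sets, which
// is what std::set_difference requires.
AppIdDiff diffAppIds(const std::set<std::string>& domainConfigIds,
                     const std::set<std::string>& cslIds)
{
    AppIdDiff diff;
    std::set_difference(domainConfigIds.begin(),
                        domainConfigIds.end(),
                        cslIds.begin(),
                        cslIds.end(),
                        std::back_inserter(diff.addedIds));
    std::set_difference(cslIds.begin(),
                        cslIds.end(),
                        domainConfigIds.begin(),
                        domainConfigIds.end(),
                        std::back_inserter(diff.removedIds));
    return diff;
}
```

If both vectors come back empty, the two views agree and there is nothing to repair; otherwise the difference is what a repair step would have to apply to the CSL.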

@dorjesinpo dorjesinpo requested a review from a team as a code owner February 4, 2025 23:34
@dorjesinpo dorjesinpo added the enhancement New feature or request label Feb 4, 2025
@dorjesinpo dorjesinpo changed the title repair CSL on mismatch with Domain config [feat] repair CSL on mismatch with Domain config Feb 4, 2025
@dorjesinpo dorjesinpo changed the title [feat] repair CSL on mismatch with Domain config feat repair CSL on mismatch with Domain config Feb 5, 2025
@dorjesinpo dorjesinpo requested a review from kaikulimu February 5, 2025 01:19
Comment on lines 2851 to 2852
bsl::vector<bsl::string> addedIds(d_allocator_p);
bsl::vector<bsl::string> removedIds(d_allocator_p);
Collaborator

I think we should prefer the default allocator here. The object that will technically own this memory is the function wrapper returned by bind, which is allocated with the default allocator, so supplying this class's allocator might just lead to fragmentation.

Collaborator Author

Interesting. I think the best option would be to pass a bsl::shared_ptr<bsl::vector<bsl::string>> to avoid unnecessary copying.
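
A minimal sketch of the shared-pointer approach, written with `std::` types instead of `bsl::` so that it compiles standalone. `AppIds`, `onUpdate`, and `makeCallback` are hypothetical names for illustration only. The bound functor stores copies of the two shared pointers, not copies of the vectors they point to.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <memory>
#include <string>
#include <vector>

typedef std::vector<std::string> AppIds;

// Hypothetical callback: receives the ID lists through shared pointers and
// returns the total number of IDs it sees.
std::size_t onUpdate(const std::shared_ptr<AppIds>& addedIds,
                     const std::shared_ptr<AppIds>& removedIds)
{
    return addedIds->size() + removedIds->size();
}

// Hypothetical factory: binding copies only the shared pointers (a
// reference-count increment each); the vectors themselves are never copied.
std::function<std::size_t()>
makeCallback(const std::shared_ptr<AppIds>& addedIds,
             const std::shared_ptr<AppIds>& removedIds)
{
    return std::bind(&onUpdate, addedIds, removedIds);
}
```

Because the callback and the caller share the same vectors, whoever creates them also decides which allocator they use, independently of how the function wrapper itself is allocated.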

@@ -235,6 +236,7 @@ ClusterQueueHelper::QueueLiveState::QueueLiveState(
, d_numHandleCreationsInProgress(other.d_numHandleCreationsInProgress)
, d_queueExpirationTimestampMs(other.d_queueExpirationTimestampMs)
, d_pending(other.d_pending)
, d_pendingUpdates(other.d_pendingUpdates)
Collaborator

Suggested change
- , d_pendingUpdates(other.d_pendingUpdates)
+ , d_pendingUpdates(other.d_pendingUpdates, allocator)
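
The point of the suggestion can be sketched with `std::pmr` as a stand-in for the BDE allocator model, on the assumption that the two behave alike here: a plain copy of an allocator-aware container does not inherit the source's allocator and falls back to the default one. `LiveState` below is a hypothetical struct, not the real `QueueLiveState`, and the example requires C++17.

```cpp
#include <cassert>
#include <memory_resource>
#include <string>
#include <vector>

// Hypothetical stand-in for an allocator-aware state object.
struct LiveState {
    std::pmr::vector<std::pmr::string> d_pendingUpdates;

    explicit LiveState(std::pmr::memory_resource* resource)
    : d_pendingUpdates(resource)
    {
    }

    // Allocator-extended copy constructor: the supplied resource is
    // forwarded to the member. Writing 'd_pendingUpdates(other.d_pendingUpdates)'
    // instead would silently place the copy on the default resource.
    LiveState(const LiveState& other, std::pmr::memory_resource* resource)
    : d_pendingUpdates(other.d_pendingUpdates, resource)
    {
    }
};
```

With the resource forwarded, the copied member and its elements allocate from the resource the caller asked for rather than from the global default.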

Collaborator

@kaikulimu kaikulimu left a comment

Some comments

@kaikulimu kaikulimu assigned dorjesinpo and unassigned kaikulimu Feb 14, 2025
Signed-off-by: dorjesinpo <129227380+dorjesinpo@users.noreply.github.com>
Signed-off-by: dorjesinpo <129227380+dorjesinpo@users.noreply.github.com>
@dorjesinpo dorjesinpo force-pushed the fix/repair-csl branch 2 times, most recently from 43e86fe to baa5767 Compare February 20, 2025 21:40
Signed-off-by: dorjesinpo <129227380+dorjesinpo@users.noreply.github.com>
@bmq-oss-ci bmq-oss-ci bot left a comment

Build 2497 of commit c11fc4e has completed with FAILURE

@dorjesinpo dorjesinpo assigned kaikulimu and unassigned dorjesinpo Feb 21, 2025
Collaborator

@kaikulimu kaikulimu left a comment

lgtm

Labels
enhancement New feature or request
3 participants