better exchange of starting seqNum during handshakes #4766
Conversation
Walkthrough

This pull request introduces several modifications across multiple packages related to node management, message dispatching, and connection handling. The changes primarily focus on improving sequence number tracking, checkpointing mechanisms, and state management in the compute and orchestrator components. Key updates include refactoring node information storage, enhancing message starting positions, adding state retrieval methods, and improving metadata handling in message responses.
Sequence Diagram

```mermaid
sequenceDiagram
    participant Compute
    participant Orchestrator
    participant Dispatcher
    Compute->>Orchestrator: Handshake Request
    Orchestrator-->>Compute: Handshake Response (with StartingSeqNum)
    Compute->>Dispatcher: Start with Resolved Iterator
    Dispatcher->>Dispatcher: Checkpoint Progress
    Compute->>Orchestrator: Heartbeat
    Orchestrator-->>Compute: Heartbeat Response (with Metadata)
```
Actionable comments posted: 1
🔭 Outside diff range comments (1)
pkg/orchestrator/nodes/manager.go (1)
Line range hint 836-852: Handle potential errors in `selfRegister` method

In the `selfRegister` method, when calling `n.Get(ctx, nodeInfo.ID())`, the error handling only checks for `NotFoundError`. Consider handling other potential errors as well. Apply this diff to enhance error handling:

```diff
 state, err := n.Get(ctx, nodeInfo.ID())
 if err != nil {
 	if !bacerrors.IsErrorWithCode(err, bacerrors.NotFoundError) {
-		return bacerrors.New("failed to self-register node: %v", err).
+		return fmt.Errorf("failed to get node state during self-registration: %w", err)
 	}
 	state = models.NodeState{
 		Info:       nodeInfo,
 		Membership: models.NodeMembership.APPROVED,
 		ConnectionState: models.ConnectionState{
 			ConnectedSince: n.clock.Now().UTC(),
 		},
 	}
 }
```
🧹 Nitpick comments (13)
pkg/transport/nclprotocol/dispatcher/state.go (1)
39-48: Add documentation for the GetState method.

The method implementation is correct with proper thread-safety, but it's missing documentation that describes its purpose and return value.

```diff
-// GetState returns
+// GetState returns a snapshot of the current dispatcher state including
+// the last acknowledged sequence number, last observed sequence number,
+// and last checkpoint position.
 func (s *dispatcherState) GetState() State {
```

pkg/transport/nclprotocol/dispatcher/dispatcher.go (1)
241-259: Consider returning the error from doCheckpoint.

The implementation is well-structured with proper timeout and error handling. However, consider returning the error to allow callers to handle checkpoint failures, especially during shutdown.

```diff
-func (d *Dispatcher) doCheckpoint(ctx context.Context) {
+func (d *Dispatcher) doCheckpoint(ctx context.Context) error {
 	checkpointTarget := d.state.getCheckpointSeqNum()
 	if checkpointTarget == 0 {
 		// Nothing new to checkpoint
-		return
+		return nil
 	}
 	checkpointCtx, cancel := context.WithTimeout(ctx, d.config.CheckpointTimeout)
 	defer cancel()
 	if err := d.watcher.Checkpoint(checkpointCtx, checkpointTarget); err != nil {
 		log.Error().Err(err).
 			Uint64("target", checkpointTarget).
 			Msg("Failed to checkpoint watcher")
-		return
+		return fmt.Errorf("failed to checkpoint watcher: %w", err)
 	}
 	d.state.updateLastCheckpoint(checkpointTarget)
+	return nil
 }
```

pkg/orchestrator/nodes/manager.go (4)
465-466: Remove redundant empty lines

There are consecutive empty lines at lines 465-466. Removing unnecessary empty lines improves code readability. Apply this diff:

```diff
-	}
-
+	}
```
467-470: Handle potential error when resolving starting sequence number

In the `Handshake` method, when calling `n.resolveStartingOrchestratorSeqNum`, ensure that the error is checked correctly. This is already being done, but consider providing more context in the error message. Apply this diff to enhance the error message:

```diff
 state.ConnectionState.LastOrchestratorSeqNum, err = n.resolveStartingOrchestratorSeqNum(ctx, isReconnect, existing)
 if err != nil {
-	return messages.HandshakeResponse{}, fmt.Errorf("failed to resolve starting sequence number: %w", err)
+	return messages.HandshakeResponse{}, fmt.Errorf("failed to resolve starting orchestrator sequence number for node %s: %w", request.NodeInfo.ID(), err)
 }
```
754-781: Improve documentation and error messages in `resolveStartingOrchestratorSeqNum`

The comments in this function are helpful, but consider making them more concise and clear. Additionally, when returning errors, provide more context. Apply this diff:

```diff
 // resolveStartingOrchestratorSeqNum determines where a node should start receiving messages from.
-//
-// For reconnecting nodes, we trust the sequence numbers from our store rather than what the
-// compute node reports. This prevents issues with compute nodes restarting with same ID but
-// fresh state, where they would ask to start from 0.
+// For reconnecting nodes, use the stored sequence number to prevent issues with nodes restarting with the same ID but fresh state.
 // For new nodes, we start them from the latest sequence number to avoid overwhelming them
 // with historical events.
-
-// TODO: Add support for snapshots to allow nodes to catch up on missed state without
-// replaying all historical events. For now, we always start from latest to avoid
-// overwhelming nodes that have been down for a long time.
 func (n *nodesManager) resolveStartingOrchestratorSeqNum(
 	ctx context.Context, isReconnect bool, existing models.NodeState) (uint64, error) {
 	if isReconnect {
 		// For reconnecting nodes, trust our stored sequence number
 		return existing.ConnectionState.LastOrchestratorSeqNum, nil
 	}

 	// For new nodes, start from latest sequence number
 	latestSeq, err := n.eventstore.GetLatestEventNum(ctx)
 	if err != nil {
-		return 0, fmt.Errorf("failed to get latest event number: %w", err)
+		return 0, fmt.Errorf("failed to get latest event number from event store: %w", err)
 	}
 	return latestSeq, nil
 }
```
795-798: Add TODO for sequence number advancement logic

The current implementation allows sequence numbers to move backwards. A TODO comment exists, but consider creating a GitHub issue to track this enhancement.
Would you like me to open a GitHub issue to track the implementation of proper sequence number comparison logic to ensure sequence numbers only advance forward?
pkg/transport/nclprotocol/compute/dataplane.go (1)
177-194: Clarify the logic in `resolveStartingIterator` method

The method `resolveStartingIterator` currently ignores the `lastReceivedSeqNum`. If this is intentional, consider documenting the reasoning more clearly and possibly logging a warning if `lastReceivedSeqNum` is non-zero. Enhance the method as follows:

```go
func (dp *DataPlane) resolveStartingIterator(lastReceivedSeqNum uint64) watcher.EventIterator {
	if lastReceivedSeqNum != 0 {
		log.Warn().Uint64("lastReceivedSeqNum", lastReceivedSeqNum).Msg("Ignoring non-zero lastReceivedSeqNum; starting from trim horizon")
	}
	return watcher.TrimHorizonIterator()
}
```

This makes it explicit that `lastReceivedSeqNum` is currently not used and informs if it's non-zero.

pkg/transport/nclprotocol/compute/controlplane.go (1)
227-229: Handle checkpointing error when stopping

In the `Stop` method, if `checkpointProgress` fails, the error is logged but not returned. Consider returning the error to inform the caller. Modify the code to return the error:

```diff
 if err := cp.checkpointProgress(ctx); err != nil {
 	log.Error().Err(err).Msg("Failed to checkpoint progress before stopping")
+	return err
 }
 return nil
```
This way, the caller is aware of the failure during the shutdown process.
pkg/transport/nclprotocol/compute/dataplane_test.go (2)
173-175: Simplify context cancellation handling

In the `TestStartupFailureCleanup` test case, the select statement waiting for context cancellation can be simplified, as the context will be cancelled immediately. Simplify the code:

```go
s.Require().True(errors.Is(s.ctx.Err(), context.Canceled), "Context should be canceled")
```

This directly checks that the context is indeed canceled.
248-299: Improve test coverage with edge cases

In the `TestStartingPosition` method, consider adding edge cases where `LastReceivedSeqNum` is zero and when there are no initial events, to ensure the data plane behaves correctly in all scenarios. Add additional sub-tests within `TestStartingPosition` for these cases, for example along the lines of the sketch below.
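A possible shape for such sub-tests, as a rough sketch only: the helper names `publishInitialEvents`, `startDataPlaneFrom`, and `receivedMessageCount` are hypothetical stand-ins for whatever the existing test harness provides, and the usual `time` import is assumed.

```go
// Sketch only: helper names below are hypothetical, not part of the current suite.
func (s *DataPlaneTestSuite) TestStartingPositionEdgeCases() {
	testCases := []struct {
		name               string
		lastReceivedSeqNum uint64
		initialEvents      int
	}{
		{name: "zero seqNum with prior events", lastReceivedSeqNum: 0, initialEvents: 5},
		{name: "zero seqNum with empty event store", lastReceivedSeqNum: 0, initialEvents: 0},
	}

	for _, tc := range testCases {
		s.Run(tc.name, func() {
			// Seed the event store before the data plane starts.
			s.publishInitialEvents(tc.initialEvents)

			// Start the data plane as if the handshake reported tc.lastReceivedSeqNum.
			dp := s.startDataPlaneFrom(tc.lastReceivedSeqNum)
			defer dp.Stop(s.ctx)

			// With a trim-horizon iterator, everything already in the store should be
			// replayed, and nothing should arrive when the store is empty.
			s.Require().Eventually(func() bool {
				return s.receivedMessageCount() == tc.initialEvents
			}, time.Second, 10*time.Millisecond)
		})
	}
}
```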
pkg/node/requester.go (1)
55-55: LGTM: Type change aligns with "trust your own state" approach

Changing `NodeInfoStore` from `nodes.Store` to `nodes.Lookup` enforces read-only access to node information, which supports the PR's objective of having nodes trust their own state rather than allowing external modifications.

pkg/transport/nclprotocol/compute/manager_test.go (2)
118-123: Document the purpose of different sequence numbers.

While the test correctly sets up different sequence numbers for the compute node (124) and orchestrator (100), it would be helpful to add a comment explaining why these specific numbers were chosen and what scenario they're testing.

```diff
 // Configure handshake response to return a different sequence number
+// Using a lower sequence number (100) than the compute node's checkpoint (124)
+// to verify that the compute node trusts its own state over orchestrator's suggestion
 handshakeSeqNum := uint64(100)
```
Line range hint 114-182: Review sequence number synchronization strategy.

There appears to be a fundamental disconnect between the PR objectives and the test implementation:
- PR Objectives state: "trust your own state" approach where compute node relies on its local state
- Test Implementation shows: compute node adopts orchestrator's sequence number (100) despite having its own state (124)
This discrepancy needs to be resolved to ensure the implementation aligns with the intended design.
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (11)
- pkg/node/requester.go (3 hunks)
- pkg/orchestrator/nodes/manager.go (5 hunks)
- pkg/transport/nclprotocol/compute/controlplane.go (1 hunks)
- pkg/transport/nclprotocol/compute/dataplane.go (2 hunks)
- pkg/transport/nclprotocol/compute/dataplane_test.go (2 hunks)
- pkg/transport/nclprotocol/compute/manager.go (1 hunks)
- pkg/transport/nclprotocol/compute/manager_test.go (4 hunks)
- pkg/transport/nclprotocol/dispatcher/dispatcher.go (2 hunks)
- pkg/transport/nclprotocol/dispatcher/dispatcher_e2e_test.go (1 hunks)
- pkg/transport/nclprotocol/dispatcher/state.go (2 hunks)
- pkg/transport/nclprotocol/orchestrator/manager.go (3 hunks)
🔇 Additional comments (14)
pkg/transport/nclprotocol/dispatcher/state.go (1)
14-18: LGTM! Well-designed state encapsulation.

The `State` struct effectively encapsulates the sequence numbers and checkpoint state with clear, self-descriptive field names.
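For orientation, a plausible shape of such a struct, inferred from the doc comment suggested earlier in this review; the field names are assumptions, not a copy of the actual source:

```go
// Sketch only: the real State struct in pkg/transport/nclprotocol/dispatcher
// may use different field names.
type State struct {
	LastAckedSeqNum      uint64 // highest sequence number acknowledged by the orchestrator
	LastObservedSeqNum   uint64 // highest sequence number the dispatcher has seen
	LastCheckpointSeqNum uint64 // last sequence number persisted as a checkpoint
}
```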
pkg/transport/nclprotocol/dispatcher/dispatcher.go (2)
80-82: LGTM! Clean state access method.
The method provides a clean interface to access dispatcher state information.
233-233: LGTM! Improved checkpoint consistency.
The changes ensure consistent checkpoint handling for both periodic checkpoints and shutdown scenarios.
Also applies to: 236-236
pkg/transport/nclprotocol/dispatcher/dispatcher_e2e_test.go (1)
231-255: LGTM! Well-structured test for shutdown checkpoint behavior.
The test effectively verifies the final checkpoint behavior during shutdown by:
- Isolating the shutdown checkpoint from periodic checkpoints
- Ensuring state synchronization before shutdown
- Verifying the final checkpoint value
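In outline, that pattern looks roughly like the following sketch; `startDispatcherWithCheckpointInterval`, `publishAndAck`, and `readCheckpoint` are hypothetical helper names standing in for the actual test harness, and the `time` package is assumed to be imported:

```go
// Sketch of the shutdown-checkpoint assertion; helper names are hypothetical.
func (s *DispatcherE2ETestSuite) TestShutdownCheckpointSketch() {
	// A long checkpoint interval keeps periodic checkpoints out of the picture.
	d := s.startDispatcherWithCheckpointInterval(time.Hour)

	// Publish messages and wait for acknowledgements so the dispatcher state
	// is ahead of any persisted checkpoint.
	lastAcked := s.publishAndAck(5)

	// Stopping the dispatcher should trigger one final checkpoint.
	s.Require().NoError(d.Stop(s.ctx))

	// The persisted checkpoint should match the last acknowledged sequence number.
	s.Require().Equal(lastAcked, s.readCheckpoint())
}
```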
pkg/orchestrator/nodes/manager.go (1)
425-425: Check for error handling when retrieving existing node state

In the `Handshake` method, the variable `err` is not declared before being used in line 425. Ensure that the error returned by `n.Get(ctx, request.NodeInfo.ID())` is properly handled. Apply this diff to fix the issue:

```diff
- existing, err := n.Get(ctx, request.NodeInfo.ID())
+ existing, err := n.Get(ctx, request.NodeInfo.ID())
+ if err != nil {
+     // handle error appropriately
+ }
```
Likely invalid or redundant comment.
pkg/transport/nclprotocol/compute/dataplane.go (1)
124-124: Ensure correct context is passed to watcher.New

When creating the dispatcher watcher with `watcher.New()`, ensure that the context passed is managed appropriately to prevent leaks or premature cancellations. Please confirm that the context `ctx` is suitable for the lifetime of the watcher. If `ctx` can be cancelled before the watcher is done, consider using a separate context.
pkg/transport/nclprotocol/orchestrator/manager.go (1)
261-261: LGTM: Added message type metadata to responses
Good practice to include message type metadata in responses. This improves message handling and type safety in the transport layer.
Also applies to: 281-282
pkg/transport/nclprotocol/compute/manager.go (2)
316-320: LGTM: Core implementation of "trust your own state" approach
The implementation correctly updates the sequence tracker with the orchestrator's number after handshake. The comment clearly explains the rationale that the orchestrator may have reset or chosen a different starting point.
This change is central to the PR's objective of improving sequence number synchronization during handshakes.
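The shape of that update is roughly the following; this is a sketch rather than the exact code in manager.go. The identifiers `incomingSeqTracker`, `UpdateLastSeqNum`, `messages.HandshakeResponse`, and `StartingOrchestratorSeqNum` are taken from elsewhere in this review, while the surrounding method and receiver are illustrative:

```go
// Sketch: after a successful handshake, adopt the orchestrator's starting point.
func (c *ConnectionManager) afterHandshake(response messages.HandshakeResponse) {
	// Trust the orchestrator's chosen position: it may have reset its event
	// store, or deliberately started a brand-new node from the latest event.
	c.incomingSeqTracker.UpdateLastSeqNum(response.StartingOrchestratorSeqNum)
}
```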
316-320: Verify sequence number handling consistency
Let's verify that sequence number handling is consistent across the codebase.
✅ Verification successful
Based on the results, I can now generate the final response:
Sequence number handling is thread-safe and consistent
The sequence number handling implementation is correct and safe because:
- All sequence operations use atomic operations via `SequenceTracker`
- Updates are centralized in the `SequenceTracker` type with atomic Store/Load operations
- The handshake's sequence number update follows the same pattern as other usages in the codebase

Key findings:
- `UpdateLastSeqNum` is implemented using atomic operations (`Store`)
- `GetLastSeqNum` is implemented using atomic operations (`Load`)
- All sequence number updates across the codebase consistently use these atomic methods
- No direct access to the underlying sequence number field was found
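A minimal sketch of what such an atomic tracker looks like; the real `SequenceTracker` in the nclprotocol package may name its field differently and carry additional methods:

```go
package nclprotocol

import "sync/atomic"

// SequenceTracker records the last processed sequence number using atomic
// operations, so concurrent readers and writers never race.
type SequenceTracker struct {
	lastSeqNum atomic.Uint64
}

// UpdateLastSeqNum stores the most recently processed sequence number.
func (t *SequenceTracker) UpdateLastSeqNum(seq uint64) {
	t.lastSeqNum.Store(seq)
}

// GetLastSeqNum loads the most recently recorded sequence number.
func (t *SequenceTracker) GetLastSeqNum() uint64 {
	return t.lastSeqNum.Load()
}
```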
🏁 Scripts executed
The following scripts were executed for the analysis:
Script:
#!/bin/bash
# Description: Check for consistent sequence number handling
# Look for other places where sequence numbers are updated
rg -A 3 "UpdateLastSeqNum|GetLastSeqNum"
# Check for potential race conditions in sequence number handling
ast-grep --pattern 'func $_(ctx context.Context, $$$) {
$$$
$x.UpdateLastSeqNum($_)
$$$
}'
Length of output: 3641
pkg/node/requester.go (1)
80-83: LGTM: Improved encapsulation of node state management

The removal of `nodeStore` from `createNodeManager`'s return values and using `nodesManager` directly in the `Requester` struct initialization improves encapsulation of node state management. This change:
- Centralizes node state management within the manager
- Reduces the risk of inconsistent state by eliminating direct store access
- Supports the PR's goal of reliable sequence number synchronization
Let's verify that the node manager is the single source of truth for node state:
Also applies to: 330-334
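To illustrate the read-only split behind this (the `nodes.Lookup` vs `nodes.Store` change noted earlier), here is a hypothetical shape of the two interfaces; the actual method sets in pkg/orchestrator/nodes may differ:

```go
// Sketch only: real interface definitions may differ.
type Lookup interface {
	// Get returns the state of a single node by ID.
	Get(ctx context.Context, nodeID string) (models.NodeState, error)
	// List returns the state of all known nodes.
	List(ctx context.Context) ([]models.NodeState, error)
}

type Store interface {
	Lookup
	// Put creates or replaces a node's state.
	Put(ctx context.Context, state models.NodeState) error
	// Delete removes a node's state.
	Delete(ctx context.Context, nodeID string) error
}
```

Handing the `Requester` only the `Lookup` half turns "read but do not mutate" into a compile-time guarantee rather than a convention.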
pkg/transport/nclprotocol/compute/manager_test.go (4)
114-117: LGTM! Initial checkpoint setup aligns with PR objectives.
The setup correctly simulates a compute node with existing local state (sequence number 124), which aligns with the PR's "trust your own state" approach.
132-136: LGTM! Handshake verification ensures compute node communicates its state.
The test correctly verifies that the compute node includes its local sequence number in the handshake, which is crucial for the new "trust your own state" approach.
155-163: Verify if heartbeat sequence number behavior aligns with PR objectives.
The test expects the compute node to use the orchestrator's sequence number (100) in heartbeats, but this seems to contradict the PR's "trust your own state" approach. According to the PR objectives, shouldn't the compute node continue using its own sequence number (124)?
182-182: Verify sequence number behavior in subsequent heartbeats.
The test comment suggests that using the orchestrator's sequence number in subsequent heartbeats is intentional. This needs verification against the PR objectives which emphasize "trust your own state".
✅ Verification successful
Let me gather more context about the sequence number implementation to verify the behavior.
Let me search for more specific information about sequence tracking implementation.
Based on the gathered evidence, I can now provide a final response:
Sequence number handling in heartbeats is correctly implemented
The implementation correctly maintains sequence tracking through the `SequenceTracker` component, which is consistently used across handshakes and heartbeats. The code shows that:
- The sequence number is properly initialized from the handshake response (`StartingOrchestratorSeqNum`)
- The tracker is continuously updated and maintains the last processed sequence
- Both control plane and connection manager use the same sequence tracker instance
- The sequence number is checkpointed for persistence
The comment "Should continue using sequence number from handshake" aligns with the implementation and does not conflict with "trust your own state" as the sequence tracker maintains the state internally.
🏁 Scripts executed
The following scripts were executed for the analysis:
Script:
#!/bin/bash
# Search for documentation or comments about sequence number handling
rg -A 5 "sequence number" --type md
Length of output: 2139
Script:
#!/bin/bash
# Search for sequence number handling in compute manager and related files
rg -A 10 "LastOrchestratorSeqNum" pkg/transport/nclprotocol/compute/
Length of output: 5971
Script:
#!/bin/bash
# Look for heartbeat sequence number handling
ast-grep --pattern 'func $$$Heartbeat$$$($$$) {
$$$
}'
Length of output: 58
Script:
#!/bin/bash
# Search for sequence number validation or verification logic
rg -A 5 "validateSeq|verifySeq|checkSeq" pkg/transport/nclprotocol/
Length of output: 69
Script:
#!/bin/bash
# Look for sequence tracker implementation
rg -A 10 "incomingSeqTracker" pkg/transport/nclprotocol/
# Also search for sequence tracking related types/interfaces
ast-grep --pattern 'type $$$SeqTracker$$$ interface {
$$$
}'
Length of output: 9310
Script:
#!/bin/bash
# Search for sequence number handling in messages
rg -A 5 "SeqNum" pkg/transport/nclprotocol/messages/
Length of output: 128
```go
if request.LastComputeSeqNum > 0 {
	updated.LastComputeSeqNum = request.LastComputeSeqNum
}
n.updateSequenceNumbers(&updated, request.LastOrchestratorSeqNum, request.LastComputeSeqNum)
```
🛠️ Refactor suggestion
Consider sequence number validation before updating
In the `Heartbeat` method, sequence numbers are updated without validation. It's advisable to ensure that sequence numbers only advance forward to prevent potential rollback due to race conditions. Implement validation logic to ensure `LastOrchestratorSeqNum` and `LastComputeSeqNum` do not decrease:
func (n *nodesManager) updateSequenceNumbers(state *models.ConnectionState, orchestratorSeq, computeSeq uint64) {
if orchestratorSeq > state.LastOrchestratorSeqNum {
state.LastOrchestratorSeqNum = orchestratorSeq
}
if computeSeq > state.LastComputeSeqNum {
state.LastComputeSeqNum = computeSeq
}
}
Problem
When nodes restart or lose state, the current sequence number synchronization can lead to message gaps or duplicates. This occurs because nodes unconditionally trust each other's sequence numbers during handshake, without considering local state recovery scenarios.
Solution
This PR implements a "trust your own state" approach for sequence number synchronization during handshakes. Each node relies on its local state to determine its starting point, while using the handshake to inform the other party of its position.
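Sketched as message shapes, the exchange looks roughly like this. The field names `LastOrchestratorSeqNum`, `LastComputeSeqNum`, and `StartingOrchestratorSeqNum` come from the review above, while the exact struct layouts are assumptions for illustration:

```go
// Hypothetical shapes of the handshake messages; the real structs carry more fields.
type HandshakeRequest struct {
	NodeInfo models.NodeInfo
	// What this compute node has processed locally, read from its own checkpoint.
	LastOrchestratorSeqNum uint64
	// What this compute node has published locally.
	LastComputeSeqNum uint64
}

type HandshakeResponse struct {
	// Where the orchestrator will start streaming from, resolved from its own store:
	// the stored position for a reconnecting node, the latest event for a new node.
	StartingOrchestratorSeqNum uint64
}
```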
Changes in Handshake Flow
Orchestrator Behavior
Compute Node Behavior