ddl_puller (ticdc): handle dorp pk/uk ddl correctly #10965

Merged
merged 6 commits into from
Apr 25, 2024

Conversation

asddongmen
Contributor

@asddongmen asddongmen commented Apr 24, 2024

What problem does this PR solve?

Issue Number: close #10890

What is changed and how it works?

When handling a DDL job in ddlPuller, check whether it would transition the table from eligible to ineligible for replication. If so, return an error and stop the changefeed. The error message tells the user how to handle the situation.
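
The check described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `TableInfo`, `UniqueIndex`, and `CheckIneligibleTransition` are hypothetical names, and the eligibility rule (a primary key, or a unique index whose columns are all NOT NULL) is a simplified assumption that ignores details such as virtual generated columns. Skipping the check when `forceReplicate` is set is likewise an assumption based on the instructions in the error message.

```go
package main

import "fmt"

// UniqueIndex is a hypothetical, simplified view of a unique index.
type UniqueIndex struct {
	Name           string
	AllColsNotNull bool // true if every indexed column is declared NOT NULL
}

// TableInfo is a hypothetical, simplified snapshot of a table schema.
type TableInfo struct {
	Name          string
	HasPrimaryKey bool
	UniqueIndexes []UniqueIndex
}

// IsEligible reports whether the table has a usable row identity:
// a primary key, or a unique index whose columns are all NOT NULL.
// This rule is an assumption made for illustration.
func (t TableInfo) IsEligible() bool {
	if t.HasPrimaryKey {
		return true
	}
	for _, idx := range t.UniqueIndexes {
		if idx.AllColsNotNull {
			return true
		}
	}
	return false
}

// CheckIneligibleTransition returns an error when a DDL turns an eligible
// table into an ineligible one and forceReplicate is off. The caller is
// expected to stop the changefeed on a non-nil error.
func CheckIneligibleTransition(before, after TableInfo, query string, forceReplicate bool) error {
	if forceReplicate {
		// Assumption: with force-replicate enabled, the DDL is allowed through.
		return nil
	}
	if before.IsEligible() && !after.IsEligible() {
		return fmt.Errorf("an eligible table becomes ineligible after DDL: [%s]", query)
	}
	return nil
}
```

In the manual test below, `t1` has a primary key and a nullable unique column, so under this simplified rule dropping the primary key leaves no valid index and the check fails.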

Check List

Tests

  • Unit test
  • Manual test (add detailed scripts or steps below)
  1. Create a table t1.
CREATE TABLE t1 (
id INT PRIMARY KEY /*T![clustered_index] NONCLUSTERED */,
name VARCHAR(255),
email VARCHAR(255) UNIQUE
);
  2. Drop its primary key.
ALTER TABLE t1 DROP PRIMARY KEY; 
  3. List this changefeed.
-> # ./cdc cli changefeed list 
[
  {
    "id": "test",
    "namespace": "default",
    "summary": {
      "state": "warning",
      "tso": 449299724779651075,
      "checkpoint": "2024-04-24 15:07:11.563",
      "error": {
        "time": "2024-04-24T16:52:17.129593307+08:00",
        "addr": "127.0.0.1:8300",
        "code": "CDC:ErrProcessorUnknown",
        "message": "An eligible table become ineligible after DDL: [alter table t1 drop primary key] it is a dangerous operation. If you want to replicate this ddl, pelase pause the changefeed and update the `force-replicate=true` in the changefeed configuration, then resume the changefeed. Otherwise, will cause data loss."
      }
    }
  }
]
  4. Update the changefeed's configuration as instructed in the error message, then resume the changefeed. Check the CDC log to confirm that the DDL was replicated.
[2024/04/24 16:55:09.679 +08:00] [INFO] [ddl_manager.go:558] ["execute a ddl event successfully"] [ddl="ALTER TABLE `t1` DROP PRIMARY KEY"] [namespace=default] [changefeed=test] ....
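
For anyone who wants to detect this condition from a script rather than by reading the output by eye, the JSON printed by `changefeed list` in the manual test can be parsed along these lines. This is only a sketch: the struct fields mirror the keys visible in the output above, `BlockedByIneligibleDDL` is a hypothetical helper, and matching on a substring of the error message is a fragile assumption, since the wording may change between versions.

```go
package main

import (
	"encoding/json"
	"strings"
)

// changefeedError mirrors the "error" object shown in the list output.
type changefeedError struct {
	Code    string `json:"code"`
	Message string `json:"message"`
}

// changefeedSummary mirrors the fields of "summary" that this sketch needs.
type changefeedSummary struct {
	State string           `json:"state"`
	Error *changefeedError `json:"error"`
}

// changefeedEntry mirrors one element of the top-level JSON array.
type changefeedEntry struct {
	ID        string            `json:"id"`
	Namespace string            `json:"namespace"`
	Summary   changefeedSummary `json:"summary"`
}

// BlockedByIneligibleDDL returns the IDs of changefeeds that are in the
// "warning" state and whose error message mentions a table becoming
// ineligible after a DDL.
func BlockedByIneligibleDDL(listOutput []byte) ([]string, error) {
	var entries []changefeedEntry
	if err := json.Unmarshal(listOutput, &entries); err != nil {
		return nil, err
	}
	var ids []string
	for _, e := range entries {
		if e.Summary.State != "warning" || e.Summary.Error == nil {
			continue
		}
		if strings.Contains(e.Summary.Error.Message, "ineligible after DDL") {
			ids = append(ids, e.ID)
		}
	}
	return ids, nil
}
```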

Questions

Will it cause performance regression or break compatibility?
Do you need to update user documentation, design documentation or monitoring documentation?

Release note

Fix a bug that might cause the replication of the 'drop primary key' or 'drop unique key' DDL to be lost.

Signed-off-by: dongmen <414110582@qq.com>
Contributor

ti-chi-bot bot commented Apr 24, 2024

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@ti-chi-bot ti-chi-bot bot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. release-note Denotes a PR that will be considered when it comes time to generate release notes. size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Apr 24, 2024
Signed-off-by: dongmen <414110582@qq.com>
@ti-chi-bot ti-chi-bot bot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Apr 24, 2024
@asddongmen
Contributor Author

/test verify

@asddongmen
Contributor Author

/test all

@asddongmen
Contributor Author

/review

Signed-off-by: dongmen <414110582@qq.com>
@asddongmen
Contributor Author

/test verify

Signed-off-by: dongmen <414110582@qq.com>
@asddongmen
Contributor Author

/test verify

@asddongmen asddongmen added needs-cherry-pick-release-8.1 Should cherry pick this PR to release-8.1 branch. needs-cherry-pick-release-6.5 Should cherry pick this PR to release-6.5 branch. needs-cherry-pick-release-7.1 Should cherry pick this PR to release-7.1 branch. and removed affects-8.1 affects-6.5 affects-7.1 affects-7.5 labels Apr 25, 2024
@ti-chi-bot ti-chi-bot bot added needs-1-more-lgtm Indicates a PR needs 1 more LGTM. approved labels Apr 25, 2024
Signed-off-by: dongmen <414110582@qq.com>
Contributor

ti-chi-bot bot commented Apr 25, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: CharlesCheung96, sdojjy

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [CharlesCheung96,sdojjy]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ti-chi-bot ti-chi-bot bot added lgtm and removed needs-1-more-lgtm Indicates a PR needs 1 more LGTM. labels Apr 25, 2024
Contributor

ti-chi-bot bot commented Apr 25, 2024

[LGTM Timeline notifier]

Timeline:

  • 2024-04-25 10:12:08.432744559 +0000 UTC m=+255085.172647469: ☑️ agreed by CharlesCheung96.
  • 2024-04-25 10:46:44.144215404 +0000 UTC m=+257160.884118308: ☑️ agreed by sdojjy.

@asddongmen
Contributor Author

/retest

@ti-chi-bot ti-chi-bot bot merged commit d0329d7 into pingcap:master Apr 25, 2024
28 checks passed
ti-chi-bot pushed a commit to ti-chi-bot/tiflow that referenced this pull request Apr 25, 2024
Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io>
@ti-chi-bot
Member

In response to a cherrypick label: new pull request created to branch release-6.5: #10981.

@ti-chi-bot
Member

In response to a cherrypick label: new pull request created to branch release-7.1: #10982.

ti-chi-bot pushed a commit to ti-chi-bot/tiflow that referenced this pull request Apr 25, 2024
Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io>
ti-chi-bot pushed a commit to ti-chi-bot/tiflow that referenced this pull request Apr 25, 2024
Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io>
@ti-chi-bot
Member

In response to a cherrypick label: new pull request created to branch release-7.5: #10983.

@ti-chi-bot
Member

In response to a cherrypick label: new pull request created to branch release-8.1: #10984.

@wuhuizuo
Contributor

/review summary the changes.

Contributor

ti-chi-bot bot commented Apr 26, 2024

@wuhuizuo:

Pull Request Review

Title

"ddl_puller (ticdc): handle drop primary key/unique key DDL correctly"

Changes Made

This pull request focuses on handling a DDL job in the ddlPuller by checking if it will transition the table from eligible to ineligible. If so, an error is returned and the changefeed is stopped. The error message provides instructions for the user on how to manage the situation.

The changes include:

  1. Check if the DDL event belongs to an ineligible table. If so, ignore it.
  2. Add a new function checkIneligibleTableDDL to check if the table is ineligible before and after the DDL.
  3. Adjust tests to match the new changes.

The main changes are in the following files:

  • cdc/owner/ddl_manager.go
  • cdc/puller/ddl_puller.go
  • cdc/puller/ddl_puller_test.go

Tests

The changes include unit tests and manual tests. The manual test is detailed and checks various scenarios including creating a table, dropping its primary key, listing the changefeed, and updating the changefeed's config.

Release Note

Fix a bug that might cause the replication of the 'drop primary key' or 'drop unique key' DDL to be lost.

In response to this:

/review summary the changes.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@wuhuizuo
Contributor

/review default

Contributor

ti-chi-bot bot commented Apr 26, 2024

@wuhuizuo:

Review Report

General

The pull request is intended to solve the issue of correctly handling the DDL operations when a primary key or a unique key is dropped from a table in the TiFlow system. The PR checks if the table transitions from eligible to ineligible after a DDL operation and provides an error message with instructions if it does so.

Coding Style

The code follows the coding style of the project and is well-formatted. Variable and function names are descriptive and self-explanatory.

Logic

The logic of the changes seems correct. The function checkIneligibleTableDDL checks whether a table is ineligible before and after a DDL operation. If the table becomes ineligible after the operation, an error is returned. The unit tests cover all the cases and therefore the logic is sound.

Tests

The PR includes both unit tests and manual tests, which is very good. The unit tests cover the different scenarios and seem to be comprehensive. It's great that the manual tests are also included in the PR as they guide users to test the feature manually.

Potential Issues

  1. There is a minor typo in the error message: "pelase pause the changefeed" should be "please pause the changefeed".
  2. There are some spelling mistakes in the comments, like "dorp" instead of "drop".

Suggested Changes

  1. Correct the spelling mistakes in the error message and comments.
  2. It seems that the PR does not handle the case where a DDL operation makes an ineligible table eligible. Although this might be a rare case, it would be good to handle it for completeness.
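
The second suggestion could be explored by classifying both directions of the transition rather than only the dangerous one. The sketch below is purely illustrative and not part of this PR: `Transition`, its constants, and `Classify` are hypothetical names, and what the changefeed should do when a table becomes eligible is left open.

```go
package main

// Transition describes how a DDL changed a table's replication eligibility.
type Transition int

const (
	Unchanged        Transition = iota // eligibility is the same before and after
	BecameIneligible                   // the case this PR rejects with an error
	BecameEligible                     // the case raised in the review suggestion
)

// String returns a readable name for logging.
func (t Transition) String() string {
	switch t {
	case BecameIneligible:
		return "became-ineligible"
	case BecameEligible:
		return "became-eligible"
	default:
		return "unchanged"
	}
}

// Classify compares eligibility before and after a DDL.
func Classify(eligibleBefore, eligibleAfter bool) Transition {
	switch {
	case eligibleBefore && !eligibleAfter:
		return BecameIneligible
	case !eligibleBefore && eligibleAfter:
		return BecameEligible
	default:
		return Unchanged
	}
}
```

A caller could then switch on the result: return an error for `BecameIneligible`, and decide separately whether `BecameEligible` should start replicating the table or only be logged.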
<details>

In response to [this](https://github.com/pingcap/tiflow/pull/10965#issuecomment-2078822403):

>/review default


Instructions for interacting with me using PR comments are available [here](https://git.k8s.io/community/contributors/guide/pull-requests.md).  If you have questions or suggestions related to my behavior, please file an issue against the [kubernetes/test-infra](https://github.com/kubernetes/test-infra/issues/new?title=Prow%20issue:) repository.
</details>

hicqu added a commit to ti-chi-bot/tiflow that referenced this pull request Jun 12, 2024
commit c092599
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Wed Jun 12 00:26:59 2024 +0800

    pkg/config, sink(ticdc): support output raw change event for mq and cloud storage sink (pingcap#11226) (pingcap#11290)

    close pingcap#11211

commit 3426e46
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Tue Jun 11 19:40:29 2024 +0800

    puller(ticdc): fix wrong update splitting behavior after table scheduling (pingcap#11269) (pingcap#11282)

    close pingcap#11219

commit 2a28078
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Tue Jun 11 16:40:37 2024 +0800

    mysql(ticdc): remove error filter when check isTiDB in backend init (pingcap#11214) (pingcap#11261)

    close pingcap#11213

commit 2425d54
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Tue Jun 11 16:40:30 2024 +0800

    log(ticdc): Add more error query information to the returned error to facilitate users to know the cause of the failure (pingcap#10945) (pingcap#11257)

    close pingcap#11254

commit 053cdaf
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Tue Jun 11 15:34:30 2024 +0800

    cdc: log slow conflict detect every 60s (pingcap#11251) (pingcap#11287)

    close pingcap#11271

commit 327ba7b
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Tue Jun 11 11:42:00 2024 +0800

    redo(ticdc): return internal error in redo writer (pingcap#11011) (pingcap#11091)

    close pingcap#10124

commit d82ae89
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Mon Jun 10 22:28:29 2024 +0800

    ddl_puller (ticdc): handle dorp pk/uk ddl correctly (pingcap#10965) (pingcap#10981)

    close pingcap#10890

commit f15bec9
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Fri Jun 7 16:16:28 2024 +0800

    redo(ticdc): enable pprof and set memory limit for redo applier (pingcap#10904) (pingcap#10996)

    close pingcap#10900

commit ba50a0e
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Wed Jun 5 19:58:26 2024 +0800

    test(ticdc): enable sequence test (pingcap#11023) (pingcap#11037)

    close pingcap#11015

commit 94b9897
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Wed Jun 5 17:08:56 2024 +0800

    mounter(ticdc): timezone fill default value should also consider tz. (pingcap#10932) (pingcap#10946)

    close pingcap#10931

commit a912d33
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Wed Jun 5 10:49:25 2024 +0800

    mysql (ticdc): Improve the performance of the mysql sink by refining the transaction event batching logic (pingcap#10466) (pingcap#11242)

    close pingcap#11241

commit 6277d9a
Author: dongmen <20351731+asddongmen@users.noreply.github.com>
Date:   Wed May 29 20:13:22 2024 +0800

    kvClient (ticdc): revert e5999e3 to remove useless metrics (pingcap#11184)

    close pingcap#11073

commit 54e93ed
Author: dongmen <20351731+asddongmen@users.noreply.github.com>
Date:   Wed May 29 17:43:22 2024 +0800

    syncpoint (ticdc): make syncpoint support base64 encoded password (pingcap#11162)

    close pingcap#10516

commit 0ba9329
Author: Ti Chi Robot <ti-community-prow-bot@tidb.io>
Date:   Wed May 29 09:07:21 2024 +0800

    (redo)ticdc: fix the event orderliness in redo log (pingcap#11117) (pingcap#11180)

    close pingcap#11096

Signed-off-by: qupeng <qupeng@pingcap.com>
Labels
approved lgtm needs-cherry-pick-release-6.5 Should cherry pick this PR to release-6.5 branch. needs-cherry-pick-release-7.1 Should cherry pick this PR to release-7.1 branch. needs-cherry-pick-release-7.5 Should cherry pick this PR to release-7.5 branch. needs-cherry-pick-release-8.1 Should cherry pick this PR to release-8.1 branch. release-note Denotes a PR that will be considered when it comes time to generate release notes. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
5 participants