-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FLINK-35143][pipeline-connector][mysql] Expose newly added tables capture in mysql pipeline connector. #3411
[FLINK-35143][pipeline-connector][mysql] Expose newly added tables capture in mysql pipeline connector. #3411
Conversation
… in mysql pipeline connector.
...st/java/org/apache/flink/cdc/connectors/mysql/source/MysqlPipelineNewlyAddedTableITCase.java
Show resolved
Hide resolved
@qg-lin Thanks for your PR. If you need helps, please be free to notice me. Thanks. |
Thanks for your review, I've resolved it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @qg-lin 👍
Minor wording suggestion from my side, otherwise looks good!
.booleanType() | ||
.defaultValue(false) | ||
.withDescription( | ||
"Whether capture the scan the newly added tables or not, by default is false. This option is only useful when we start the job from a savepoint/checkpoint."); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could we drop the capture
here?
For example:
Whether to scan the newly added ...
With both capture
and scan
it seems hard to understand what the flag enables.
...mysql/src/main/java/org/apache/flink/cdc/connectors/mysql/source/MySqlDataSourceOptions.java
Outdated
Show resolved
Hide resolved
…line-connector-mysql/src/main/java/org/apache/flink/cdc/connectors/mysql/source/MySqlDataSourceOptions.java Co-authored-by: Muhammet Orazov <916295+morazow@users.noreply.github.com>
Got it 👍 I missed that there is doc for the flag, the doc version was already understandable. Looks good! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
@qg-lin Please rebase the master branch and let's pass the CI. |
done |
@qg-lin It seems that some tests failed. Please take a look at them. Thanks. |
@ruanhang1993 Done, please trigger again. |
…pture in mysql pipeline connector. (apache#3411) Co-authored-by: Muhammet Orazov <916295+morazow@users.noreply.github.com> Co-authored-by: north.lin <north.lin@yunlsp.com>
…pture in mysql pipeline connector. (apache#3411) Co-authored-by: Muhammet Orazov <916295+morazow@users.noreply.github.com> Co-authored-by: north.lin <north.lin@yunlsp.com>
… connector (from: [FLINK-35143] apache#3411)
…pture in mysql pipeline connector. (apache#3411) Co-authored-by: Muhammet Orazov <916295+morazow@users.noreply.github.com> Co-authored-by: north.lin <north.lin@yunlsp.com>
…e-11 Link: https://code.alibaba-inc.com/ververica/flink-cdc/codereview/20462875 * [FLINK-35173][cdc][mysql] Debezium custom time serializer for MySQL connector (apache#3240) * [minor][cdc][docs] Add user guide about providing extra jar package in quickstart docs * [FLINK-35235][pipeline-connector][kafka] Fix missing dependencies in the uber jar of Kafka pipeline sink * [FLINK-35251][cdc][runtime] Fix bug of serializing derivation mapping in SchemaDerivation This closes apache#3267. * [FLINK-35258][cdc][doc] Fix broken links to Doris in documentation (apache#3276) * [FLINK-35256][cdc][runtime] Fix transform node does not respect type nullability (apache#3272) * [minor][docs] Fix route definition example in core concept docs (apache#3269) * [FLINK-35259][cdc][transform] Fix FlinkCDC pipeline transform can't deal timestamp field (apache#3278) * [FLINK-34878][cdc][transform] Flink CDC pipeline transform supports CASE WHEN (apache#3228) * [FLINK-35255][cdc][runtime] DataSinkWriterOperator adds overrides for the snapshotState and processWatermark methods (apache#3271) * [FLINK-35264][cdc][runtime] Fix multiple transform rules do not take effect (apache#3280) * [FLINK-35244][cdc-connector][tidb] Correct the package for flink-connector-tidb-cdc test * [FLINK-35274][cdc-connector][db2] Fix occasional failure issue with Flink CDC Db2 UT * [hotfix][docs] Correct example configuration for Paimon warehouse path * [FLINK-35245][cdc-connector][tidb] Add metrics for flink-connector-tidb-cdc * [minor][docs] Rectify names of CDC sources for Flink and improve the content directory This closes apache#3310. * [hotfix][docs] Fix dead links in documentations This closes apache#3314. * [FLINK-35314][cdc] Add Flink CDC pipeline transform user document (apache#3308) * [FLINK-35386][cdc][docs] Build release-3.1 documentation and mark it as stable (apache#3330) * [FLINK-35386][cdc][docs] Shorten GHA names and disable fail-fast to isolate branches (apache#3331) * [minor][docs] Improve the answer of MySQL CDC FAQ docs This closes apache#3337. * [FLINK-35431][doc] Migrate references in Flink CDC documentation from Debezium 1.9 to 2.0 to avoid dead links This closes apache#3351. * [FLINK-35298][cdc][metrics] Improve the fetch delay metrics This closes apache#3298. * [FLINK-35294][mysql] Use source config to check if the filter should be applied in timestamp starting mode This closes apache#3291. * [FLINK-35295][mysql] Improve jdbc connection pool initialization failure message This closes apache#3293. * [FLINK-35301][cdc] Avoid deadlock when loading driver classes This closes apache#3300. * [FLINK-35408][mysql] Introduce 30 minutes tolerance when validate the time-zone setting This closes apache#3341. * [FLINK-35447][doc-ci] Flink CDC Document document file had removed but website can access This closes apache#3362. * [minor][cdc-cli] Suppress false alarm in flink config loader This closes apache#3292. * [FLINK-35409][cdc][mysql] Request more splits if all splits are filtered from addSplits method This closes apache#3342. * [FLINK-35323][cdc-runtime] Fix transform failure when one rule matches multiple tables with incompatible schema This closes apache#3313. * [FLINK-35430][cdc-connector][kafka] Pass the time zone infor to JsonSerializationSchema This closes apache#3359. * [FLINK-35149][cdc-composer] Fix DataSinkTranslator#sinkTo ignoring pre-write topology if not TwoPhaseCommittingSink (apache#3233) * [FLINK-35129][postgres] Introduce scan.lsn-commit.checkpoints-num-delay option to control LSN offset commit lazily This close apache#3349. * [FLINK-35325][cdc-connector][paimon] Support specify column order when adding new columns to a table This closes apache#3323. * [build][hotfix] Fix jackson conflicts among cdc connectors This closes apache#2987. * [minor][docs] Improve the readme and issue template This closes apache#3383. * [FLINK-35527][docs] Polish quickstart guide & migrate maven links from ververica to apache * [minor][cdc-connector][paimon] Remove duplicate interface implements * [FLINK-35464][cdc] Fixes operator state backwards compatibility from CDC 3.0.x and add migration tests (apache#3369) * [FLINK-35415][base] Fix compatibility with Flink 1.19 * [FLINK-35415][base] Bump Flink patch version to 1.18.1 * [FLINK-35316][base] Run E2e test cases with Flink 1.19 and latest patch versions * [FLINK-35120][doris] Add Doris integration test cases * [FLINK-35092][cdc][starrocks] Add starrocks integration test cases * [FLINK-34908][pipeline-connector][doris] Fix MySQL to doris pipeline will lose precision for timestamp type This closes apache#3207. * [hotfix][cdc-composor] Adjust test of cdc composer from junit4 to junit5 * [hotfix][ci] PIPELINE_CONNECTORS stage should include pipeline connector paimon This closes apache#3344. * [hotfix][build] Remove test-jar from parent pom and only add to necessary modules This closes apache#3397. Co-authored-by: Qingsheng Ren <renqschn@gmail.com> * [FLINK-35545][doc] Miss 3.1.0 version in snapshot flink-cdc doc version list * [FLINK-35545][doc] Revert dbz doc 2.0 back to 1.9 * [FLINK-35540][cdc-common] Fix table missed when database and table are with the same name This closes apache#3396. * [FLINK-35121][common] Adds validation for pipeline definition options * [FLINK-35297][mysql] Add validation for option connect.timeout because of HikariConfig limitation (apache#3295) * [docs][minor] Optimize styles of the Flink CDC index page This closes apache#3420 * [hotfix][ci] Add new pipeline connectors into labeler.yml * [FLINK-35648][runtime] Allow applying multiple route rules for one single source table (apache#3425) * [FLINK-35700][cli] Loosen CDC pipeline options validation This closes apache#3435. * [FLINK-35281][cdc-common] FlinkEnvironmentUtils#addJar add each jar only once (apache#3301) * [FLINK-35354] Support host mapping in Flink tikv cdc (apache#3336) * [FLINK-35337][cdc] Keep up with the latest version of tikv client * [FLINK-35354]Support host mapping in Flink tikv cdc * [FLINK-35354] Add doc for host mapping feature * [FLINK-35354] fixed annotation import * [FLINK-35647][route] Support symbol replacement to enrich routing rules This closes apache#3428. Co-authored-by: 张田 <zhang_tian@inspur.com> Co-authored-by: yangshuaitong <duguhoney@gmail.com> * [FLINK-35781][cli] Provide a default parallelism 1 for pipeline jobs This closes apache#3458. * [minor][cdc][docs] Optimize markdown formats in doris quickstart doc This closes apache#3324. * [FLINK-34990][cdc-connector][oracle] Oracle cdc support newly add table (apache#3203) * [cdc-connector][oracle] Oracle cdc support newly add table * [cdc-connector][oracle] Fix code style * [cdc-connector][oracle] Address comment * [hotfix][docs][postgres] Remove unsupported erroneous example code This closes apache#3464 * [build][CI] Introduce automated PR governance workflows This closes apache#3466 * [FLINK-34883] Fix postgres uuid column as PK (apache#3282) * [FLINK-34883] Fix postgres uuid column as PK * [FLINK-34883] Fix column comment. * [minor][cdc-connector][postgres] PostgresDialect removes useless code This closes apache#3416. * [FLINK-35758][doc][cdc-connector][mysql] Add missed scan.startup.timestamp-millis option for mysql connector This closes apache#3453. * [minor][cdc-connector][mysql] Code optimization for constants This closes apache#3385 * [minor][cdc-connector][sqlserver] Fix some typos (apache#3421) * [docs][minor] Specify the lib directory of Flink CDC in quick start docs This closes apache#3479. Co-authored-by: fangxiangmin <fangxiangmin> Co-authored-by: Leonard Xu <leonard@apache.org> Co-authored-by: yux <34335406+yuxiqian@users.noreply.github.com> * [FLINK-35237][cdc-common] Allow Sink to choose HashFunction in PrePartitionOperator * [FLINK-35237][cdc-common] Improve the interfaces and reorganize the directory. * [FLINK-35871][doc] Add missed "snapshot" mode to mysql connector startup options This closes apache#3484. * [FLINK-35865][base] Support Byte and Short in ObjectUtils (apache#3481) * [minor][doc][cdc-connector][oracle] Update OracleSchema#getTableSchema doc description This closes apache#3443. * [FLINK-35874][cdc-connector][mysql] Check pureBinlogPhaseTables set before call getBinlogPosition method This closes apache#3488. * [FLINK-35740][cdc-connector][mysql] Allow column as chunk key even it's not primary key This closes apache#3448. * [minor][test] Add Flink CDC 3.1.1 version to migration test version list This closes apache#3426. * [FLINK-35868][cdc-connector][mongodb] Bump dependency version to support MongoDB 7.0 This closes apache#3489. * [FLINK-35391][cdc-connector][paimon] Bump dependency of Paimon Pipeline connector to 0.8.2 This closes apache#3335 Co-authored-by: wangjunbo <wangjunbo@qiyi.com> * [FLINK-35736][test] Add migration test scripts & CI workflows This closes apache#3447 * [docs][minor] Add transform piece for pipeline example This closes apache#3496 * [FLINK-35072][doris] Support applying AlterColumnTypeEvent to Doris pipeline sink This closes apache#3473 * [FLINK-34877][cdc] Support type cast conversion in pipeline transform This closes apache#3357. * [hotfix][ci] Migrate to Docker Compose V2 (apache#3505) * [FLINK-35344][cdc-base] Move same code from multiple subclasses to JdbcSourceChunkSplitter (apache#3319) * [FLINK-35524][cdc-base] Clear connections pools when reader exist. (apache#3388) * [FLINK-35674][cdc-connector][mysql]Fix blocking caused by searching for timestamp in binlog file (apache#3432) * [FLINK-35912][cdc-connector] SqlServer CDC doesn't chunk UUID-typed columns correctly (apache#3497) * resolve conficts * polish code to trigger ci --------- Co-authored-by: Kael <kael@fts.dev> Co-authored-by: gongzhongqiang <gongzhongqiang@gigacloudtech.com> * [hotfix][starrocks] Fix StarRocks FE startup failure due to insufficient disk space available This closes apache#3508. * [FLINK-35813][cdc-runtime] Do not clear state field in TransformSchemaOperator until operator closed This closes apache#3469. * [FLINK-35234][minor][cdc-common] Fix potential NPE in ConfigurationUtils This closes apache#3255. * [FLINK-35743][cdc-runtime] Correct the temporal function semantics This closes apache#3449. Co-authored-by: wenmo <32723967+wenmo@users.noreply.github.com> Co-authored-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * Revert "[hotfix][starrocks] Fix StarRocks FE startup failure due to insufficient disk space available " This reverts commit 3315be3. * [hotfix][ci] Clean up disk space * [FLINK-34638][cdc-common] Support column with default value * [FLINK-35893][cdc-runtime] Write CURRENT_VERSION of TableChangeInfo to state This closes apache#2944. * [hotfix][e2e] Add missing default values field in E2e test This closes apache#3516. * [FLINK-35242][cdc-common][cdc-runtime] Support TRY_EVOLVE and LENIENT schema evolution behavior This closes apache#3339. * [FLINK-35432][pipeline-connector][mysql] Support catch modify event in mysql to send AlterColumnTypeEvent. (apache#3352) Co-authored-by: haoke <haoke@bytedance.com> * [FLINK-35791][kafka] Add database and table info of Canal / Debezium json format for Kafka sink (apache#3461) * [build][e2e] Separate Pipeline and Source E2e tests and cover flink 1.20 version This closes apache#3514. * [FLINK-35730][cdc-cli] PipelineDefinitionParser supports parsing pipeline def in text format This closes apache#3444. * [FLINK-35272][cdc-runtime] Transform supports omitting and renaming computed column This closes apache#3285. * [FLINK-34853] Submit CDC Job To Flink K8S Native Application Mode (apache#3093) * [hotfix][cdc-connector][mongodb] Fix unstable test cases for snapshot back-filling (apache#3506) * [FLINK-35143][pipeline-connector][mysql] Expose newly added tables capture in mysql pipeline connector. (apache#3411) Co-authored-by: Muhammet Orazov <916295+morazow@users.noreply.github.com> Co-authored-by: north.lin <north.lin@yunlsp.com> * [FLINK-35715][common] Ignore the compact optimize for mysql timestamp type in BinaryRecordData (apache#3511) * [FLINK-34688][cdc-connector][mysql] Make scan newly table trigger condition strictly This closes apache#3519. * [FLINK-35442][cdc-connect][kafka] add key.format and partition.strategy option to make sure the same record sending to the same partition. (apache#3522) * [FLINK-34876][transform] Support UDF functions in transform (apache#3465) * [FLINK-36007][[cdc-composer] Loading factory and added jar in one search This close apache#3520. * [FLINK-35981][cdc-runtime] Transform supports referencing one column more than once This closes apache#3515. * [hotfix][doc] Fix doris document dead links and typo This closes apache#3527 * [FLINK-35938][pipeline-connector][paimon] Use filterAndCommit API for retry Committable to avoid duplicate commit This closes apache#3504. * [FLINK-36023][cdc-composer] Flink CDC K8S Native Application Mode add wrong jar url (apache#3523) * [FLINK-35984][cdc-runtime] Fix bug that metadata column name can not be used in transform rule This closes apache#3528. Co-authored-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * [FLINK-35891][pipeline-connector][paimon] Support dynamic bucket in Paimon sink This closes apache#3499. * [FLINK-35884][pipeline-connector][mysql] MySQL pipeline connector supports to set chunk key-column (apache#3490) Co-authored-by: wangjunbo <wangjunbo@qiyi.com> Co-authored-by: Hang Ruan <ruanhang1993@hotmail.com> * [FLINK-35894][pipeline-connector][es] Introduce Elasticsearch Sink Connector for Flink CDC Pipeline * [FLINK-35894][pipeline-connector][es] Support for ElasticSearch 6, 7 versions This closes apache#3495. * [FLINK-35991][cdc-runtime] Resolve operator conflicts in transform SQL operator tables * [FLINK-36034][cdc-runtime] Get rid of Flink table planner dependency in cdc runtime module This closes apache#3513. * [release] Update version to 3.3-SNAPSHOT This closes apache#3531. Co-authored-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * [hotfix][cdc-runtime] Invalidate cache correctly to avoid classloader leakage This closes apache#3533 * [FLINK-35805][transform] Add __data_event_type__ metadata column This closes apache#3468 * [FLINK-36054][cdc][build] Fix Flink CDC parent pom and release scripts (apache#3540) * [FLINK-36056][cdc][connector/elasticsearch] Change flink.connector.elasticsearch.version to a released version This closes apache#3542. * [FLINK-35638][OceanBase][test] Refactor OceanBase test cases and remove dependency on host network (apache#3439) * [minor][docs] Compress images to reduce file size and improve website load speed This closes apache#3551 * [FLINK-36082][pipeline-connector][kafka] Fix lamda NotSerializableException in KafkaDataSink This closes apache#3549 * [FLINK-36088][pipeline-connector][paimon] Fix NPE in BucketAssignOperator when job restoration This closes apache#3553 * [FLINK-36076][minor][cdc-runtime] Set isSchemaChangeApplying as volatile for thread safe consideration This closes apache#3556. * [FLINK-35243][cdc-common] Extends more schema change event types support This close apache#3521. * [FLINK-36111][minor][pipeline-connector/paimon] Improve MultiTableCommittableChannelComputer Topology name This closes apache#3559 * [FLINK-36081][cdc-connector][mysql] Remove the schemas of outdated tables in the BinlogSplit when restart (apache#3548) Co-authored-by: 云时 <mingya.wmy@alibaba-inc.com> * [FLINK-36094][cdc-runtime] Improve the Exception that SchemaRegistryRequestHandler thrown This closes apache#3558. * [FLINK-36115][pipeline-connector][mysql] Introduce scan.newly-added-table.enabled option for MySQL Source This closes apache#3560. * [FLINK-36114][cdc-runtime] Make SchemaRegistryRequestHandler thread safe by blocking subsequent schemaChangeEvent This closes apache#3563. Co-authored-by: Hongshun Wang <loserwang1024@gmail.com> * [FLINK-36092][cdc-runtime] Fix schema evolution failure with wildcarded transform rule This closes apache#3557. * [FLINK-35056][cdc-connector/sqlserver] Fix scale mapping from SQL Server TIMESTAMP to Flink SQL timestamp This closes apache#3561. * [hotfix][cdc-runtime] Fix schema registry hanging in multiple parallelism This closes apache#3567. * [hotfix][cdc-connector][mongodb] Fix LegacyMongoDBSourceExampleTest cannot run (apache#3555) * [FLINK-36128][cdc-runtime] Fix potential unrecoverable in-flight data exception by promoting LENIENT as the default schema change behavior This closes apache#3574. * [hotfix][cdc-runtime] Keep upstream pending requests in order to avoid checkpoint hanging & state inconsistency in timestamp startup mode This closes apache#3576. * [FLINK-36148][pipeline-connector/mysql] Fix that newly added table can not discovered by adding custom parser for CreateTableEvent This closes apache#3570. * [FLINK-36150][pipeline-connector/mysql] tables.exclude should work even scan.binlog.newly-added-table.enabled is true This closes apache#3573. * [minor][cdc-runtime] Run schema coordinator logic asynchronously to avoid blocking the main thread This closes apache#3557 * [hotfix][pipeline-connector/mysql] Fix primary key restraints missing when using inline `PRIMARY KEY` declaration syntax This closes apache#3579. * [minor][tests] Fix test testDanglingDroppingTableDuringBinlogMode due to imprecise timestamp startup This closes apache#3580 * [FLINK-36184][transform] Fix transform operator swallows schema changes from tables not present in transform rules (apache#3589) * [FLINK-36183] Fix lenient schema evolution failure with route blocks (apache#3583) * [FLINK-36226][cdc][docs] Build 3.2 docs and mark it as stable (apache#3597) * [FLINK-35503] Add support for running Oracle connector unit test on ARM architecture (apache#3600) * [hotfix][CI] Fix Stale label automatically removed without update activity This closes apache#3621. * [FLINK-34738][cdc][docs-zh] "Deployment - YARN" Page for Flink CDC Chinese Documentation (apache#3205) * [FLINK-34738][cdc][docs-zh] "Deployment - YARN" Page for Flink CDC Chinese Documentation * [FLINK-34738][cdc][docs-zh] Optimization for "Deployment - YARN" Page's Chinese Documentation * [hotfix][tests] Fix oracle e2e test with ARM docker image without a database initial (apache#3634) * [FLINK-36151][docs] Add schema evolution related docs This closes apache#3575. * [hotfix][test] Reorganize test cases * [hotfix][ci] Optimize CI performance by implementing parallel execution * [hotfix][ci] Fix ci failure in new GitHub runner images (apache#3645) Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * [FLINK-36214][build] Downgrade scala-maven-plugin version to 4.8.0 to keep compatibility with Java 8 (apache#3594) * [FLINK-36326][cdc-connector][mysql] Send BinlogSplitUpdateRequestEvent only once to fix auto scan newly-added table failure (apache#3613) Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> Co-authored-by: Hang Ruan <ruanhang1993@hotmail.com> * [FLINK-36463][cdc-connector][mysql] Forbid to override some debezium options (apache#3631) * [FLINK-36211][pipeline-connector/kafka] shade org.apache.flink.streaming.connectors.kafka to avoid conflict with flink-connector-kafka jar. (apache#3595) * [FLINK-35291][runtime] Improve the ROW data deserialization performance of DebeziumEventDeserializationScheme (apache#3289) Co-authored-by: liuzeshan <liuzeshan@bytedance.com> * [FLINK-36174][cdc-cli] CDC yaml without pipeline should not throw exception. (apache#3588) * [FLINK-35544][docs][deploy] Add deployment documentations for Kubernetes Operator This closes apache#3392. * [FLINK-36618][cdc-connector][postgres] Improve PostgresDialect.discoverDataCollections to reduce the bootstrap time This closes apache#3672. * [FLINK-36052][docs][cdc-pipeline-connector/es] Add flink cdc pipeline elasticsearch connector docs This closes apache#3649. Co-authored-by: wangjunbo <wangjunbo@qiyi.com> * [FLINK-36474][route] Support merging timestamp columns when routing This closes apache#3636. * [FLINK-36514][cdc-cli] Fix unable to override exclude schema types in lenient mode This closes apache#3637. Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * [FLINK-36221][docs] Add CAST function documentations (apache#3596) * [FLINK-35982][transform] Fix transform metadata failure when projection block is missed This closes apache#3592. * [FLINK-35985][transform] Correct the substring function in transform rule This closes apache#3537. * [FLINK-36105][flink-cdc-cli] Fix unable to restore from checkpoint with Flink 1.20 (apache#3564) * [FLINK-36247][cdc-connector][mysql] Fix potential transaction leak during MySQL snapshot phase (apache#3602) * [hotfix][transform][minor] Fix potential code conflict that SUBSTR built-in function is 1-based index now This closes apache#3696 * [FLINK-36649][cdc-connector][oracle] Fix oracle connection close error (apache#3678) * [FLINK-36572][pipeline-connector][starrocks] Fix the issue that the local time zone is wrongly set This closes apache#3655. * [FLINK-36407][runtime] Shut down coordinatorExecutor upon closing SchemaRegistry (apache#3624) * [FLINK-36560][pipeline-connector][paimon] Fix the issue that timestamp_ltz field is not correctly converted This closes apache#3648. * [FLINK-36408][pipeline-connector][mysql] Fix MySQL pipeline connector failed to parse FLOAT type with precision (apache#3625) * [FLINK-36375][cdc-runtime] Fix missed default value in AddColumnEvent/RenameColunEvent This closes apache#3622. * [FLINK-36517][pipeline-connector][paimon] Use filterAndCommit API to avoid committing the same datafile twice This closes apache#3639 * [FLINK-36678][docs][deploy] Fix the typo in Flink CDC YARN deployment documentation This closes apache#3706. * [FLINK-36541][pipeline-connector][paimon] Pass checkpointId to StoreSinkWrite#prepareCommit correctly This closes apache#3652. * [FLINK-36681][mysql-cdc][docs] Fix the wrong chunks splitting query in incremental snapshot reading section This closes apache#3703. * [FLINK-36461][transform] Fix schema evolution failure with un-transformed tables This closes apache#3632. * [FLINK-36609][pipeline-connector][paimon] Add partition columns to primary keys This closes apache#3641. * [FLINK-36699][cdc-common] fix nullability when converting cdc data type to flink data type (apache#3713) * [FLINK-35592][cdc-connector][mysql] Fix MysqlDebeziumTimeConverter miss timezone when convert to timestamp This closes apache#3332 Co-authored-by: Hang Ruan <ruanhang1993@hotmail.com> * [FLINK-36093][transform] Fix preTransformoperator wrongly filters columns belong to different transforms This closes apache#3572 * [FLINK-36596][transform] Fix unable to schema evolve with project-less transform rules This closes apache#3665. * [FLINK-36565][transform] Route allows merging Decimals with various precisions This closes apache#3651 * [FLINK-36285][doris] Fix unable to alter column type without default value specified This closes apache#3691. * [FLINK-36586][build] Update flink version to 1.19 (apache#3660) Co-authored-by: ConradJam <czy006@apache.com> Co-authored-by: Hang Ruan <ruanhang1993@hotmail.com> * [FLINK-36656][mysql] Fix type conversion failure for newly-added sharding table with mysql boolean type (apache#3683) * [FLINK-36750][pipeline-connector/paimon] Restore SinkWriter from state to keep the information of previous SinkWrite when schema evolution happen This closes apache#3744. * [hotfix][source-connector/mysql] Fix conflicts after flink version bumped to 1.19 (apache#3748) * [FLINK-36772][mysql][cdc-base] Fix error placeholder for errorMessageTemplate of Preconditions This closes apache#3754 * [hotfix][docs][pipeline-connector/es] Add missed supported flink versions and Elasticsearch versions This closes apache#3752 * [FLINK-36525][transform] Support for AI Model Integration for Data Processing (apache#3753) * [hotfix][Doc] fix spelling error in Al model docs. (apache#3764) * [FLINK-36803][cdc-connector][base & mysql] Use the same format `tableId:chunkId` for splitId in SnapshotSplit (apache#3763) * [hotfix][tests][oceanbase] Fix oceanbase test failure, possibly caused by some interactions among cases (apache#3712) * [FLINK-36315][cdc-connector][base&pg&mongodb]The flink-cdc-base module supports source metric statistics (apache#3619) Co-authored-by: molin.lxd <molin.lxd@alibaba-inc.com> Co-authored-by: Hang Ruan <ruanhang1993@hotmail.com> * [docs] Update download links to up-to-date cdc version (apache#3766) * [docs] Update download links to up-to-date cdc version Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * fix: replace params to allow dead link checking to pass Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * fix: just don't check any interpolating urls Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> --------- Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * [FLINK-36878][pipeline-connector][kafka] Shade org.apache.kafka with org.apache.flink.kafka.shaded.org.apache.kafka instead of cdc path This closes apache#3790. * [hotfix][build] Add missed third-party OSS NOTICE This closes apache#3756. * [hotfix][mysql] Remove deprecated MySQL incremental sources used in tests This closes apache#3792 * [FLINK-36771][cdc-connector][base&mysql] Fix UT trigger error: Invalid assigner status {} [NEWLY_ADDED_ASSIGNING_FINISHED] (apache#3755) * [FLINK-36891[[source-connector][mysql] Fix corrupted state in case of serialization failure in MySQL CDC Source This closes apache#3794. * [FLINK-34688][cdc-connector][base] CDC framework split snapshot chunks asynchronously (apache#3510) * [FLINK-36895][cdc-connector][base] The JdbcSourceChunkSplitter#queryMin method passed a parameter with tableName/coiumnName reversed. (apache#3797) * [FLINK-36558][source-connector/mysql] Fix column metadata parsing compatibility with MySQL 8.0.17&8.0.18 This closes apache#3647. * [FLINK-36854][transform] Add missed comment and default value after transform This closes apache#3780. * [FLINK-36866][transform] Fix unable to narrow casting on numeric values This closes apache#3786. * [hotfix] Fix Java 11 target compatibility & add tests (apache#3633) * [hotfix] Fix Java 11 target compatibility Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> # Conflicts: # .github/workflows/flink_cdc_java_8.yml # .github/workflows/flink_cdc_migration_test_base.yml # pom.xml * fix: clarify GiHub workflow names Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> --------- Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * [FLINK-36864][cdc-runtime] Fix unable to use numeric literals that goes beyond Int32 range (apache#3785) Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * [FLINK-36879][runtime] Support to convert delete as insert in transform (apache#3804) * [hotfix] Fix building status badge in README as workflow files have been refactored (apache#3807) * [hotfix] Fix building status badge in README as workflow files have been refactored Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * fix: rename workflow files to `flink_cdc_ci` / `flink_cdc_ci_nightly` As they're not strictly tied to specific JDK versions. Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> --------- Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * [FLINK-36784][common] Support to add metadata columns for data in the meta fields of DataChangeEvent at transform (apache#3758) Co-authored-by: Kunni <lvyanquan.lyq@alibaba-inc.com> * [hotfix][docs] Fix ToC to include H1 This closes apache#3773. * [tests][build] Update migration test matrix to 3.2.0 and later * [cdc-common] Extract column / schema type merging utility methods to `SchemaMergingUtils` * [FLINK-36763][cdc-runtime] Introduce distributed schema evolution topology for sources with parallelized metadata This closes apache#3801 * [FLINK-36690][cdc-runtime] Fix schema operator hanging under extreme parallelized pressure This closes apache#3680 * [hotfix][tests] Fix unstable `testInitialStartupModeWithOpTs` case (apache#3809) Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * [FLINK-36952][source-connector/mongodb] Fix typos in MongoDB connector Chinese documentation (apache#3811) * [build][hotfix] Allow pinning issues & PRs that never decay This closes apache#3816 * [hotfix][tests] Fix unstable `OceanBaseMySQLModelITCase` (apache#3831) Signed-off-by: yuxiqian <34335406+yuxiqian@users.noreply.github.com> * [FLINK-36790][cdc-connector][paimon] Set waitCompaction to true in PaimonWriter to avoid CME problem This closes apache#3760 Co-authored-by: wuzhiping <wuzhiping.007@bytedance.com> * [FLINK-34545][pipeline-connector/ob]Add OceanBase pipeline connector to Flink CDC This closes apache#3360. * [FLINK-37017][cdc-common] Supports map and array types for binary record data This closes apache#3434. * [hotfix][docs] Add missed enumeration in MySQL Source This closes apache#3832 * [FLINK-35167][pipeline-connector/maxcompute] Introduce MaxCompute pipeline DataSink This closes apache#3254. * [hotfix][k8s] Fix command-line option `from-savepoint` does not work in K8s deployment mode This closes apache#3825. Co-authored-by: helloliuxg@gmail.com <xiaogenliu@kugou.net> * [hotfix][cdc-common] Remove duplicated code to improve performance This closes apache#3840. Co-authored-by: zhangchaoming.zcm <zhangchaoming.zcm@antgroup.com> * [FLINK-36524][pipeline-connector][paimon] Bump Paimon version to 0.9.0 This closes apache#3644 * [FLINK-36877][pipeline-connector/kafka] Correct canal-json output for delete record This closes apache#3788 * [FLINK-36956][transform] Append NOT NULL attribute for Primary Key columns This closes apache#3815 * [FLINK-36981][transform] Considering sharding tables with different schema in transform projection This closes apache#3826. * [FLINK-36970][cdc-common] Merge result of data type BIGINT and DOUBLE should be DOUBLE instead of STRING This closes apache#3821 * [FLINK-36701][cdc-runtime] Obtain latest evolvedSchema when SinkDataWriterOperator handles FlushEvent from failover This closes apache#3802 Co-authored-by: jzjsnow <snow.jiangzj@gmail.com> * [hotfix][ci] Fix CI failure due to implicit conflicts This closes apache#3846 * [FLINK-35325][transform] Skip insufficient_quota error when running test case using ad model. (apache#3849) * [FLINK-36811][mysql] MySQL cdc setIsProcessingBacklog in snapshot phase and exit when snapshot phase finished (apache#3793) * [FLINK-37012][transform] Fix argument type mismatch when metadata column used in function This closes apache#3837 * [FLINK-36610] MySQL CDC supports parsing gh-ost / pt-osc generated schema changes (apache#3668) Co-authored-by: MOBIN-F <18814118038@163.com> * [FLINK-36865][cdc] Provide UNIX_TIMESTAMP series functions in YAML pipeline This closes apache#3819. * [FLINK-36964][pipeline-connector/paimon] Fix potential exception when SchemaChange in parallel with Paimon Sink This closes apache#3818 Co-authored-by: yuxiqian.yxq <yuxiqian.yxq@alibaba-inc.com> * [FLINK-36858][pipeline-connector][kafka] Fix JsonRowDataSerializationSchema compatibility issue with Flink 1.20 This closes apache#3784 Co-authored-by: Leonard Xu <xbjtdcq@gmail.com> * [FLINK-36985][pipeline-connector/paimon] Tolerante ColumnAlreadyExistException when apply AddColumnEvent in paimon This closes apache#3828. * [FLINK-37042][pipeline-connector/maxcompute] Rename maxcompute pipieline connector options to follow flink style This closes apache#3852 * [FLINK-36193][pipeline-connectors][paimon/doris/starrocks] Supports applying TRUNCATE & DROP Table Event to Paimon, StarRocks and Doris This closes apache#3673 * [FLINK-36700][pipeline-connector][elasticsearch] Elasticsearch pipeline sink supports authentication This closes apache#3728 * [FLINK-37011][cdc-transform] Improve get source field value by column name in PreTransformProcessor This closes apache#3836 * [FLINK-35634][build] Add CdcUp playground CLI scripts This closes apache#3605 * [FLINK-36974][cdc-cli] Support overwrite flink configuration via command line This closes apache#3823 Co-authored-by: helloliuxg@gmail.com <xiaogenliu@kugou.net> * [FLINK-35387][cdc-connector][postgres] PG CDC source support heart beat This closes apache#3667 * [FLINK-36636][transform] Supports timestamp comparison in cdc pipeline transform This closes apache#3677 * [FLINK-36977][pipeline-connector/paimon] Apply default value when process add_column schema change envent This closes apache#3824. Co-authored-by: Leonard Xu <xbjtdcq@gmail.com> * [FLINK-36741][transform] Fix the decimal precision and length lost during transform This closes apache#3740 * [FLINK-36647][transform] Support Timestampdiff and Timestampadd function in cdc pipeline transform This closes apache#3698 * [build][minor] Upgrade the max file length from 3k to 4k * [FLINK-34865][pipeline-connector/mysql] Support sync table and column comment This closes apache#3482 Co-authored-by: Leonard Xu <xbjtdcq@gmail.com> * [FLINK-37124][build] Simplify logs in test cases to avoid flooding GHA outputs This closes apache#3860 * [FLINK-36754][transform] Projection should treated as an asterisk when projection expression is empty or null This closes apache#3749 * [hotfix][pipeline-connector][maxcompute] Fix MaxCompute Pipeline Connector tests by upgrading maxcompute-emulator This closes apache#3862 * [FLINK-36282][pipeline-connector][cdc-connector][mysql]fix incorrect data type of TINYINT(1) in mysql pipeline connector (apache#3608) * [FLINK-36282][pipeline-connector][cdc-connector][mysql]fix incorrect data type of TINYINT(1) in mysql pipeline connector * reformat code * Update MySqlPipelineITCase.java * pass a boolean value instead of Properties * uodate FAQ * add a method to get tinyInt1isBit * add new cdc config `treat-tinyint1-as-boolean` * Update MySqlChunkSplitter.java * change param name * [FLINK-36406][cdc-runtime] Close MetadataApplier when the job stops This closes apache#3623 * [FLINK-36224][docs] Add the version mapping between pipeline connectors and flink This closes apache#3598 * [FLINK-36620][cdc-cli] Add support for specifying the --flink-home parameter via an '=' character This closes apache#3838 * [FLINK-36351][pipeline-connector/doris] Support the conversion of Flink TIME type to Doris String type This closes apache#3620 * [FLINK-35802][pipeline-connectors/mysql] Clean ChangeEventQueue to avoid deadlock when calling BinaryLogClient#disconnect method This closes apache#3463 * [FLINK-35600][pipeline-connector/mysql] Add timestamp for low and high watermark This closes apache#3415 * [hotfix][test] Make pipeline migration tests more robust This closes apache#3866 * [hotfix[cdc-runtime] Close MetadataApplier in SchemaRegistry when the job stops This closes apache#3864 * [FLINK-36913][pipeline-connector][kafka] Introduce option to define custom mapping from upstream table id to downstream topic name This closes apache#3805 * [FLINK-35067][cdc-connector][postgres] Adding metadata 'row_kind' for Postgres CDC Connector. This closes apache#3716. Co-authored-by: Leonard Xu <xbjtdcq@gmail.com> * [FLINK-37122][build] Try to be compatible with old flink version 1.18.x This closes apache#3859 * [FLINK-36578][pipeline-connector/mysql] Introduce option to unify json type output between snapshot phase and binlog phase This closes apache#3658 * [minor][ci] Set proper timeout for compile_and_test step of CI job * [hotfix][pipeline-connector][mysql] Fix missed optional option in MySqlDataSourceFactory This closes apache#3867 * [FLINK-34865][pipeline-connector/mysql] Support sync newly added table's comment This closes apache#3869 * [hotfix][build] Miscellaneous fixes on GHA workflows This closes apache#3839 * [build] Update version to 3.4-SNAPSHOT and add release-3.3 docs This closes apache#3870 * [build] Update version to 3.4-SNAPSHOT and add release-3.3 docs This closes apache#3870 * [hotfix][docs][pipeline-connector/maxcompute] Fix maxcompute connector typo in examples section This closes apache#3875 * [hotfix][cdc][docs] Build 3.3 docs and mark it as stable (apache#3882) * [hotfix][docs] Change docs in master to 3.4-SNAPSHOT (apache#3886) * [FLINK-37233][docs] Update supported flink versions and sinks (apache#3899) * [FLINK-34729][docs] Translate "Core Concept" Pages of Flink CDC into Chinese This closes apache#3901 * [FLINK-37252][doc] Align Postgres CDC Connector Chinese docs with English version This closes apache#3903 * [FLINK-37224][docs] Add the missing documents and parameters of MongoDB CDC This closes apache#3895 * [FLINK-37251][doc] Add pipeline connectors' download link in overview.md This closes apache#3900 * [FLINK-37231][docs] Add documentation for CDC Source metrics This closes apache#3897 * [FLINK-36698][pipeline-connector][elasticsearch] Elasticsearch Pipeline Sink supports index sharding This closes apache#3723. Co-authored-by: wangjunbo <wangjunbo@qiyi.com> * [FLINK-37287][docs] Add missed Apache Paimon 0.9 and Fixing typo on overview.md This closes apache#3913 * [hotfix][license] Update legacy license This closes apache#3908 * [tests][ci] Miscellaneous improvements on CI robustness This closes apache#3911 * [FLINK-37262][pipeline-connector/mysql] Fix missing PARSE_ONLINE_SCHEMA_CHANGES option in MySqlDataSourceFactory This closes apache#3910 * [FLINK-37191][cdc-connector/mysql] Avoid back filling when lowWatermark is equal to highWatermark in BinlogSplit This closes apache#3902 * [FLINK-36564][ci] Running CI in random timezone to expose more time related bugs This closes apache#3650 * [FLINK-36945][cdc-connector/mysql] Support parsing rename multiple tables in one statement This closes apache#3876. * Merge remote-tracking branch 'refs/remotes/github/master' into rh11cp * refs/remotes/github/master: (308 commits) [FLINK-36945][cdc-connector/mysql] Support parsing rename multiple tables in one statement [FLINK-36564][ci] Running CI in random timezone to expose more time related bugs [FLINK-37191][cdc-connector/mysql] Avoid back filling when lowWatermark is equal to highWatermark in BinlogSplit [FLINK-37262][pipeline-connector/mysql] Fix missing PARSE_ONLINE_SCHEMA_CHANGES option in MySqlDataSourceFactory [tests][ci] Miscellaneous improvements on CI robustness [hotfix][license] Update legacy license [FLINK-37287][docs] Add missed Apache Paimon 0.9 and Fixing typo on overview.md [FLINK-36698][pipeline-connector][elasticsearch] Elasticsearch Pipeline Sink supports index sharding [FLINK-37231][docs] Add documentation for CDC Source metrics [FLINK-37251][doc] Add pipeline connectors' download link in overview.md [FLINK-37224][docs] Add the missing documents and parameters of MongoDB CDC [FLINK-37252][doc] Align Postgres CDC Connector Chinese docs with English version [FLINK-34729][docs] Translate "Core Concept" Pages of Flink CDC into Chinese [FLINK-37233][docs] Update supported flink versions and sinks (apache#3899) [hotfix][docs] Change docs in master to 3.4-SNAPSHOT (apache#3886) [hotfix][cdc][docs] Build 3.3 docs and mark it as stable (apache#3882) [hotfix][docs][pipeline-connector/maxcompute] Fix maxcompute connector typo in examples section [build] Update version to 3.4-SNAPSHOT and add release-3.3 docs [build] Update version to 3.4-SNAPSHOT and add release-3.3 docs [hotfix][build] Miscellaneous fixes on GHA workflows ... * fix test * fix tests * fix test * fix test * fix test * fix: downloading gh-ost cli timeout 修复了 gh-ost cli 下载超时的问题,放在资源目录里了。 Link: https://code.alibaba-inc.com/ververica/flink-cdc/codereview/20506267 * fix: downloading gh-ost cli timeout * fix: fill in dummy methods in YamlServiceImpl * Shutdown previous jobs properly (to avoid serverId conflicts) * update 1.20 * feat: Resolve conflict, auto committed by CodeFlow * fix test * fix test * fix test * fix test
https://issues.apache.org/jira/browse/FLINK-35143