-
Notifications
You must be signed in to change notification settings - Fork 569
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bug: dedicated cdc source writes the same full key more than once in an epoch #16235
Comments
@yezizp2012 @StrikeW We may need to check whether this is a bug in cdc source or in meta. If there is a race in actor assignment in meta, it may affect other use cases as well. |
I met the same issue after I rebooted the risingwave cluster.
|
Another occurrence that is probably related to this issue: https://risingwave-community.slack.com/archives/C03BW71523T/p1719592780560509 |
Another occurance in v1.10.0-rc3. But because the cluster has already been reset, we don't know the kind of problematic table. |
Describe the bug
Recently there are two user reporting the following assertion triggered in compaction both on the data related to cdc source state table:
risingwave/src/storage/hummock_sdk/src/key.rs
Line 1069 in 9f2ac7e
Here are some info for the two user reports:
v..7.2
: brand new cluster and compactor panics after table creation. No more information provided.v1.8.0
: brand new cluster and compactor panics after table creation (using dedicated cdc source).6258454417244160
(the epoch in the panic log) has two SSTs. That means there are 2 CNs writing files in the same checkpoint epoch for table id, which is strange because direct cdc source only has one parallelism and there should only be one CN writing data to table 13 in this epoch.23250
and23240
shows that these two SSTs contain only a single entry withFullKey { UserKey { 13, TableKey { 00000001313200000000000002 } }, epoch: 6258387423461376, epoch_with_gap: 6258387423461376, spill_offset: 0}, len=25
. Full sst dump result can be found here.Error message/log
No response
To Reproduce
No response
Expected behavior
No response
How did you deploy RisingWave?
No response
The version of RisingWave
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: