util/chunk: optimize `(*ListInDisk).GetChunk` and add a fast row container reader (#45130) #45205

ti-chi-bot · 2023-07-06T06:27:29Z

This is an automated cherry-pick of #45130

What problem does this PR solve?

Issue Number: close #45125

Problem Summary:

The existing reading method of RowContainer (GetChunk(...)) is not fast enough for dumping a lot of rows from disk (for the cursorFetch use case).

The existing Iterator4RowContainer is even slower, as it allocates a new chunk for each row 🤦.

This PR is extracted from #44730 (with a some refractor).

What is changed and how it works?

This PR pipelines the IO and CPU calculation, to make full use of the IO bandwidth. It should also help other features using rowContainer, as GetChunk is now much faster.

The performance of existing benchmark BenchmarkListInDisk_GetChunk increases from 2877471ns/op to 462864ns/op

Check List

Tests

Unit test
Integration test
Manual test (add detailed scripts or steps below)
No code

Signed-off-by: Yang Keao <yangkeao@chunibyo.icu>

ti-chi-bot · 2023-07-06T06:27:33Z

This cherry pick PR is for a release branch and has not yet been approved by release team.
Adding the do-not-merge/cherry-pick-not-approved label.

To merge this cherry pick, it must first be approved by the collaborators.

AFTER it has been approved by collaborators, please ping the release team in a comment to request a cherry pick review.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

ti-chi-bot · 2023-07-06T06:27:37Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign mmyj for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

YangKeao added 3 commits July 6, 2023 06:27

add a row container reader

078d43b

Signed-off-by: Yang Keao <yangkeao@chunibyo.icu>

re-org the row container test

ffe1ea8

Signed-off-by: Yang Keao <yangkeao@chunibyo.icu>

remove the redundant lock in

3865e17

Signed-off-by: Yang Keao <yangkeao@chunibyo.icu>

ti-chi-bot added release-note-none Denotes a PR that doesn't merit a release note. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. type/cherry-pick-for-release-7.2 labels Jul 6, 2023

ti-chi-bot mentioned this pull request Jul 6, 2023

util/chunk: optimize (*ListInDisk).GetChunk and add a fast row container reader #45130

Merged

4 tasks

ti-chi-bot bot added the do-not-merge/cherry-pick-not-approved label Jul 6, 2023

YangKeao closed this Jul 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

util/chunk: optimize `(*ListInDisk).GetChunk` and add a fast row container reader (#45130) #45205

util/chunk: optimize `(*ListInDisk).GetChunk` and add a fast row container reader (#45130) #45205

ti-chi-bot commented Jul 6, 2023

ti-chi-bot bot commented Jul 6, 2023

ti-chi-bot bot commented Jul 6, 2023

util/chunk: optimize (*ListInDisk).GetChunk and add a fast row container reader (#45130) #45205

util/chunk: optimize (*ListInDisk).GetChunk and add a fast row container reader (#45130) #45205

Conversation

ti-chi-bot commented Jul 6, 2023

What problem does this PR solve?

What is changed and how it works?

Check List

ti-chi-bot bot commented Jul 6, 2023

ti-chi-bot bot commented Jul 6, 2023

util/chunk: optimize `(*ListInDisk).GetChunk` and add a fast row container reader (#45130) #45205

util/chunk: optimize `(*ListInDisk).GetChunk` and add a fast row container reader (#45130) #45205