Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

performance: read_escaped_string_text when COPY TSV file #8572

Closed
Tracked by #7823
BohuTANG opened this issue Nov 1, 2022 · 3 comments
Closed
Tracked by #7823

performance: read_escaped_string_text when COPY TSV file #8572

BohuTANG opened this issue Nov 1, 2022 · 3 comments
Assignees
Labels
C-performance Category: Performance

Comments

@BohuTANG
Copy link
Member

BohuTANG commented Nov 1, 2022

Summary

How to bench:
https://github.com/BohuTANG/bendlog/blob/main/scripts/README.md

When the request_body&response_body column length is 2048 random string, read_escaped_string_text is the CPU bottleneck.

perf record -g -p <databend-query-pid-when-copy>
perf report -g

813097a2-f8ba-4386-80f7-099f8036c16a

image

@BohuTANG
Copy link
Member Author

BohuTANG commented Nov 1, 2022

cc @youngsofun @sundy-li

@BohuTANG BohuTANG added the C-performance Category: Performance label Nov 1, 2022
@BohuTANG BohuTANG changed the title performance: read_escaped_string_text when COPY performance: read_escaped_string_text when COPY TSV file Nov 1, 2022
@BohuTANG
Copy link
Member Author

BohuTANG commented Nov 1, 2022

Copy parquet file perf:
image

@youngsofun
Copy link
Member

caused by #8486

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
C-performance Category: Performance
Projects
None yet
Development

No branches or pull requests

2 participants