Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Correctly record the RawBytesRead and scan time metric in DirectBufferedInput #9801

Closed
wants to merge 8 commits into from

Conversation

JkSelf
Copy link
Collaborator

@JkSelf JkSelf commented May 14, 2024

Follow up #8545

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 14, 2024
Copy link

netlify bot commented May 14, 2024

Deploy Preview for meta-velox canceled.

Name Link
🔨 Latest commit 6428352
🔍 Latest deploy log https://app.netlify.com/sites/meta-velox/deploys/666f9067f6cc8b0008e3a04b

@JkSelf
Copy link
Collaborator Author

JkSelf commented May 14, 2024

@Yuhta Reopen #8545 and refined the unit test to pass the code changes. Can you help to review again? Thanks.

@JkSelf
Copy link
Collaborator Author

JkSelf commented May 14, 2024

cc @ulysses-you

@JkSelf JkSelf changed the title Correctly record the RawBytesRead metric in DirectBufferedInput Correctly record the RawBytesRead and scan time metric in DirectBufferedInput May 14, 2024
@Yuhta Yuhta added the ready-to-merge PR that have been reviewed and are ready for merging. PRs with this tag notify the Velox Meta oncall label May 14, 2024
@JkSelf
Copy link
Collaborator Author

JkSelf commented May 15, 2024

@Yuhta Can you help to merge? Thanks.

@JkSelf
Copy link
Collaborator Author

JkSelf commented May 16, 2024

@mbasmanova Can you help to merge? Thanks.

@JkSelf
Copy link
Collaborator Author

JkSelf commented May 27, 2024

@mbasmanova @Yuhta Can you help to merge? Thanks for your help.

@Yuhta Yuhta removed the ready-to-merge PR that have been reviewed and are ready for merging. PRs with this tag notify the Velox Meta oncall label May 27, 2024
@ayushi-agarwal
Copy link

I ran q77 of tpcds with on 1TB scale using gluten/velox as execution layer with this change, and I see
Normal spark on stages page: Tasks: 31/31 Input 4.7 MiB Shuffle Write 1195.4 KiB
Native on stages page Tasks: 31/31 Input 20.5 MiB 1229.6 KiB - These are the number of raw input bytes.
Screenshot from 2024-05-06 18-05-56

Is this difference expected?

@kgpai
Copy link
Contributor

kgpai commented Jun 10, 2024

@JkSelf Can you fix the conflicts ?

@JkSelf
Copy link
Collaborator Author

JkSelf commented Jun 11, 2024

@JkSelf Can you fix the conflicts ?

@kgpai Rebased. Can you help to review again? Thanks.

@JkSelf
Copy link
Collaborator Author

JkSelf commented Jun 11, 2024

I ran q77 of tpcds with on 1TB scale using gluten/velox as execution layer with this change, and I see Normal spark on stages page: Tasks: 31/31 Input 4.7 MiB Shuffle Write 1195.4 KiB Native on stages page Tasks: 31/31 Input 20.5 MiB 1229.6 KiB - These are the number of raw input bytes. Screenshot from 2024-05-06 18-05-56

Is this difference expected?

@ayushi-agarwal In Velox, the raw input bytes refer to the size of data in memory, whereas the standard Spark UI displays the size of data on disk.

@facebook-github-bot
Copy link
Contributor

@kagamiori has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

// A quick sanity check for memory usage reporting. Check that peak total
// memory usage for the project node is > 0.
auto planStats = toPlanStats(task->taskStats());
auto scanNodeId = plan->id();
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @JkSelf, plan is used after move at line 297.

Could you please also rebase on top of the latest main after fixing this? Rebasing would help with the merge. Thanks!

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@kgpai Thanks for your catching. I have fixed and rebased. Thanks.

@facebook-github-bot
Copy link
Contributor

@kagamiori has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@kagamiori merged this pull request in 6b32339.

Copy link

Conbench analyzed the 1 benchmark run on commit 6b32339c.

There were no benchmark performance regressions. 🎉

The full Conbench report has more details.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants