-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
sql: determine impact of sql.stats.automatic_collection.enabled cluster setting on sysbench oltp_read_write #135988
Comments
This issue has multiple T-eam labels. Please make sure it only has one, or else issue synchronization will not work correctly. 🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf. |
@tbg could you clarify whether this is about SQL table statistics, or observability statement statistics? They're both informally called "SQL stats" unfortunately, but they are not the same. |
Oh, based on this thread, it seems to be the SQL statement stats owned by observability? Moving. |
This issue came from turning off both I think that setting has to do with full table scans that recreate table statistics, so it seems likely to have an impact: cockroach/pkg/sql/stats/automatic_stats.go Lines 581 to 597 in 27e5b9d
@mgartner could you chime in? |
I just kicked off a run where I'm testing this again on top of the 4 cluster settings which we know improve performance and stability. I'm hoping that this will give a more reliable result. Will report back when I have that data. |
In the repeated run, I'm still seeing a ~3% degradation with |
That could be possible. At the end of the runs with these settings, do we have any stats for these tables at all? When I disabled this, it was on a longer-running cluster. I think we do want stats, but measure the overhead of repeatedly recreating those stats if the underlying tables don't really change. |
Can I confirm that with the creation of the issues for flush and sql exec stats recording, this issue is only tracking the impact of |
[queries triage] if we understand correctly, right now it's unclear whether running auto stats has detrimental effect on the sysbench numbers overall, and as such we removed this from out project board. If there is detrimental effect, and Queries needs to look closer, please add it back to our project. |
Note that the
|
I still need to dig further into this one, but my theory is that auto stats is actually helpful to these runs because we're not manually collecting stats before the run. That being said, histogram collection seems to harm the runs because (again a guess) the histogram stats aren't helpful for this workload. Will need to do more digging on this in the new year. |
See here.
Epic: CRDB-42584
Jira issue: CRDB-44818
The text was updated successfully, but these errors were encountered: