Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SegmentPool improvements #352

Merged
merged 20 commits into from
Jul 12, 2024
Merged

SegmentPool improvements #352

merged 20 commits into from
Jul 12, 2024

Conversation

fzhinkin
Copy link
Collaborator

@fzhinkin fzhinkin commented Jul 4, 2024

Currently, SegmentPool implementation on JVM works more like a cache than a pool: it has a relatively small fixed size, segments are taken from the pool on an effort basis, and it's easy to make a segment non-recyclable (meaning that it will never be returned to the pool).

This behavior is totally fine for many applications, but it's not for Ktor.
This PR addresses several issues:

  • Support precise segment reference tracking to return shared segments back to the pool eventually. Currently, once a segment is shared, it is never returned to the pool because the segment state is tracked by a flag that could be set but could not be used. To solve that problem, on JVM, the exact number of shared copies is now tracked, and the value is decremented on attempts to recycle shared copies. Once the last copy is passed to the Segment.recycle, it'll be returned to the pool.
  • Retry lost CAS attempts when taking or returning segments from the pool. The previous sentence is quite descriptive; I'll only add that it does not make much sense without the next change.
  • Support segment pool size configuration. Currently, the segment pool consists of multiple buckets (the exact number depends on the system's logical CPU count), each with a max size of 64KiB. This PR adds a second level/tier to the pool with half as many buckets as an old first tier. The total size of the new tier is configurable via system properties and is 0 by default. All failed attempts to take or recycle segments in the first tier use the second tier as a fallback. Unlike the first tier, if the second tier's bucket is empty, take will scan all other buckets. The scan is also performed on recycle to a full bucket.

All these changes reduce the allocation count by combining more precise work with the pool and optionally extensible pool size.
Two-level design helps with avoiding performance penalties induced by the scan technique employed by the second tier when that tier is not used and, at the same time, allows using the same sharding technique as in the first tier enhanced with the bucket steeling techniques.

@fzhinkin fzhinkin requested review from shanshin and qwwdfsad July 4, 2024 13:59
@fzhinkin
Copy link
Collaborator Author

fzhinkin commented Jul 4, 2024

CC @e5l @bjhham

@fzhinkin fzhinkin mentioned this pull request Jul 4, 2024
7 tasks
private val HASH_BUCKET_COUNT_L2 = (HASH_BUCKET_COUNT / 2).coerceAtLeast(1)

private val SECOND_LEVEL_POOL_TOTAL_SIZE =
System.getProperty("kotlinx.io.pool.size.bytes", "0").toInt().coerceAtLeast(0)
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Discussed with @e5l that it makes sense to bump up the default size up to several megs (probably, 4Mb).

Not sure if it should be done on Android.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What really puzzles me here is how it affects the end users.
Because it seems like the characteristics of the IO (which is the way to do a lot of stuff) will depend on whether or not Ktor is used in the project.

I don't mind it as a palliative intermediate solution as the ground for further benchmarking, but definitely not something to keep as as

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I checked how BufferReadWriteByteArray performs depending on the presence of the second-level pool and the working set size.

If WSS is within L1-pool, there's no difference.
If WSS doesn't fit into L1-pool, it might be beneficial not to use L2-pool (i.e. it's faster w/o L2-pool) until some threshold size, but then L2-pool boosts the performance.
But it's hard to judge by looking at synthetic benchmarks only, as they provide zero to no evidence regarding GC-related effects on overall performance.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can always revert it and also provide some alternative mechanism to adjust the pool size, so those who really demands it (Ktor), could tune it when needed.

Copy link
Collaborator

@qwwdfsad qwwdfsad left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Round of cosmetics to speed up the convergence, still digging into the segment pool details

core/common/src/Segment.kt Outdated Show resolved Hide resolved
core/common/src/Segment.kt Outdated Show resolved Hide resolved
core/common/src/Segment.kt Outdated Show resolved Hide resolved
core/common/src/Segment.kt Outdated Show resolved Hide resolved
core/jvm/src/SegmentPool.kt Outdated Show resolved Hide resolved
core/jvm/src/SegmentPool.kt Outdated Show resolved Hide resolved
core/jvm/src/SegmentPool.kt Outdated Show resolved Hide resolved
core/common/src/SegmentPool.kt Show resolved Hide resolved
core/jvm/src/SegmentPool.kt Outdated Show resolved Hide resolved
fzhinkin and others added 2 commits July 10, 2024 15:46
Co-authored-by: Vsevolod Tolstopyatov <qwwdfsad@gmail.com>
@fzhinkin fzhinkin requested a review from qwwdfsad July 10, 2024 14:28
else -> "4194304" // 4MB
}

private val SECOND_LEVEL_POOL_TOTAL_SIZE =
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

when is the first usage of class SegmentPool?

Is there a high chance of programmatically changing this value before its first reading?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

when is the first usage of class SegmentPool?

On a first write into a buffer

Is there a high chance of programmatically changing this value before its first reading?

I think it's achievable by updating the system property before SegmentPool is initialized.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it's achievable by updating the system property before SegmentPool is initialized.

if this is acceptable, then fine

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't see any severe consequences of that. That's the only kind of dynamic configuration (#352 (comment)) we can provide for now. :)

core/common/src/unsafe/UnsafeBufferOperations.kt Outdated Show resolved Hide resolved
core/common/src/Segment.kt Outdated Show resolved Hide resolved
core/jvm/src/SegmentPool.kt Outdated Show resolved Hide resolved
core/jvm/src/SegmentPool.kt Show resolved Hide resolved
core/jvm/test/PoolingTest.kt Show resolved Hide resolved
@fzhinkin fzhinkin requested a review from shanshin July 11, 2024 12:45
@fzhinkin fzhinkin merged commit be2bfca into develop Jul 12, 2024
1 check passed
@fzhinkin fzhinkin deleted the two-level-segment-pool branch July 12, 2024 10:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants