-
-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tweak EntityHasher
for more speed
#10605
Closed
Closed
Changes from all commits
Commits
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -266,29 +266,60 @@ impl BuildHasher for EntityHash { | |
/// A very fast hash that is only designed to work on generational indices | ||
/// like `Entity`. It will panic if attempting to hash a type containing | ||
/// non-u64 fields. | ||
/// | ||
/// This is heavily optimized for typical cases, where there are lots of runs | ||
/// of contiguous indices and almost no generation conflicts. | ||
/// | ||
/// If you have an unusual case -- say all your indices are multiples of 256 | ||
/// or most of the entities are dead generations -- then you might want also to | ||
/// try [`AHasher`] for a slower hash computation but fewer lookup conflicts. | ||
#[derive(Debug, Default)] | ||
pub struct EntityHasher { | ||
hash: u64, | ||
} | ||
|
||
// This value comes from rustc-hash (also known as FxHasher) which in turn got | ||
// it from Firefox. It is something like `u64::MAX / N` for an N that gives a | ||
// value close to π and works well for distributing bits for hashing when using | ||
// with a wrapping multiplication. | ||
const FRAC_U64MAX_PI: u64 = 0x517cc1b727220a95; | ||
|
||
impl Hasher for EntityHasher { | ||
fn write(&mut self, _bytes: &[u8]) { | ||
panic!("can only hash u64 using EntityHasher"); | ||
} | ||
|
||
#[inline] | ||
fn write_u64(&mut self, i: u64) { | ||
// Apparently hashbrown's hashmap uses the upper 7 bits for some SIMD | ||
// optimisation that uses those bits for binning. This hash function | ||
// was faster than i | (i << (64 - 7)) in the worst cases, and was | ||
// faster than PassHasher for all cases tested. | ||
self.hash = i | (i.wrapping_mul(FRAC_U64MAX_PI) << 32); | ||
// We ignore the generation entirely. It's always functionally correct | ||
// to omit things when hashing, so long as it's consistent, just a perf | ||
// trade-off. This hasher is designed for "normal" cases, where nearly | ||
// everything in the table is a live entity, meaning there are few | ||
// generation conflicts. And thus it's overall faster to just ignore | ||
// the generation during hashing, leaving it to the `Entity::eq` to | ||
// confirm the generation matches -- just like `Entity::eq` checks that | ||
// the index is actually the right one, since there's always the chance | ||
// of a conflict in the index despite a good hash function. | ||
// | ||
// This masking actually ends up with negative cost after optimization, | ||
// since it saves needing to do the shift-and-or between the fields. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Demonstration of this by comparing the codegen without the masking to having the masking: https://rust.godbolt.org/z/3vvsT38ar |
||
let index = i & 0xFFFF_FFFF; | ||
|
||
// SwissTable (and thus `hashbrown`) cares about two things from the hash: | ||
// - H1: low bits (masked by `2ⁿ-1`) to pick the slot in which to store the item | ||
// - H2: high 7 bits are used to SIMD optimize hash collision probing | ||
// For more see <https://abseil.io/about/design/swisstables#metadata-layout> | ||
|
||
// This hash function assumes that the entity ids are still well-distributed, | ||
// so for H1 leaves the entity id alone in the low bits so that id locality | ||
// will also give memory locality for things spawned together. | ||
// For H2, take advantage of the fact that while multiplication doesn't | ||
// spread entropy to the low bits, it's incredibly good at spreading it | ||
// upward, which is exactly where we need it the most. | ||
|
||
// The high 32 bits of this are ⅟φ for Fibonacci hashing. That works | ||
// particularly well for hashing for the same reason as described in | ||
// <https://extremelearning.com.au/unreasonable-effectiveness-of-quasirandom-sequences/> | ||
// It loses no information because it has a modular inverse. | ||
// (Specifically, `0x144c_bc89_u32 * 0x9e37_79b9_u32 == 1`.) | ||
// | ||
// The low 32 bits are just 1, to leave the entity id there unchanged. | ||
const UPPER_PHI: u64 = 0x9e37_79b9_0000_0001; | ||
self.hash = index.wrapping_mul(UPPER_PHI); | ||
} | ||
|
||
#[inline] | ||
|
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Demonstration of the overall assembly diff with this change: https://rust.godbolt.org/z/9zxfMdEfv