Improved `checked_isqrt` and `isqrt` methods #128166

ChaiTRex · 2024-07-25T00:23:29Z

Improved tests of `isqrt` and `checked_isqrt` implementations

Inputs chosen more thoroughly and systematically.
Checks that isqrt and checked_isqrt have equivalent results for signed types, either equivalent numerically or equivalent as a panic and a None.
Checks that isqrt has numerically-equivalent results for unsigned types and their NonZero counterparts.

Added benchmarks for `isqrt` implementations

Greatly sped up `checked_isqrt` and `isqrt` methods

Uses a lookup table for 8-bit integers and then the Karatsuba square root algorithm for larger integers.
Includes optimization hints that give the compiler the exact numeric range of results.

Feature tracking issue

isqrt is an unstable feature tracked at #116226.

Benchmarked improvements

Command used to benchmark

./x bench library/core -- int_sqrt

Before

benchmarks:
    num::int_sqrt::i128::isqrt           439591.65/iter  +/- 6652.70
    num::int_sqrt::i16::isqrt              5302.97/iter   +/- 160.93
    num::int_sqrt::i32::isqrt             62999.11/iter  +/- 2022.05
    num::int_sqrt::i64::isqrt            125248.81/iter  +/- 1674.43
    num::int_sqrt::i8::isqrt                123.56/iter     +/- 1.87
    num::int_sqrt::isize::isqrt          125356.56/iter  +/- 1017.03
    num::int_sqrt::non_zero_u128::isqrt  437443.75/iter  +/- 3535.43
    num::int_sqrt::non_zero_u16::isqrt     8604.58/iter    +/- 94.76
    num::int_sqrt::non_zero_u32::isqrt    62933.33/iter   +/- 517.30
    num::int_sqrt::non_zero_u64::isqrt   125076.38/iter +/- 11340.61
    num::int_sqrt::non_zero_u8::isqrt       221.51/iter     +/- 1.58
    num::int_sqrt::non_zero_usize::isqrt 136005.21/iter  +/- 2020.35
    num::int_sqrt::u128::isqrt           439014.55/iter  +/- 3920.45
    num::int_sqrt::u16::isqrt              8575.08/iter   +/- 148.06
    num::int_sqrt::u32::isqrt             63008.89/iter   +/- 803.67
    num::int_sqrt::u64::isqrt            125088.09/iter   +/- 879.29
    num::int_sqrt::u8::isqrt                230.18/iter     +/- 2.04
    num::int_sqrt::usize::isqrt          125237.51/iter  +/- 4747.83

After

benchmarks:
    num::int_sqrt::i128::isqrt           105184.89/iter +/- 1171.38
    num::int_sqrt::i16::isqrt              1910.26/iter   +/- 78.50
    num::int_sqrt::i32::isqrt             34260.34/iter  +/- 960.84
    num::int_sqrt::i64::isqrt             45939.19/iter +/- 2525.65
    num::int_sqrt::i8::isqrt                 22.87/iter    +/- 0.45
    num::int_sqrt::isize::isqrt           45884.17/iter  +/- 595.49
    num::int_sqrt::non_zero_u128::isqrt  106344.27/iter  +/- 780.99
    num::int_sqrt::non_zero_u16::isqrt     2790.19/iter   +/- 53.43
    num::int_sqrt::non_zero_u32::isqrt    33613.99/iter  +/- 362.96
    num::int_sqrt::non_zero_u64::isqrt    46235.42/iter  +/- 429.69
    num::int_sqrt::non_zero_u8::isqrt        31.78/iter    +/- 0.75
    num::int_sqrt::non_zero_usize::isqrt  46208.75/iter  +/- 375.27
    num::int_sqrt::u128::isqrt           106385.94/iter +/- 1649.95
    num::int_sqrt::u16::isqrt              2747.69/iter   +/- 28.72
    num::int_sqrt::u32::isqrt             33627.09/iter  +/- 475.68
    num::int_sqrt::u64::isqrt             46182.29/iter  +/- 311.16
    num::int_sqrt::u8::isqrt                 33.10/iter    +/- 0.30
    num::int_sqrt::usize::isqrt           46165.00/iter  +/- 388.41

Tracking Issue for {u8,i8,...}::isqrt #116226

try-job: test-various

rustbot · 2024-07-25T00:23:36Z

r? @tgross35

rustbot has assigned @tgross35.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

tgross35 · 2024-07-25T00:35:57Z

I'll take a closer look in a bit but FYI your last commit message has a typo (s/sped/speed)

ChaiTRex · 2024-07-25T00:57:06Z

The last force push added a missing file.

I'll take a closer look in a bit but FYI your last commit message has a typo (s/sped/speed)

OK, thanks for taking a look. Oh, I was using 'sped' as a past tense of the verb form of 'speed'.

tgross35

I left a very superficial review, will need to take some time to double check the important parts (I think the proposed deduplication will make this a bit more straightforward). At a high level these changes are awesome, thanks for the PR!

library/core/src/num/int_sqrt.rs

library/core/src/num/int_macros.rs

library/core/src/num/int_sqrt.rs

library/core/benches/num/int_sqrt/mod.rs

library/core/src/num/int_macros.rs

library/core/src/num/int_sqrt.rs

tgross35 · 2024-07-25T01:42:13Z

I'll take a closer look in a bit but FYI your last commit message has a typo (s/sped/speed)

OK, thanks for taking a look. Oh, I was using 'sped' as a past tense of the verb form of 'speed'.

Ah, I see you did the same for the other commits too. Commit messages are usually in imperative mood to describe what the change does, i.e. "Improve test", "Add benchmarks" "Greatly speed up". Hence my confusion :)

(Rust has no commit message policy so no need to anything if you don't feel like it).

library/core/src/num/int_sqrt.rs

library/core/tests/num/int_macros.rs

library/core/tests/num/int_sqrt.rs

tgross35 · 2024-07-26T08:21:00Z

Looks pretty reasonable after a more thorough read through, awesome improvements to the tests. I think this should be good after combining some similar code, just to make things a bit easier to understand & maintain.

If you run benchmarks again, could you post your cpu model for a reference?

@rustbot author

ChaiTRex · 2024-08-01T06:32:38Z

If you run benchmarks again, could you post your cpu model for a reference?

I was using aarch64-apple-darwin on a MacBook Air with an M1 processor under macOS 14.5.1 (now 14.6).

After I replaced the benchmarks with equivalents to ilog10's benchmarks (which use an exponential distribution for the random inputs), I get the following:

Benchmarks

Before this pull request

benchmarks:
    num::int_sqrt::u128_sqrt_predictable  108945.82/iter +/- 1673.86
    num::int_sqrt::u128_sqrt_random        16389.17/iter  +/- 231.44
    num::int_sqrt::u128_sqrt_random_small   1290.22/iter   +/- 41.90
    num::int_sqrt::u16_sqrt_predictable     1080.06/iter   +/- 43.44
    num::int_sqrt::u16_sqrt_random          1064.57/iter   +/- 20.79
    num::int_sqrt::u16_sqrt_random_small     579.85/iter   +/- 13.81
    num::int_sqrt::u32_sqrt_predictable     2979.65/iter   +/- 29.79
    num::int_sqrt::u32_sqrt_random          1718.20/iter   +/- 58.86
    num::int_sqrt::u32_sqrt_random_small     480.19/iter   +/- 26.59
    num::int_sqrt::u64_sqrt_predictable    13089.55/iter  +/- 484.72
    num::int_sqrt::u64_sqrt_random          3751.04/iter   +/- 79.66
    num::int_sqrt::u64_sqrt_random_small     498.58/iter    +/- 3.60
    num::int_sqrt::u8_sqrt_predictable       329.07/iter   +/- 12.64
    num::int_sqrt::u8_sqrt_random            576.37/iter   +/- 47.12
    num::int_sqrt::u8_sqrt_random_small      589.01/iter   +/- 22.53

When I first made this pull request

benchmarks:
    num::int_sqrt::u128_sqrt_predictable  30728.33/iter +/- 1093.31
    num::int_sqrt::u128_sqrt_random        4392.30/iter  +/- 149.35
    num::int_sqrt::u128_sqrt_random_small   323.58/iter    +/- 7.26
    num::int_sqrt::u16_sqrt_predictable     304.50/iter   +/- 10.60
    num::int_sqrt::u16_sqrt_random          334.78/iter   +/- 18.02
    num::int_sqrt::u16_sqrt_random_small    139.06/iter    +/- 2.50
    num::int_sqrt::u32_sqrt_predictable    1585.35/iter   +/- 59.03
    num::int_sqrt::u32_sqrt_random          972.40/iter   +/- 38.47
    num::int_sqrt::u32_sqrt_random_small    149.02/iter    +/- 4.50
    num::int_sqrt::u64_sqrt_predictable    5641.28/iter  +/- 136.16
    num::int_sqrt::u64_sqrt_random         1703.97/iter   +/- 43.75
    num::int_sqrt::u64_sqrt_random_small    197.42/iter    +/- 4.03
    num::int_sqrt::u8_sqrt_predictable       42.90/iter    +/- 1.39
    num::int_sqrt::u8_sqrt_random           133.55/iter    +/- 3.92
    num::int_sqrt::u8_sqrt_random_small     133.85/iter    +/- 2.36

With the changes I'm going to push soon

benchmarks:
    num::int_sqrt::u128_sqrt_predictable  26622.63/iter +/- 386.54
    num::int_sqrt::u128_sqrt_random        3584.59/iter +/- 131.42
    num::int_sqrt::u128_sqrt_random_small   904.23/iter  +/- 22.04
    num::int_sqrt::u16_sqrt_predictable     388.01/iter   +/- 7.85
    num::int_sqrt::u16_sqrt_random          480.00/iter  +/- 16.07
    num::int_sqrt::u16_sqrt_random_small    332.05/iter   +/- 7.46
    num::int_sqrt::u32_sqrt_predictable    1317.23/iter  +/- 25.71
    num::int_sqrt::u32_sqrt_random          783.17/iter   +/- 8.53
    num::int_sqrt::u32_sqrt_random_small    415.10/iter  +/- 14.06
    num::int_sqrt::u64_sqrt_predictable    4563.42/iter +/- 129.48
    num::int_sqrt::u64_sqrt_random         1378.32/iter  +/- 60.46
    num::int_sqrt::u64_sqrt_random_small    658.41/iter   +/- 8.93
    num::int_sqrt::u8_sqrt_predictable       42.90/iter   +/- 0.82
    num::int_sqrt::u8_sqrt_random           132.98/iter   +/- 1.53
    num::int_sqrt::u8_sqrt_random_small     132.94/iter   +/- 2.94

There's a tradeoff here. In the push I'm going to make soon, randomly chosen inputs from the whole input type range get a decent speedup from the initial pull request's code, but randomly chosen small inputs get about 3 times slower. Is that a good tradeoff? Is there another benchmark that should be added?

ChaiTRex · 2024-08-01T07:20:58Z

With a uniform distribution, we get these results:

Uniform distribution benchmarks

original_u8             time:   [9.7782 ns 9.8095 ns 9.8501 ns]
karatsuba_u8            time:   [5.2377 ns 5.2448 ns 5.2531 ns]
karatsuba_2_u8          time:   [5.2304 ns 5.2347 ns 5.2399 ns]

original_u16            time:   [13.415 ns 13.437 ns 13.465 ns]
karatsuba_u16           time:   [6.7425 ns 6.7487 ns 6.7568 ns]
karatsuba_2_u16         time:   [6.3988 ns 6.4035 ns 6.4094 ns]

original_u32            time:   [19.016 ns 19.044 ns 19.078 ns]
karatsuba_u32           time:   [10.162 ns 10.182 ns 10.211 ns]
karatsuba_2_u32         time:   [8.8756 ns 8.8866 ns 8.8993 ns]

original_u64            time:   [44.361 ns 44.413 ns 44.480 ns]
karatsuba_u64           time:   [19.043 ns 19.144 ns 19.276 ns]
karatsuba_2_u64         time:   [16.330 ns 16.359 ns 16.392 ns]

original_u128           time:   [147.14 ns 147.49 ns 147.88 ns]
karatsuba_u128          time:   [51.335 ns 51.455 ns 51.602 ns]
karatsuba_2_u128        time:   [45.447 ns 45.507 ns 45.575 ns]

library/core/src/num/int_sqrt.rs

library/core/src/num/int_macros.rs

library/core/src/num/nonzero.rs

library/core/src/num/int_sqrt.rs

library/core/src/num/int_macros.rs

library/core/src/num/int_sqrt.rs

tgross35 · 2024-08-12T07:26:18Z

Thanks for the changes, this looks a lot tidier with the deduplication. I left a handful of new comments but those are just small things, nothing major.

I'm not sure whether you had been waiting on a review, but comment @rustbot ready whenever you are done. This will need a squash before merge but no need to do that until it is final.

ChaiTRex · 2024-08-19T23:23:13Z

With regard to the squash, would it be alright to have two final commits, one for the testing and benchmarking and the next for the implementation? This would allow the benchmarking at least to be run on the prior implementation for comparison.

ChaiTRex · 2024-08-20T00:42:54Z

@rustbot ready

bors · 2024-08-29T03:42:57Z

⌛ Trying commit 7af8e21 with merge daf8e6a...

Improved `checked_isqrt` and `isqrt` methods ### Improved tests of `isqrt` and `checked_isqrt` implementations * Inputs chosen more thoroughly and systematically. * Checks that `isqrt` and `checked_isqrt` have equivalent results for signed types, either equivalent numerically or equivalent as a panic and a `None`. * Checks that `isqrt` has numerically-equivalent results for unsigned types and their `NonZero` counterparts. ### Added benchmarks for `isqrt` implementations ### Greatly sped up `checked_isqrt` and `isqrt` methods * Uses a lookup table for 8-bit integers and then the Karatsuba square root algorithm for larger integers. * Includes optimization hints that give the compiler the exact numeric range of results. ### Feature tracking issue `isqrt` is an unstable feature tracked at rust-lang#116226. <details><summary>Benchmarked improvements</summary> ### Command used to benchmark ./x bench library/core -- int_sqrt ### Before benchmarks: num::int_sqrt::i128::isqrt 439591.65/iter +/- 6652.70 num::int_sqrt::i16::isqrt 5302.97/iter +/- 160.93 num::int_sqrt::i32::isqrt 62999.11/iter +/- 2022.05 num::int_sqrt::i64::isqrt 125248.81/iter +/- 1674.43 num::int_sqrt::i8::isqrt 123.56/iter +/- 1.87 num::int_sqrt::isize::isqrt 125356.56/iter +/- 1017.03 num::int_sqrt::non_zero_u128::isqrt 437443.75/iter +/- 3535.43 num::int_sqrt::non_zero_u16::isqrt 8604.58/iter +/- 94.76 num::int_sqrt::non_zero_u32::isqrt 62933.33/iter +/- 517.30 num::int_sqrt::non_zero_u64::isqrt 125076.38/iter +/- 11340.61 num::int_sqrt::non_zero_u8::isqrt 221.51/iter +/- 1.58 num::int_sqrt::non_zero_usize::isqrt 136005.21/iter +/- 2020.35 num::int_sqrt::u128::isqrt 439014.55/iter +/- 3920.45 num::int_sqrt::u16::isqrt 8575.08/iter +/- 148.06 num::int_sqrt::u32::isqrt 63008.89/iter +/- 803.67 num::int_sqrt::u64::isqrt 125088.09/iter +/- 879.29 num::int_sqrt::u8::isqrt 230.18/iter +/- 2.04 num::int_sqrt::usize::isqrt 125237.51/iter +/- 4747.83 ### After benchmarks: num::int_sqrt::i128::isqrt 105184.89/iter +/- 1171.38 num::int_sqrt::i16::isqrt 1910.26/iter +/- 78.50 num::int_sqrt::i32::isqrt 34260.34/iter +/- 960.84 num::int_sqrt::i64::isqrt 45939.19/iter +/- 2525.65 num::int_sqrt::i8::isqrt 22.87/iter +/- 0.45 num::int_sqrt::isize::isqrt 45884.17/iter +/- 595.49 num::int_sqrt::non_zero_u128::isqrt 106344.27/iter +/- 780.99 num::int_sqrt::non_zero_u16::isqrt 2790.19/iter +/- 53.43 num::int_sqrt::non_zero_u32::isqrt 33613.99/iter +/- 362.96 num::int_sqrt::non_zero_u64::isqrt 46235.42/iter +/- 429.69 num::int_sqrt::non_zero_u8::isqrt 31.78/iter +/- 0.75 num::int_sqrt::non_zero_usize::isqrt 46208.75/iter +/- 375.27 num::int_sqrt::u128::isqrt 106385.94/iter +/- 1649.95 num::int_sqrt::u16::isqrt 2747.69/iter +/- 28.72 num::int_sqrt::u32::isqrt 33627.09/iter +/- 475.68 num::int_sqrt::u64::isqrt 46182.29/iter +/- 311.16 num::int_sqrt::u8::isqrt 33.10/iter +/- 0.30 num::int_sqrt::usize::isqrt 46165.00/iter +/- 388.41 </details> Tracking Issue for {u8,i8,...}::isqrt rust-lang#116226 try-job: test-various

bors · 2024-08-29T04:46:51Z

☀️ Try build successful - checks-actions
Build commit: daf8e6a (daf8e6a755a4ccac4f08164a49fff14a2cd7e1d0)

tgross35 · 2024-08-29T04:49:45Z

Great! Thanks for the fix.

@bors r+

bors · 2024-08-29T04:49:47Z

📌 Commit 7af8e21 has been approved by tgross35

It is now in the queue for this repository.

workingjubilee · 2024-08-29T05:19:42Z

Sorry to bother but is there a particular reason that #[should_panic] annotations on a separate test function is not appropriate? Like so: https://doc.rust-lang.org/rust-by-example/testing/unit_testing.html#testing-panics

tgross35 · 2024-08-29T05:23:26Z

I thought about that too - but it runs the tests on a range of numbers, so it's not really feasible. In theory we could spot check a few numbers or we probably don't really need to test this, but it seems pretty harmless.

workingjubilee · 2024-08-29T05:32:12Z

hm, okay!

workingjubilee · 2024-08-29T05:33:39Z

The most useful negative integers to test are usually -1, -2, -10, and i{N}::MIN.

The last often manages to do something clever.

Improved `checked_isqrt` and `isqrt` methods ### Improved tests of `isqrt` and `checked_isqrt` implementations * Inputs chosen more thoroughly and systematically. * Checks that `isqrt` and `checked_isqrt` have equivalent results for signed types, either equivalent numerically or equivalent as a panic and a `None`. * Checks that `isqrt` has numerically-equivalent results for unsigned types and their `NonZero` counterparts. ### Added benchmarks for `isqrt` implementations ### Greatly sped up `checked_isqrt` and `isqrt` methods * Uses a lookup table for 8-bit integers and then the Karatsuba square root algorithm for larger integers. * Includes optimization hints that give the compiler the exact numeric range of results. ### Feature tracking issue `isqrt` is an unstable feature tracked at rust-lang#116226. <details><summary>Benchmarked improvements</summary> ### Command used to benchmark ./x bench library/core -- int_sqrt ### Before benchmarks: num::int_sqrt::i128::isqrt 439591.65/iter +/- 6652.70 num::int_sqrt::i16::isqrt 5302.97/iter +/- 160.93 num::int_sqrt::i32::isqrt 62999.11/iter +/- 2022.05 num::int_sqrt::i64::isqrt 125248.81/iter +/- 1674.43 num::int_sqrt::i8::isqrt 123.56/iter +/- 1.87 num::int_sqrt::isize::isqrt 125356.56/iter +/- 1017.03 num::int_sqrt::non_zero_u128::isqrt 437443.75/iter +/- 3535.43 num::int_sqrt::non_zero_u16::isqrt 8604.58/iter +/- 94.76 num::int_sqrt::non_zero_u32::isqrt 62933.33/iter +/- 517.30 num::int_sqrt::non_zero_u64::isqrt 125076.38/iter +/- 11340.61 num::int_sqrt::non_zero_u8::isqrt 221.51/iter +/- 1.58 num::int_sqrt::non_zero_usize::isqrt 136005.21/iter +/- 2020.35 num::int_sqrt::u128::isqrt 439014.55/iter +/- 3920.45 num::int_sqrt::u16::isqrt 8575.08/iter +/- 148.06 num::int_sqrt::u32::isqrt 63008.89/iter +/- 803.67 num::int_sqrt::u64::isqrt 125088.09/iter +/- 879.29 num::int_sqrt::u8::isqrt 230.18/iter +/- 2.04 num::int_sqrt::usize::isqrt 125237.51/iter +/- 4747.83 ### After benchmarks: num::int_sqrt::i128::isqrt 105184.89/iter +/- 1171.38 num::int_sqrt::i16::isqrt 1910.26/iter +/- 78.50 num::int_sqrt::i32::isqrt 34260.34/iter +/- 960.84 num::int_sqrt::i64::isqrt 45939.19/iter +/- 2525.65 num::int_sqrt::i8::isqrt 22.87/iter +/- 0.45 num::int_sqrt::isize::isqrt 45884.17/iter +/- 595.49 num::int_sqrt::non_zero_u128::isqrt 106344.27/iter +/- 780.99 num::int_sqrt::non_zero_u16::isqrt 2790.19/iter +/- 53.43 num::int_sqrt::non_zero_u32::isqrt 33613.99/iter +/- 362.96 num::int_sqrt::non_zero_u64::isqrt 46235.42/iter +/- 429.69 num::int_sqrt::non_zero_u8::isqrt 31.78/iter +/- 0.75 num::int_sqrt::non_zero_usize::isqrt 46208.75/iter +/- 375.27 num::int_sqrt::u128::isqrt 106385.94/iter +/- 1649.95 num::int_sqrt::u16::isqrt 2747.69/iter +/- 28.72 num::int_sqrt::u32::isqrt 33627.09/iter +/- 475.68 num::int_sqrt::u64::isqrt 46182.29/iter +/- 311.16 num::int_sqrt::u8::isqrt 33.10/iter +/- 0.30 num::int_sqrt::usize::isqrt 46165.00/iter +/- 388.41 </details> Tracking Issue for {u8,i8,...}::isqrt rust-lang#116226 try-job: test-various

tgross35 · 2024-08-29T05:41:09Z

Ah that's actually a good point - I thought we were checking a range but it looks like we are checking MIN https://github.com/rust-lang/rust/pull/128166/files#diff-eef7e7d7a0a129c86de4f1152c95cbbfcdf4dd594cfbf8d4a42c5b7f90746f4dR24.

@ChaiTRex am I missing something? If not, it would be good to add Jubilee's other suggested spot checks since the machinery is already there. It can just be a follow up since this is already in a rollup, and I don't really expect them to fail.

Improved `checked_isqrt` and `isqrt` methods ### Improved tests of `isqrt` and `checked_isqrt` implementations * Inputs chosen more thoroughly and systematically. * Checks that `isqrt` and `checked_isqrt` have equivalent results for signed types, either equivalent numerically or equivalent as a panic and a `None`. * Checks that `isqrt` has numerically-equivalent results for unsigned types and their `NonZero` counterparts. ### Added benchmarks for `isqrt` implementations ### Greatly sped up `checked_isqrt` and `isqrt` methods * Uses a lookup table for 8-bit integers and then the Karatsuba square root algorithm for larger integers. * Includes optimization hints that give the compiler the exact numeric range of results. ### Feature tracking issue `isqrt` is an unstable feature tracked at rust-lang#116226. <details><summary>Benchmarked improvements</summary> ### Command used to benchmark ./x bench library/core -- int_sqrt ### Before benchmarks: num::int_sqrt::i128::isqrt 439591.65/iter +/- 6652.70 num::int_sqrt::i16::isqrt 5302.97/iter +/- 160.93 num::int_sqrt::i32::isqrt 62999.11/iter +/- 2022.05 num::int_sqrt::i64::isqrt 125248.81/iter +/- 1674.43 num::int_sqrt::i8::isqrt 123.56/iter +/- 1.87 num::int_sqrt::isize::isqrt 125356.56/iter +/- 1017.03 num::int_sqrt::non_zero_u128::isqrt 437443.75/iter +/- 3535.43 num::int_sqrt::non_zero_u16::isqrt 8604.58/iter +/- 94.76 num::int_sqrt::non_zero_u32::isqrt 62933.33/iter +/- 517.30 num::int_sqrt::non_zero_u64::isqrt 125076.38/iter +/- 11340.61 num::int_sqrt::non_zero_u8::isqrt 221.51/iter +/- 1.58 num::int_sqrt::non_zero_usize::isqrt 136005.21/iter +/- 2020.35 num::int_sqrt::u128::isqrt 439014.55/iter +/- 3920.45 num::int_sqrt::u16::isqrt 8575.08/iter +/- 148.06 num::int_sqrt::u32::isqrt 63008.89/iter +/- 803.67 num::int_sqrt::u64::isqrt 125088.09/iter +/- 879.29 num::int_sqrt::u8::isqrt 230.18/iter +/- 2.04 num::int_sqrt::usize::isqrt 125237.51/iter +/- 4747.83 ### After benchmarks: num::int_sqrt::i128::isqrt 105184.89/iter +/- 1171.38 num::int_sqrt::i16::isqrt 1910.26/iter +/- 78.50 num::int_sqrt::i32::isqrt 34260.34/iter +/- 960.84 num::int_sqrt::i64::isqrt 45939.19/iter +/- 2525.65 num::int_sqrt::i8::isqrt 22.87/iter +/- 0.45 num::int_sqrt::isize::isqrt 45884.17/iter +/- 595.49 num::int_sqrt::non_zero_u128::isqrt 106344.27/iter +/- 780.99 num::int_sqrt::non_zero_u16::isqrt 2790.19/iter +/- 53.43 num::int_sqrt::non_zero_u32::isqrt 33613.99/iter +/- 362.96 num::int_sqrt::non_zero_u64::isqrt 46235.42/iter +/- 429.69 num::int_sqrt::non_zero_u8::isqrt 31.78/iter +/- 0.75 num::int_sqrt::non_zero_usize::isqrt 46208.75/iter +/- 375.27 num::int_sqrt::u128::isqrt 106385.94/iter +/- 1649.95 num::int_sqrt::u16::isqrt 2747.69/iter +/- 28.72 num::int_sqrt::u32::isqrt 33627.09/iter +/- 475.68 num::int_sqrt::u64::isqrt 46182.29/iter +/- 311.16 num::int_sqrt::u8::isqrt 33.10/iter +/- 0.30 num::int_sqrt::usize::isqrt 46165.00/iter +/- 388.41 </details> Tracking Issue for {u8,i8,...}::isqrt rust-lang#116226 try-job: test-various

…kingjubilee Rollup of 9 pull requests Successful merges: - rust-lang#120221 (Don't make statement nonterminals match pattern nonterminals) - rust-lang#127912 (std: make `thread::current` available in all `thread_local!` destructors) - rust-lang#128166 (Improved `checked_isqrt` and `isqrt` methods) - rust-lang#129123 (rustdoc-json: Add test for `Self` type) - rust-lang#129366 (linker: Synchronize native library search in rustc and linker) - rust-lang#129527 (Don't use `TyKind` in a lint) - rust-lang#129534 (Deny `wasm_c_abi` lint to nudge the last 25%) - rust-lang#129640 (Re-enable android tests/benches in alloc/core) - rust-lang#129675 (allow BufReader::peek to be called on unsized types) r? `@ghost` `@rustbot` modify labels: rollup

workingjubilee · 2024-08-29T08:50:40Z

library/core/tests/num/int_sqrt.rs

+                    for n in (0..=127)
+                        .chain(<$T>::MAX - 127..=<$T>::MAX)
+                        .chain((0..<$T>::MAX.count_ones()).map(|exponent| (1 << exponent) - 1))
+                        .chain((0..<$T>::MAX.count_ones()).map(|exponent| 1 << exponent))
+                    {
+                        isqrt_consistency_check(n);


this is the range.

Yeah, I meant specifically a range including negative values

ChaiTRex · 2024-08-29T13:40:50Z

@tgross35 @workingjubilee There are several places where isqrt_consistency_check is called, such as six places in the isqrt_extended test. It basically checks i*::MIN and the negative version of every positive value that gets tested (there are thousands of positive values that get checked).

Do #[should_panic] tests run on platforms like Wasm where all panics abort? If they do, I could add a few #[should_panic] tests to also test on those platforms.

…llaumeGomez Rollup of 7 pull requests Successful merges: - rust-lang#123940 (debug-fmt-detail option) - rust-lang#128166 (Improved `checked_isqrt` and `isqrt` methods) - rust-lang#128970 (Add `-Zlint-llvm-ir`) - rust-lang#129316 (riscv64imac: allow shadow call stack sanitizer) - rust-lang#129690 (Add `needs-unwind` compiletest directive to `libtest-thread-limit` and replace some `Path` with `path` in `run-make`) - rust-lang#129732 (Add `unreachable_pub`, round 3) - rust-lang#129743 (Fix rustdoc clippy lints) r? `@ghost` `@rustbot` modify labels: rollup

ChaiTRex · 2024-08-29T15:42:55Z

nemo157 mentioned on the community Discord server that -Zpanic-abort-test can handle should_panic tests on platforms that abort during panic. Since abort-on-panic eliminates my catch_unwind checks, I'll add a few should_panic tests for negative inputs in my next pull request so that panicking is tested on those platforms.

tgross35 · 2024-08-29T17:48:20Z

@tgross35 @workingjubilee There are several places where isqrt_consistency_check is called, such as six places in the isqrt_extended test. It basically checks i*::MIN and the negative version of every positive value that gets tested (there are thousands of positive values that get checked).

I missed that it checked .wrapping_neg() too within that function, all good then.

Do #[should_panic] tests run on platforms like Wasm where all panics abort? If they do, I could add a few #[should_panic] tests to also test on those platforms.

[...]

nemo157 mentioned on the community Discord server that -Zpanic-abort-test can handle should_panic tests on platforms that abort during panic. Since abort-on-panic eliminates my catch_unwind checks, I'll add a few should_panic tests for negative inputs in my next pull request so that panicking is tested on those platforms.

We can pretty reasonably expect that if it panics on one platform and all other tests pass, it will continue on panic on others. So no worries here, I don't think you need a follow up.

…llaumeGomez Rollup of 7 pull requests Successful merges: - rust-lang#123940 (debug-fmt-detail option) - rust-lang#128166 (Improved `checked_isqrt` and `isqrt` methods) - rust-lang#128970 (Add `-Zlint-llvm-ir`) - rust-lang#129316 (riscv64imac: allow shadow call stack sanitizer) - rust-lang#129690 (Add `needs-unwind` compiletest directive to `libtest-thread-limit` and replace some `Path` with `path` in `run-make`) - rust-lang#129732 (Add `unreachable_pub`, round 3) - rust-lang#129743 (Fix rustdoc clippy lints) r? `@ghost` `@rustbot` modify labels: rollup

Rollup merge of rust-lang#128166 - ChaiTRex:isqrt, r=tgross35 Improved `checked_isqrt` and `isqrt` methods ### Improved tests of `isqrt` and `checked_isqrt` implementations * Inputs chosen more thoroughly and systematically. * Checks that `isqrt` and `checked_isqrt` have equivalent results for signed types, either equivalent numerically or equivalent as a panic and a `None`. * Checks that `isqrt` has numerically-equivalent results for unsigned types and their `NonZero` counterparts. ### Added benchmarks for `isqrt` implementations ### Greatly sped up `checked_isqrt` and `isqrt` methods * Uses a lookup table for 8-bit integers and then the Karatsuba square root algorithm for larger integers. * Includes optimization hints that give the compiler the exact numeric range of results. ### Feature tracking issue `isqrt` is an unstable feature tracked at rust-lang#116226. <details><summary>Benchmarked improvements</summary> ### Command used to benchmark ./x bench library/core -- int_sqrt ### Before benchmarks: num::int_sqrt::i128::isqrt 439591.65/iter +/- 6652.70 num::int_sqrt::i16::isqrt 5302.97/iter +/- 160.93 num::int_sqrt::i32::isqrt 62999.11/iter +/- 2022.05 num::int_sqrt::i64::isqrt 125248.81/iter +/- 1674.43 num::int_sqrt::i8::isqrt 123.56/iter +/- 1.87 num::int_sqrt::isize::isqrt 125356.56/iter +/- 1017.03 num::int_sqrt::non_zero_u128::isqrt 437443.75/iter +/- 3535.43 num::int_sqrt::non_zero_u16::isqrt 8604.58/iter +/- 94.76 num::int_sqrt::non_zero_u32::isqrt 62933.33/iter +/- 517.30 num::int_sqrt::non_zero_u64::isqrt 125076.38/iter +/- 11340.61 num::int_sqrt::non_zero_u8::isqrt 221.51/iter +/- 1.58 num::int_sqrt::non_zero_usize::isqrt 136005.21/iter +/- 2020.35 num::int_sqrt::u128::isqrt 439014.55/iter +/- 3920.45 num::int_sqrt::u16::isqrt 8575.08/iter +/- 148.06 num::int_sqrt::u32::isqrt 63008.89/iter +/- 803.67 num::int_sqrt::u64::isqrt 125088.09/iter +/- 879.29 num::int_sqrt::u8::isqrt 230.18/iter +/- 2.04 num::int_sqrt::usize::isqrt 125237.51/iter +/- 4747.83 ### After benchmarks: num::int_sqrt::i128::isqrt 105184.89/iter +/- 1171.38 num::int_sqrt::i16::isqrt 1910.26/iter +/- 78.50 num::int_sqrt::i32::isqrt 34260.34/iter +/- 960.84 num::int_sqrt::i64::isqrt 45939.19/iter +/- 2525.65 num::int_sqrt::i8::isqrt 22.87/iter +/- 0.45 num::int_sqrt::isize::isqrt 45884.17/iter +/- 595.49 num::int_sqrt::non_zero_u128::isqrt 106344.27/iter +/- 780.99 num::int_sqrt::non_zero_u16::isqrt 2790.19/iter +/- 53.43 num::int_sqrt::non_zero_u32::isqrt 33613.99/iter +/- 362.96 num::int_sqrt::non_zero_u64::isqrt 46235.42/iter +/- 429.69 num::int_sqrt::non_zero_u8::isqrt 31.78/iter +/- 0.75 num::int_sqrt::non_zero_usize::isqrt 46208.75/iter +/- 375.27 num::int_sqrt::u128::isqrt 106385.94/iter +/- 1649.95 num::int_sqrt::u16::isqrt 2747.69/iter +/- 28.72 num::int_sqrt::u32::isqrt 33627.09/iter +/- 475.68 num::int_sqrt::u64::isqrt 46182.29/iter +/- 311.16 num::int_sqrt::u8::isqrt 33.10/iter +/- 0.30 num::int_sqrt::usize::isqrt 46165.00/iter +/- 388.41 </details> Tracking Issue for {u8,i8,...}::isqrt rust-lang#116226 try-job: test-various

4f47132 Auto merge of rust-lang#129941 - BoxyUwU:bump-boostrap, r=albertlarsan68 fd0bc94 Adjust doc comment of Condvar::wait_while 2699de6 Rollup merge of rust-lang#129963 - rjooske:fix/inaccurate_to_string_lossy_doc, r=workingjubilee cde8145 Auto merge of rust-lang#129999 - matthiaskrgr:rollup-pzr9c8p, r=matthiaskrgr ab4b4f8 Rollup merge of rust-lang#129947 - LiterallyVoid:duration-docs-digit-separators, r=tgross35 3e7e6cd Rollup merge of rust-lang#129653 - RalfJung:addr-of-read-only, r=scottmcm e51a0bc Rollup merge of rust-lang#129938 - chancancode:patch-1, r=thomcc 349f8d5 update cfgs 181dc26 Rollup merge of rust-lang#129919 - kevinmehall:waker-getters, r=dtolnay 3d2a91f Rollup merge of rust-lang#127021 - thesummer:1-add-target-support-for-rtems-arm-xilinx-zedboard, r=tgross35 25891c8 Rollup merge of rust-lang#101339 - the8472:ci-randomize-debug, r=Mark-Simulacrum eb47468 fix: correct {Path,OsStr}::to_string_lossy() docs 7697231 docs: add digit separators in `Duration` examples 9ed92df replace placeholder version 00e12f7 Update marker.rs 5de059f Update marker.rs 72e79f0 Update marker.rs 870dfed Update marker.rs de72cd3 Elaborate on deriving vs implementing `Copy` fee6300 More robust extension checking ae90e45 Port std library to RTEMS c313c07 Rollup merge of rust-lang#129916 - tshepang:basic-usage, r=ChrisDenton c501959 Rollup merge of rust-lang#129913 - saethlin:l4re-read-buf, r=Noratrieb 83524b9 Rollup merge of rust-lang#129885 - cuishuang:master, r=scottmcm e41afdc Rollup merge of rust-lang#129800 - ChrisDenton:remove-dir-all2, r=Amanieu 851f5b6 Add `Waker::new` and `LocalWaker::new` a2b8bb8 Stabilize waker_getters 2ec266b Move the `data` and `vtable` methods from `RawWaker` to `Waker` 562fdce process.rs: remove "Basic usage" text where not useful 9b3c3fe Rollup merge of rust-lang#129907 - saethlin:solid-io-error, r=WaffleLapkin 02ceceb Rollup merge of rust-lang#129892 - oskgo:clarify-slice-from-raw, r=RalfJung ccc294c Rollup merge of rust-lang#129890 - alex:patch-1, r=workingjubilee 6d0e687 Rollup merge of rust-lang#129856 - RalfJung:compiler_fence, r=thomcc 0ccc851 Rollup merge of rust-lang#129748 - RalfJung:box-validity, r=workingjubilee 3761849 Add missing read_buf stub for x86_64-unknown-l5re-uclibc 3b8ab5a Fix compile error in solid's remove_dir_all e14b9f3 clarify language around non-null ptrs in slice::raw 9a76abd Remove stray word in a comment 1dd630f Auto merge of rust-lang#129873 - matthiaskrgr:rollup-bv849ud, r=matthiaskrgr 2649882 chore: remove repetitive words 7fd784e Rollup merge of rust-lang#129804 - ranger-ross:fixed-documentation-typos, r=Noratrieb e4e9f6b Rollup merge of rust-lang#129793 - lolbinarycat:doc-missing-newlines, r=workingjubilee c4aa66a Auto merge of rust-lang#129063 - the8472:cold-opt-size, r=Amanieu 4e3dbee add extra linebreaks so rustdoc can identify the first sentence e00784f compiler_fence documentation: emphasize synchronization, not reordering 8d8dbe9 tweak wording regarding Box validity 065844b Auto merge of rust-lang#127897 - nyurik:add-qnx-70-target, r=saethlin 759399b Rollup merge of rust-lang#129832 - eduardosm:stray-dot, r=jhpratt 60f37e4 Rollup merge of rust-lang#129207 - GrigorenkoPV:elided-is-named, r=cjgillot 68e6537 Rollup merge of rust-lang#128641 - Konippi:standardize-duplicate-processes-in-parser, r=scottmcm b93e3ab Rollup merge of rust-lang#128495 - joboet:more_memcmp, r=scottmcm 64c1db2 when -Zrandomize-layout is enabled disable alloc test testing internal struct sizes d432698 Auto merge of rust-lang#129831 - matthiaskrgr:rollup-befq6zx, r=matthiaskrgr 77cf0ba Remove stray dot in `std::char::from_u32_unchecked` documentation ef033b0 Rollup merge of rust-lang#129826 - Alcaro:patch-1, r=workingjubilee 2ad03e0 Rollup merge of rust-lang#129650 - Zalathar:profiler-builtins, r=Mark-Simulacrum c33b3df Update mod.rs 24ed1c1 Rollup merge of rust-lang#129785 - RalfJung:miri-sync, r=RalfJung 50681ab Rollup merge of rust-lang#129730 - RalfJung:float-arithmetic, r=workingjubilee 0402bb1 Fix `elided_named_lifetimes` in code 667d060 Move remove_dir_all impl into a module ae18edf Rollup merge of rust-lang#129754 - alexcrichton:fix-wasi-long-sleep, r=workingjubilee 9138bd1 Rollup merge of rust-lang#129675 - lolbinarycat:bufreader_peek_unsized, r=workingjubilee 83cadd0 Rollup merge of rust-lang#129642 - workingjubilee:bump-backtrace-fc37b22, r=workingjubilee d9af971 Rollup merge of rust-lang#129640 - saethlin:unignore-android-in-alloc, r=tgross35 6b12a63 Fixed more typos in library/core 40f9251 Fixed typos in btree map docs 628be3d Fixed some typos in the standard library documentation/comments 21e893e enumerate the two parts of the NaN rules 081353c add hyphen in floating-point c664843 Squashed `aarch64_unknown_nto_qnx700` support 5c4c81a Merge from rustc a374647 Try latest backtrace 2c75dd8 wasi: Fix sleeping for `Duration::MAX` 374229a Rollup merge of rust-lang#128166 - ChaiTRex:isqrt, r=tgross35 f0dce76 Rollup merge of rust-lang#123940 - kornelski:remove-derived-debug, r=Urgau 228ec9e Box validity: update for new zero-sized rules 93a72da f32 docs: define 'arithmetic' operations 1bc188f Merge from rustc c44af61 Speed up `checked_isqrt` and `isqrt` methods 2139651 Improve `isqrt` tests and add benchmarks 7c1560f Rollup merge of rust-lang#129715 - Amjad50:update-compiler-builtins, r=tgross35 d2a001d Rollup merge of rust-lang#129683 - RalfJung:copysign, r=thomcc 8753a35 Rollup merge of rust-lang#129673 - matthewpipie:arc-weak-debug-trait, r=dtolnay 3e1f63a Rollup merge of rust-lang#129401 - workingjubilee:partial-initialization-of-stabilization, r=dtolnay,joboet cd59153 Rollup merge of rust-lang#129378 - goffrie:patch-3, r=ChrisDenton 6d31b6d Rollup merge of rust-lang#128192 - mrkajetanp:feature-detect, r=Amanieu 60fd9c9 Update `compiler_builtins` to `0.1.123` 86c924f fmt-debug option 8623fa4 allow BufReader::peek to be called on unsized types b1a56b5 Auto merge of rust-lang#129691 - matthiaskrgr:rollup-owlcr3m, r=matthiaskrgr 39ad6a9 Rollup merge of rust-lang#129668 - coolreader18:fix-pin-set-regr, r=dtolnay 5a4fe40 Rollup merge of rust-lang#129657 - jswrenn:transmute-name, r=compiler-errors 1d44fab Rollup merge of rust-lang#129551 - RalfJung:ub-checks-fallback, r=saethlin b0fee98 Rollup merge of rust-lang#129480 - lolbinarycat:euclid-docs, r=joboet c8d3265 Enable some ilog2 tests as well da08ef4 Re-enable android tests/benches in alloc bfbe13e Auto merge of rust-lang#129589 - saethlin:improve-panic-immediate-abort, r=tgross35 89021c8 copysign with sign being a NaN is non-portable ed66a11 addr_of on places derived from raw pointers should preserve permissions 1729898 Add fmt::Debug to sync::Weak<T, A> 927a6da Fix Pin::set bounds regression 9876bd1 library: Stabilize new_uninit for Box, Rc, and Arc a0ea69f Rollup merge of rust-lang#129652 - RalfJung:ptr-to-ref, r=traviscross c7cbb41 Rollup merge of rust-lang#129645 - beetrees:fix-float-docs, r=tgross35 04eabb5 Rollup merge of rust-lang#129581 - RalfJung:exit, r=joshtriplett 33e2d7e safe transmute: Rename `BikeshedIntrinsicFrom` to `TransmuteFrom` 72c676f Auto merge of rust-lang#128134 - joboet:move_pal_alloc, r=cupiver c108af0 fix Pointer to reference conversion docs 1933103 clarify that addr_of creates read-only pointers 4f6b814 rustc_target: Add SME aarch64 features 012bb44 rustc_target: Add various aarch64 features 4dc5b67 std: move allocators to `sys` 264fa88 Don't skip nonexistent source files 5298b52 Add `cargo::rerun-if-changed` directives for source directories 5defa79 Always include `WindowsMMap.c` in the list of source files 91d2ecf Sort the list of source files cb468d7 Remove `InstrProfilingBiasVar.c` from the list of source files 0e0134f Use helper functions to read environment variables a628540 Rollup merge of rust-lang#129559 - RalfJung:float-nan-semantics, r=thomcc 00c8f98 Rollup merge of rust-lang#128731 - RalfJung:simd-shuffle-vector, r=workingjubilee 6d3344f Update old comment referring to `libcompiler_builtins` 6d8a1f6 Reflow a couple of paragraphs in floating-point primitive docs 8834d35 Fix typos in floating-point primitive type docs 54c986a Bump backtrace to rust-lang/backtrace@fc37b22 932cbd4 Rollup merge of rust-lang#129032 - jswrenn:transmute-method, r=compiler-errors 28a983d Rollup merge of rust-lang#128157 - lolbinarycat:unify-ptr-ref-docs, r=cuviper d1e21bd Apply suggestions from code review febaf22 Rollup merge of rust-lang#129592 - saethlin:core-cfg-test, r=tgross35 77a1318 Rollup merge of rust-lang#129588 - hermit-os:sleep-micros, r=workingjubilee 12fe23b Rollup merge of rust-lang#129539 - oconnor663:poll_link, r=tgross35 864e465 Rollup merge of rust-lang#129377 - chorman0773:unbounded-shifts-impl, r=scottmcm 07cfc6a also update copysign docs acaef60 move per-target NaN info into a table 854ba7e float types: document NaN bit pattern guarantees d958260 Auto merge of rust-lang#129595 - matthiaskrgr:rollup-4udn7nn, r=matthiaskrgr 8dd3363 Remove cfg(test) from library/core cd554e2 Rollup merge of rust-lang#129544 - mu001999-contrib:dead-code/clean, r=compiler-errors ff769ee Rollup merge of rust-lang#129525 - notriddle:notriddle/fake-variadic-tuple-array, r=GuillaumeGomez 4d22c1c Auto merge of rust-lang#129488 - saethlin:alignment-precondition, r=workingjubilee c688def pal/hermit: saturate `usleep` microseconds at `u64::MAX` 8ea71ae Auto merge of rust-lang#129563 - matthiaskrgr:rollup-t6bai2d, r=matthiaskrgr 46eff20 Tweak some attributes to improve panic_immediate_abort fdb5fc1 pal/hermit: correctly round up microseconds in `Thread::sleep` b392703 exit: explain our expectations for the exit handlers registered in a Rust program 22ec897 link to Future::poll from the Poll docs a994fbb Rollup merge of rust-lang#129487 - GrigorenkoPV:repr_transparent_external_private_fields, r=compiler-errors 3a33922 Rollup merge of rust-lang#129416 - workingjubilee:partial-move-from-stabilization, r=dtolnay 3a8de95 Rollup merge of rust-lang#129091 - RalfJung:box_as_ptr, r=Amanieu 4de4deb Auto merge of rust-lang#129295 - Zalathar:profiler-builtins, r=Kobzol 0872cf3 ub_checks intrinsics: fall back to cfg(ub_checks) 8dafd33 Auto merge of rust-lang#129521 - matthiaskrgr:rollup-uigv77m, r=matthiaskrgr d9e489b Removes dead code from the compiler c14cf57 Rollup merge of rust-lang#129481 - scottmcm:update-cb, r=tgross35 acf6f03 Rollup merge of rust-lang#129449 - coolreader18:pin-as_deref_mut-signature, r=dtolnay 112ebc4 Rollup merge of rust-lang#128735 - jieyouxu:pr-120176-revive, r=cjgillot 49aa496 rustdoc: clean up tuple <-> primitive conversion docs 0fe3746 Rollup merge of rust-lang#129501 - RalfJung:miri-rust-backtrace, r=Noratrieb 7d5cf38 Rollup merge of rust-lang#129500 - fee1-dead-contrib:fxrel, r=compiler-errors e91d825 Rollup merge of rust-lang#129323 - Urgau:ptr_fn_addr_eq, r=Mark-Simulacrum f647079 Rollup merge of rust-lang#128596 - RalfJung:const_fn_floating_point_arithmetic, r=nnethercote f965950 New `#[rustc_pub_transparent]` attribute a6ea125 panicking: improve hint for Miri's RUST_BACKTRACE behavior a437005 Build `library/profiler_builtins` from `ci-llvm` if appropriate 693477a remove invalid `TyCompat` relation for effects 82fc74f library: Move unstable API of new_uninit to new features 3ee2e18 Enable Alignment::new_unchecked precondition check 0803686 Change `f16` doctests in core to run on x86-64 linux 9359a12 Update `compiler_builtins` to `0.1.121` da02e8b Enable `f16` tests on x86 and x86-64 f3a198e docs: correct panic conditions for rem_euclid and similar functions 976fb4a Move into_inner_unchecked back to the bottom of the impl block 2741e8d Put Pin::as_deref_mut in impl Pin<Ptr> 88790f8 document & impl the transmutation modeled by `BikeshedIntrinsicFrom` f670207 Auto merge of rust-lang#129464 - GuillaumeGomez:rollup-ckfqd7h, r=GuillaumeGomez 5bf661c Rollup merge of rust-lang#129276 - eduardosm:stabilize-char_indices_offset, r=Amanieu e2614f2 Rollup merge of rust-lang#129400 - Amjad50:update-compiler-builtins, r=tgross35 2c06146 Rollup merge of rust-lang#127623 - lolbinarycat:fix_remove_dir_all, r=Amanieu eb747e5 Check that `library/profiler_builtins` actually found some source files eae7987 fix typos in new pointer conversion docs fe33d2c fix: fs::remove_dir_all: treat ENOENT as success 3fd591e feat(core): Make `unbounded_shl{l,r}` unstably const and remove `rustc_allow_const_fn_unstable` 2168ce3 Auto merge of rust-lang#129398 - matthiaskrgr:rollup-50l01ry, r=matthiaskrgr 12944c7 Update `compiler_builtins` to `0.1.120` 7496478 stabilize const_fn_floating_point_arithmetic 6f534f9 Rollup merge of rust-lang#129382 - tgross35:once-cell-const-into-inner, r=Noratrieb 2535017 Rollup merge of rust-lang#129376 - ChaiTRex:assert_unsafe_precondition_check_language_ub, r=workingjubilee,the8472 4ec19af Rollup merge of rust-lang#129374 - ChaiTRex:digit_unchecked_assert_unsafe_precondition, r=scottmcm 024ec3c Rollup merge of rust-lang#128432 - g0djan:godjan/wasi_prohibit_implicit_unsafe, r=tgross35 f671c11 Auto merge of rust-lang#129365 - matthiaskrgr:rollup-ebwx6ya, r=matthiaskrgr 5299ef1 fix(core): Use correct operations/values in `unbounded_shr` doctests 8423006 chore: `x fmt` cbe7338 fix(core): Add `#![feature(unbounded_shifts)]` to doctests for `unbounded_shr`/`unbounded_shl` 863123b Add `const_cell_into_inner` to `OnceCell` b51f35e format 6fd5393 chore: `x fmt` and hopefully fix the tidy issue e99c681 Clean up cfg-gating of ProcessPrng extern 9d2bb97 Change `assert_unsafe_precondition` docs to refer to `check_language_ub` 32bd5df chore: Also format the control flow 5f8cf71 Manually format functions and use `rhs` instead of `v` from my CE testing 700af56 feat(core): Add implementations for `unbounded_shl`/`unbounded_shr` a9ad57e Use `assert_unsafe_precondition!` in `AsciiChar::digit_unchecked` 77bd65f Rollup merge of rust-lang#129321 - krtab:float_sum, r=workingjubilee cc21978 Rollup merge of rust-lang#129232 - ivmarkov:master, r=workingjubilee c9cf844 Rollup merge of rust-lang#127945 - tgross35:debug-more-non-exhaustive, r=Noratrieb d37ebfe Rollup merge of rust-lang#129332 - cuviper:cstr-cast, r=compiler-errors 6d01ed8 Rollup merge of rust-lang#129312 - tbu-:pr_str_not_impl_error, r=Noratrieb 93319c8 Fix stability attribute of `impl !Error for &str` 7f8bdd5 Auto merge of rust-lang#126556 - saethlin:layout-precondition, r=joboet 9e9141f Auto merge of rust-lang#128866 - scottmcm:update-stdarch, r=tgross35 d47cfba Update stdarch submodule b507a8b Try to golf down the amount of code in Layout 32b574e Avoid extra `cast()`s after `CStr::as_ptr()` 9d4113f Rollup merge of rust-lang#129294 - scottmcm:stabilize-repeat-n, r=Noratrieb 62d240d Implement `ptr::fn_addr_eq` 529e33a Change neutral element of <fNN as iter::Sum> to neg_zero 126935f Stabilize `iter::repeat_n` 91439ce Auto merge of rust-lang#129226 - RalfJung:libc, r=Mark-Simulacrum bef7be0 Add a precondition check for Layout::from_size_align_unchecked a55ab85 Stabilize feature `char_indices_offset` 7f45dcf library: bump libc dependency ebe99f3 Rollup merge of rust-lang#128902 - evanj:evan.jones/env-var-doc, r=workingjubilee 8bdd95b soft-deprecate the addr_of macros 23b0aad code review improvements 0b0dad4 Fix for issue rust-lang#129212 for the ESP-IDF bd7aa57 Auto merge of rust-lang#126877 - GrigorenkoPV:clone_to_uninit, r=dtolnay d3c08f8 Auto merge of rust-lang#128598 - RalfJung:float-comments, r=workingjubilee dc5fed5 Auto merge of rust-lang#106943 - mina86:exact_size_take_repeat, r=dtolnay 88927ac Auto merge of rust-lang#116528 - daxpedda:stabilize-ready-into-inner, r=dtolnay 9952947 Rollup merge of rust-lang#129161 - dtolnay:spawnunck, r=Noratrieb db3abec Rollup merge of rust-lang#129086 - slanterns:is_none_or, r=dtolnay 44a558d Stabilize std::thread::Builder::spawn_unchecked 5c553c4 float to/from bits and classify: update comments regarding non-conformant hardware 9704e2d Rollup merge of rust-lang#128064 - ijackson:noop-waker-doc, r=workingjubilee 0497f0c Add cautionary paragraph about noop wakers. 16dd426 Rollup merge of rust-lang#128946 - orlp:faster-ip-hash, r=joboet 383c4db Rollup merge of rust-lang#128925 - dingxiangfei2009:smart-ptr-helper-attr, r=compiler-errors ba3a942 Rollup merge of rust-lang#125970 - RalfJung:before_exec, r=m-ou-se 32a71bb size-optimize some of the panic dependencies d7b85f2 apply #[optimize(size)] to #[cold] ones and part of the panick machinery 0dbf8cf Rollup merge of rust-lang#128954 - zachs18:fromresidual-no-default, r=scottmcm 4f09599 Rollup merge of rust-lang#128570 - folkertdev:stabilize-asm-const, r=Amanieu b6c9e44 add Box::as_ptr and Box::as_mut_ptr methods 23d1309 CommandExt::before_exec: deprecate safety in edition 2024 9858d49 stabilize `is_none_or` fd2b339 Auto merge of rust-lang#129060 - matthiaskrgr:rollup-s72gpif, r=matthiaskrgr 3b8aab7 Rollup merge of rust-lang#129001 - cblh:fix/128713, r=Noratrieb 16edf69 Rollup merge of rust-lang#128873 - ChrisDenton:windows-targets, r=Mark-Simulacrum 0199b00 Rollup merge of rust-lang#128759 - notriddle:notriddle/spec-to-string, r=workingjubilee,compiler-errors c6dc243 stabilize `asm_const` b4bfc21 Rollup merge of rust-lang#129034 - henryksloan:coroutine-must-use, r=joboet b56fdcb Rollup merge of rust-lang#127857 - tbu-:pr_deprecated_safe_todo, r=petrochenkov 77f462d Rollup merge of rust-lang#122884 - mzabaluev:pow-remove-exit-branch, r=Amanieu 0a6a74b Reduce merged doctest source code size a83dde6 Mark location doctest as standalone since file information will not work in merged doctest file 7334c71 Auto merge of rust-lang#129046 - matthiaskrgr:rollup-9x4xgak, r=matthiaskrgr 9ed7210 Rollup merge of rust-lang#128745 - dtolnay:spawnunchecked, r=workingjubilee c39d90e Rollup merge of rust-lang#128655 - joboet:play_with_the_dice, r=ChrisDenton f81c96a `#[deprecated_safe_2024]`: Also use the `// TODO:` hint in the compiler error 23a1968 Allow to customize `// TODO:` comment for deprecated safe autofix 37017c0 Auto merge of rust-lang#128962 - devnexen:fs_get_mode_haiku, r=workingjubilee 6ad03a7 simd_shuffle intrinsic: allow argument to be passed as vector (not just as array) 8a2671a Revert to original loop for const pow exponents c5e8189 Auto merge of rust-lang#128742 - RalfJung:miri-vtable-uniqueness, r=saethlin ac682f1 Add must_use attribute to Coroutine trait 658904d chore(lib): fmt core::fmt::Formatter's write_fmt method 7eb7376 trying common codepath for every unixes 5fabf93 std::fs: get_mode implementation for haiku. e3da824 Rollup merge of rust-lang#129017 - its-the-shrimp:core_fmt_from_fn, r=Noratrieb b247d9a derive(SmartPointer): register helper attributes aa85448 Explicitly specify type parameter on FromResidual impls in stdlib. 262a4f6 std::fmt::FormatterFn -> std::fmt::FromFn ceceae3 Rollup merge of rust-lang#128632 - joboet:dont_overwrite_style, r=Amanieu e8f7afe Rollup merge of rust-lang#128149 - RalfJung:nontemporal_store, r=jieyouxu,Amanieu,Jubilee 7dd2083 chore(lib): Enhance documentation for core::fmt::Formatter's write_fmt method 048efd0 ignore some vtable/fn ptr equality tests in Miri, their result is not fully predictable a367a12 std: use `/scheme/rand` on Redox 4b816b4 core: make documentation clearer, rename slice comparison specialization trait 1ca6b42 std: do not overwrite style in `get_backtrace_style` 9147777 Auto merge of rust-lang#128862 - cblh:fix/128855, r=scottmcm 56e1afe Auto merge of rust-lang#126793 - saethlin:mono-rawvec, r=scottmcm ec7a585 Do not use unnecessary endian conversion. f48facf Rollup merge of rust-lang#128882 - RalfJung:local-waker-will-wake, r=cuviper b581949 Rollup merge of rust-lang#120314 - mina86:i, r=Mark-Simulacrum 451feca Fix stability annotation and expand comment 2e34ac3 Hash Ipv*Addr as an integer b8b61e1 Auto merge of rust-lang#128927 - GuillaumeGomez:rollup-ei2lr0f, r=GuillaumeGomez 44f5b4f Rollup merge of rust-lang#128273 - Voultapher:improve-ord-violation-help, r=workingjubilee 3d7afa0 Update std and compiler 971df1c Stabilize `min_exhaustive_patterns` c37c666 Add an optimizer hint for the capacity that with_capacity_in returns c8cbd5c Hoist IS_ZST check out of RawVecInner::from_*_in e843f71 Polymorphize RawVec dc39cbf core: optimise Debug impl for ascii::Char 9668691 doc: std::env::var: Returns None for names with '=' or NUL byte 5d5d8bc Rollup merge of rust-lang#128859 - MinxuanZ:mips-sig, r=Amanieu 825def0 Rollup merge of rust-lang#128817 - biabbas:vxworks_update, r=tgross35 6e933a8 make LocalWaker::will_wake consistent with Waker::will_wake 118c712 Fix linkchecker issue b1460b9 Exclude windows-targets from the workspace a3a6a98 Add windows-targets crate to std's sysroot f74940d Rollup merge of rust-lang#128824 - GuillaumeGomez:update-compiler-builtins, r=Amanieu 39b1eaf VxWorks: Add safety comment for vxCpuEnabledGet 8b0a25d fix: Ensure `Guard`'s `drop` method is removed at `opt-level=s` for `Copy` types c54958c delete space dadbd58 fix format 7c34ebf [SPARC] fix the name of signal 19 in sparc arch b75648a [MIPS] fix the name of signal 19 in mips 3840b09 Rollup merge of rust-lang#128818 - RalfJung:std-miri-floats, r=tgross35 d03bb5e Rollup merge of rust-lang#128640 - RalfJung:rwlock-macos-miri, r=joboet 7680a3c Rollup merge of rust-lang#128749 - tgross35:float-inline, r=scottmcm 9df61ad Rollup merge of rust-lang#128306 - WiktorPrzetacznik:WiktorPrzetacznik-nonnull-alignoffset-update, r=Amanieu 39860ad Update compiler-builtins version to 0.1.118 4281185 std float tests: special-case Miri in feature detection 4d6b36a Vxworks: Extern taskNameSet and fix build errors e24a6ca rwlock: disable 'frob' test in Miri on macOS c21ba97 Fix VxWorks available parallelism: Move nonzero::uncheked into unsafe block 2495418 Rollup merge of rust-lang#128800 - clarfonthey:core-pattern-type, r=compiler-errors 79cd72a Rollup merge of rust-lang#128691 - tgross35:update-builtins, r=Amanieu 8f84015 Add tracking issue to core-pattern-type b8f7f38 Stabilize `Ready::into_inner()` 62ccdeb Rollup merge of rust-lang#128261 - clarfonthey:iter-default, r=dtolnay b4e5330 alloc: make `to_string_str!` a bit less complex ec74467 Mark `{f32,f64}::{next_up,next_down,midpoint}` inline b90a026 Rollup merge of rust-lang#128766 - Monadic-Cat:patch-1, r=tgross35 5d7906c Rollup merge of rust-lang#128417 - tgross35:f16-f128-math, r=dtolnay 83d1d16 Trivial grammar fix in const keyword docs 97384fa Update `compiler-builtins` to 0.1.117 6dc79bb Rollup merge of rust-lang#128751 - devnexen:vxworks_set_thread_name, r=tgross35 432425d Rollup merge of rust-lang#128539 - biabbas:deny_unsafe, r=workingjubilee 1bd5338 Rollup merge of rust-lang#128406 - lolbinarycat:bufreader_peek, r=Mark-Simulacrum e20aa64 Rollup merge of rust-lang#125048 - dingxiangfei2009:stable-deref, r=amanieu bc13c6c alloc: add ToString specialization for `&&str` 14fe723 std::thread: set_name implementation proposal for vxWorks. 67fa603 Remove unused lifetime parameter from spawn_unchecked 4a3da12 Add a special case for CStr/CString in the improper_ctypes lint 51ec2bb implement BufReader::peek e6aede2 nontemporal_store: make sure that the intrinsic is truly just a hint a300df7 WASI fixing unsafe_op_in_unsafe_fn for std::{os, sys} 59436fc std: refactor UNIX random data generation 8fe1e32 refactor: standardize duplicate processes in parser 6fafc6b Apply review comments to PartialOrd section 7850a64 Forbid unsafe_op_in_unsafe_fn in vxworks specific os and sys files e844eff Add a disclaimer about x86 `f128` math functions 21d297b Update comments for `{f16, f32, f64, f128}::midpoint` ad27d08 Add `core` functions for `f16` and `f128` that require math routines c6407b0 Add math functions for `f16` and `f128` d9b1de5 Add math intrinsics for `f16` and `f128` 3c1586b Hide internal sort module b927541 core: use `compare_bytes` for more slice element types 2188712 Apply review comments 2ebe00a PinCoerceUnsized trait into core 569ab6a CloneToUninit: use a private specialization trait 26874cc Sparkle some attributes over `CloneToUninit` stuff e8c3718 impl CloneToUninit for Path and OsStr ef8c591 impl CloneToUninit for str and CStr 65c6173 Update NonNull::align_offset quarantees b014b0d Improve panic sections for sort*, sort_unstable* and select_nth_unstable* 9bcfe84 Improve panic message and surrounding documentation for Ord violations 7e55abb Okay, I guess I have to give these a different feature name bdc18e2 impl Default for collection iterators that don't already have it f26f981 clarify interactions with MaybeUninit and UnsafeCell 394c864 remove duplicate explanations of the ptr to ref conversion rules 571348b create a new section on pointer to reference conversion 971aa37 LocalWaker docs: Make long-ago omitted but probably intended changes c4fdac9 Docs for Waker and LocalWaker: Add cross-refs in comment 9c299bc Implement `debug_more_non_exhaustive` b405024 Make use of raw strings in `core::fmt::builders` 20e64bd Use is_val_statically_known to optimize pow 05ee322 Explicitly unroll integer pow for small exponents 4cfe24a Optimize integer pow by removing exit branch 7c219da Implement DoubleEnded and ExactSize for Take<Repeat> and Take<RepeatWith> git-subtree-dir: library git-subtree-split: 4f47132

rustbot assigned tgross35 Jul 25, 2024

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Jul 25, 2024

This comment has been minimized.

Sign in to view

ChaiTRex force-pushed the isqrt branch from eb17af7 to fea585d Compare July 25, 2024 00:34

tgross35 reviewed Jul 25, 2024

View reviewed changes

tgross35 reviewed Jul 26, 2024

View reviewed changes

library/core/src/num/int_sqrt.rs Outdated Show resolved Hide resolved

tgross35 reviewed Jul 26, 2024

View reviewed changes

library/core/tests/num/int_macros.rs Outdated Show resolved Hide resolved

tgross35 reviewed Jul 26, 2024

View reviewed changes

library/core/tests/num/int_sqrt.rs Show resolved Hide resolved

rustbot added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Jul 26, 2024

ChaiTRex force-pushed the isqrt branch from 82389f4 to f4bbacc Compare August 1, 2024 06:47

This comment was marked as outdated.

Sign in to view

ChaiTRex force-pushed the isqrt branch from a9292f7 to 05941db Compare August 11, 2024 22:19

tgross35 reviewed Aug 12, 2024

View reviewed changes

rustbot added the has-merge-commits PR has merge commits, merge with caution. label Aug 19, 2024

This comment was marked as outdated.

Sign in to view

ChaiTRex force-pushed the isqrt branch from d61b4dd to d376ac8 Compare August 19, 2024 23:20

rustbot removed the has-merge-commits PR has merge commits, merge with caution. label Aug 19, 2024

rustbot removed the S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. label Aug 20, 2024

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Aug 29, 2024

workingjubilee mentioned this pull request Aug 29, 2024

Rollup of 6 pull requests #129728

Closed

workingjubilee mentioned this pull request Aug 29, 2024

Rollup of 9 pull requests #129729

Closed

workingjubilee reviewed Aug 29, 2024

View reviewed changes

GuillaumeGomez mentioned this pull request Aug 29, 2024

Rollup of 7 pull requests #129750

Merged

bors merged commit 4b08b2e into rust-lang:master Aug 29, 2024
7 checks passed

rustbot added this to the 1.82.0 milestone Aug 29, 2024

This was referenced Oct 6, 2024

FCP WIP ChaiTRex/rust#1

Closed

Tracking Issue for {u8,i8,...}::isqrt #116226

Closed

Improved checked_isqrt and isqrt methods #128166

Improved checked_isqrt and isqrt methods #128166

Uh oh!

Conversation

ChaiTRex commented Jul 25, 2024 • edited by tgross35 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Improved tests of isqrt and checked_isqrt implementations

Added benchmarks for isqrt implementations

Greatly sped up checked_isqrt and isqrt methods

Feature tracking issue

Command used to benchmark

Before

After

Uh oh!

rustbot commented Jul 25, 2024

Uh oh!

This comment has been minimized.

tgross35 commented Jul 25, 2024

Uh oh!

ChaiTRex commented Jul 25, 2024

Uh oh!

tgross35 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tgross35 commented Jul 25, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tgross35 commented Jul 26, 2024

Uh oh!

ChaiTRex commented Aug 1, 2024

Before this pull request

When I first made this pull request

With the changes I'm going to push soon

Uh oh!

This comment was marked as outdated.

ChaiTRex commented Aug 1, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tgross35 commented Aug 12, 2024

Uh oh!

This comment was marked as outdated.

ChaiTRex commented Aug 19, 2024

Uh oh!

ChaiTRex commented Aug 20, 2024

Uh oh!

bors commented Aug 29, 2024

Uh oh!

bors commented Aug 29, 2024

Uh oh!

tgross35 commented Aug 29, 2024

Uh oh!

bors commented Aug 29, 2024

Uh oh!

workingjubilee commented Aug 29, 2024

Uh oh!

tgross35 commented Aug 29, 2024

Uh oh!

workingjubilee commented Aug 29, 2024

Uh oh!

workingjubilee commented Aug 29, 2024

Uh oh!

tgross35 commented Aug 29, 2024

Improved `checked_isqrt` and `isqrt` methods #128166

Improved `checked_isqrt` and `isqrt` methods #128166

ChaiTRex commented Jul 25, 2024 •

edited by tgross35

Loading

Improved tests of `isqrt` and `checked_isqrt` implementations

Added benchmarks for `isqrt` implementations

Greatly sped up `checked_isqrt` and `isqrt` methods

tgross35 left a comment •

edited

Loading

ChaiTRex commented Aug 29, 2024 •

edited

Loading

ChaiTRex commented Aug 29, 2024 •

edited

Loading