<numeric> Optimize gcd to use builtins #665

miscco · 2020-03-31T20:29:37Z

Description

This fixes #298 by moving the core of the bitscan implementation into the <limits> header.

This is unfortunate, but <limits> is the lowest common denominator of <numeric> and <bit>.

We also cannot avoid <limits> anyway, because the implementation needs it.
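
For readers unfamiliar with why a bitscan helps gcd: the description does not spell out the algorithm, but the identifiers in the error log later in this thread (_Mx_trailing_zeroes, _Stl_bitscan_forward) suggest a binary (Stein's) gcd driven by trailing-zero counts. The following is only a minimal sketch under that assumption. The names trailing_zeros and binary_gcd are hypothetical, and the loop-based trailing_zeros is a portable stand-in for a hardware bitscan, not the code in this PR.

```cpp
#include <cassert>
#include <type_traits>

// Hypothetical stand-in for a bitscan builtin: counts trailing zero bits.
// Precondition: value != 0 (otherwise the loop would not terminate).
template <class T>
constexpr int trailing_zeros(T value) noexcept {
    static_assert(std::is_unsigned_v<T>, "unsigned types only");
    int count = 0;
    while ((value & T{1}) == 0) {
        value = static_cast<T>(value >> 1);
        ++count;
    }
    return count;
}

// Binary (Stein's) gcd on unsigned magnitudes.
template <class T>
constexpr T binary_gcd(T a, T b) noexcept {
    static_assert(std::is_unsigned_v<T>, "unsigned types only");
    if (a == 0) {
        return b;
    }
    if (b == 0) {
        return a;
    }
    const int a_zeros = trailing_zeros(a);
    const int b_zeros = trailing_zeros(b);
    // Factors of two shared by both inputs belong to the result.
    const int shared = a_zeros < b_zeros ? a_zeros : b_zeros;
    a = static_cast<T>(a >> a_zeros);
    b = static_cast<T>(b >> b_zeros);
    // Invariant: a and b are both odd here.
    while (a != b) {
        if (a < b) {
            const T tmp = a;
            a = b;
            b = tmp;
        }
        a = static_cast<T>(a - b);                   // odd - odd is even and nonzero
        a = static_cast<T>(a >> trailing_zeros(a));  // strip factors of two in one step
    }
    return static_cast<T>(a << shared);
}

static_assert(binary_gcd(48u, 18u) == 6u, "gcd(48, 18) is 6");
static_assert(binary_gcd(0u, 7u) == 7u, "gcd(0, n) is n");
```

The point of the trailing-zero count is that all factors of two are removed in a single shift instead of one division per iteration, which is where a builtin bitscan pays off.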

Checklist

Be sure you've read README.md and understand the scope of this repo.

If you're unsure about a box, leave it unchecked. A maintainer will help you.

Identifiers in product code changes are properly _Ugly as per
https://eel.is/c++draft/lex.name#3.1 or there are no product code changes.
The STL builds successfully and all tests have passed (must be manually
verified by an STL maintainer before automated testing is enabled on GitHub,
leave this unchecked for initial submission).
These changes introduce no known ABI breaks (adding members, renaming
members, adding virtual functions, changing whether a type is an aggregate
or trivially copyable, etc.).
These changes were written from scratch using only this repository,
the C++ Working Draft (including any cited standards), other WG21 papers
(excluding reference implementations outside of proposed standard wording),
and LWG issues as reference material. If they were derived from a project
that's already listed in NOTICE.txt, that's fine, but please mention it.
If they were derived from any other project (including Boost and libc++,
which are not yet listed in NOTICE.txt), you must mention it here,
so we can determine whether the license is compatible and what else needs
to be done.

BillyONeal · 2020-04-01T03:02:18Z

Test failures look legit on this one :)

miscco · 2020-04-01T06:09:35Z

Hm, it compiled for me; I will check again :(

miscco · 2020-04-01T06:25:05Z

Go home, test, you are drunk:

C:\agent_work\1\a\x86\out\inc\limits(1021,26): note: candidate template ignored: requirement '_Is_standard_unsigned_integer' was not satisfied [with _Ty = int]

So it seems that common_unsigned does funny things

miscco · 2020-04-01T06:35:30Z

The full error is:

In file included from C:\agent_work\1\s\llvm-project\libcxx\test\std\numerics\numeric.ops\numeric.ops.gcd\gcd.pass.cpp:16:
C:\agent_work\1\a\x86\out\inc\numeric(864,39): error: no matching function for call to '_Countl_zero'

return static_cast(_Countl_zero(~_Mask));
^~~~~~~~~~~~
C:\agent_work\1\a\x86\out\inc\numeric(898,39): note: in instantiation of function template specialization 'std::_Stl_bitscan_forward' requested here

const auto _Mx_trailing_zeroes = _Stl_bitscan_forward(_Mx_magnitude);

C:\agent_work\1\s\llvm-project\libcxx\test\std\numerics\numeric.ops\numeric.ops.gcd\gcd.pass.cpp(48,45): note: in instantiation of function template specialization 'std::gcd<signed char, signed char>' requested here
assert(static_cast<Output>(out) == std::gcd(value1, value2));
C:\agent_work\1\s\llvm-project\libcxx\test\std\numerics\numeric.ops\numeric.ops.gcd\gcd.pass.cpp(64,27): note: in instantiation of function template specialization 'test0<signed char, signed char, signed char>' requested here
        accumulate &= test0<S1, S2, Output>(TC.x, TC.y, TC.expect);

                      ^
C:\agent_work\1\s\llvm-project\libcxx\test\std\numerics\numeric.ops\numeric.ops.gcd\gcd.pass.cpp(100,19): note: in instantiation of function template specialization 'do_test<signed char, signed char>' requested here
static_assert(do_test<signed char>(), "");

              ^
C:\agent_work\1\a\x86\out\inc\limits(1021,26): note: candidate template ignored: requirement '_Is_standard_unsigned_integer' was not satisfied [with _Ty = int]

_NODISCARD constexpr int _Countl_zero(const _Ty _Val) noexcept {

So in the test, the type passed into make_unsigned_t to deduce _Common_unsigned is signed char. I guess that should make it an unsigned char, which is the type of _Mx_magnitude.

So the type passed to _Countl_zero is unsigned char; where does it get the int from?

miscco · 2020-04-01T12:05:30Z

It seems that, for whatever reason, template argument deduction fails. If I specify _Countl_zero<_Unsigned>, it works.

That, however, smells terribly like a compiler bug.

miscco · 2020-04-02T07:36:27Z

So the tests are good, although I would like an opinion on whether this is a compiler bug or just me not understanding C++.

The only thing I could come up with is that ~_Mask is changing the type, but please tell me it doesn't.

stl/inc/numeric

AlexGuteniev · 2020-04-02T15:54:11Z

that ~_Mask is changing the type

Why not?

https://eel.is/c++draft/expr.unary.op#10

Integral promotions are performed. The type of the result is the type of the promoted operand.

https://eel.is/c++draft/conv.prom#2

A prvalue of type char16_t, char32_t, or wchar_t ([basic.fundamental]) can be converted to a prvalue of the first of the following types that can represent all the values of its underlying type: int, unsigned int, long int, unsigned long int, long long int, or unsigned long long int.

("Can" here means the conversion exists as an integral promotion. The key verb is "are" in the sentence quoted above: promotions are performed.)
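
To make the promotion concrete, here is a small sketch of the failure mode discussed above. It is not the STL's code: count_leading_zeros and can_call are hypothetical names, and the enable_if constraint only approximates the _Is_standard_unsigned_integer requirement from the error message. It shows that ~ applied to an unsigned char yields an int, that deduction from that int is rejected by the constraint, and that casting back or naming the template argument explicitly avoids the problem.

```cpp
#include <cassert>
#include <limits>
#include <type_traits>
#include <utility>

// Hypothetical helper constrained to unsigned types (illustrative only).
template <class T, std::enable_if_t<std::is_unsigned_v<T>, int> = 0>
constexpr int count_leading_zeros(T value) noexcept {
    int count = 0;
    T bit = static_cast<T>(T{1} << (std::numeric_limits<T>::digits - 1));
    while (bit != 0 && (value & bit) == 0) {
        bit = static_cast<T>(bit >> 1);
        ++count;
    }
    return count;
}

// Detects whether count_leading_zeros can be called with an argument of type T.
template <class T, class = void>
struct can_call : std::false_type {};
template <class T>
struct can_call<T, std::void_t<decltype(count_leading_zeros(std::declval<T>()))>>
    : std::true_type {};

constexpr unsigned char mask = 0xF0;

// Integral promotion: the operand of ~ is promoted, so the result is int.
static_assert(std::is_same_v<decltype(~mask), int>, "~mask has type int");

// Deduction from an int argument picks T = int, which the constraint rejects.
static_assert(!can_call<decltype(~mask)>::value, "T = int is rejected");
static_assert(can_call<unsigned char>::value, "T = unsigned char is accepted");

// Two ways out: cast the result back, or name the template argument.
static_assert(count_leading_zeros(static_cast<unsigned char>(~mask)) == 4, "");
static_assert(count_leading_zeros<unsigned char>(~mask) == 4, "");
```

So the compiler behaves as specified: the int comes from the promotion performed by ~, not from a deduction bug.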

stl/inc/numeric

Fix tpyo

StephanTLavavej

Looks great, thank you! I tried my hardest, but the only issue I could find was a whitespace nitpick.

stl/inc/numeric

Co-Authored-By: Stephan T. Lavavej <stl@nuwen.net>

stl/inc/bit

CaseyCarter · 2020-04-11T02:10:16Z

Thanks for the performance enhancement! 🐎

miscco requested a review from a team as a code owner March 31, 2020 20:29

BillyONeal approved these changes Mar 31, 2020

miscco added 2 commits April 1, 2020 13:12

[numerics] Optimize gcd to use builtins

06e6d39

Fix template argument deduction for _countl_zero

d08ea4b

miscco force-pushed the gcd branch from 178616c to d08ea4b Compare April 1, 2020 12:04

CaseyCarter added the performance Must go faster label Apr 1, 2020

miscco closed this Apr 2, 2020

miscco reopened this Apr 2, 2020

miscco commented Apr 2, 2020

Use the same casts as the other bit operations

6a87c38

CaseyCarter suggested changes Apr 2, 2020

Update stl/inc/numeric

a269a90

Fix tpyo

CaseyCarter approved these changes Apr 2, 2020

StephanTLavavej approved these changes Apr 3, 2020

Remove whitespace

c8961dd

Co-Authored-By: Stephan T. Lavavej <stl@nuwen.net>

StephanTLavavej approved these changes Apr 3, 2020

CaseyCarter changed the title ~~[numerics] Optimize gcd to use builtins~~ <numeric> Optimize gcd to use builtins Apr 9, 2020

CaseyCarter self-assigned this Apr 9, 2020

_BitScanForward is countr_zero(x), not countl_zero(~x)

c833caa
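
The distinction named in this commit title can be checked on a small value. The sketch below uses hypothetical 8-bit loop helpers (trailing_zero_count and leading_zero_count) as stand-ins for countr_zero and countl_zero; it is an illustration, not the code from the commit. A forward bitscan reports the index of the lowest set bit, which equals the number of trailing zeros, whereas countl_zero(~x) counts the leading one bits of x.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical 8-bit stand-in for countr_zero: zeros below the lowest set bit.
constexpr int trailing_zero_count(std::uint8_t value) noexcept {
    int count = 0;
    for (unsigned bit = 1u; bit <= 0x80u && (value & bit) == 0; bit <<= 1) {
        ++count;
    }
    return count;
}

// Hypothetical 8-bit stand-in for countl_zero: zeros above the highest set bit.
constexpr int leading_zero_count(std::uint8_t value) noexcept {
    int count = 0;
    for (unsigned bit = 0x80u; bit != 0 && (value & bit) == 0; bit >>= 1) {
        ++count;
    }
    return count;
}

constexpr std::uint8_t sample = 0b0000'1000;
constexpr std::uint8_t inverted = static_cast<std::uint8_t>(~sample);  // 0b1111'0111

// Index of the lowest set bit of sample, which is what a forward bitscan reports.
static_assert(trailing_zero_count(sample) == 3, "lowest set bit is bit 3");

// Leading zeros of the complement count the leading ones of sample: a different quantity.
static_assert(leading_zero_count(inverted) == 0, "sample has no leading one bits");
```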

CaseyCarter reviewed Apr 11, 2020

StephanTLavavej approved these changes Apr 11, 2020

CaseyCarter merged commit bda4230 into microsoft:master Apr 11, 2020

StephanTLavavej mentioned this pull request May 13, 2020

<numeric>: Consider avoiding <limits> inclusion #832

Closed

miscco deleted the gcd branch June 3, 2020 13:31

CaseyCarter removed their assignment Jun 27, 2020
