BUG: <`uint64 dtype` is broken for `Max`> #770

Dhruvanshu-Joshi · 2024-05-14T17:21:01Z

Describe the issue:

The Max op which is a subclass of CAReduce fails for 64 bit unsigned integers. This is also evident in the PR 731 and its test. The exact number where uint64 starts failing for is 9223372036854775.
Error can also be reproduced with the following example:

Reproducable code example:

import pytensor.tensor as pt
import numpy as np

dtype="uint64"
n = pt.vector("n", shape=(None,), dtype=dtype)
test_n = np.array([0, 9223372036854775], dtype)

assert pt.max(n).dtype == dtype
print(pt.max(n).eval({n: test_n})) # 9223372036854776
print(test_n.max()) # 9223372036854775
assert pt.max(n).eval({n: test_n}) == test_n.max()  # Fails, returns 1 larger

Error message:

No response

PyTensor version information:

Python 3.11.8
Pytensor 2.20.0

Context for the issue:

No response

The text was updated successfully, but these errors were encountered:

aseyboldt · 2024-05-14T17:42:40Z

Consider me intrigued...
Sounds like somewhere there is a conversion to float64, because that exact number is one bigger than the largest integer a float64 can represent exactly:
np.float64(9223372036854775).astype(np.int64) is our magic 9223372036854776.

aseyboldt · 2024-05-14T18:12:41Z

This seems to have been around for a long time.
The maximum and minimum are implemented as CAReduction (pytensor.scalar.basic.ScalarMaximum). There seem to be two problems with this implementation: Both scalar Ops specify inf and -inf as identity element, but if we are working with integer types, those do not even have an identity.
But the direct source of the bug seems to be this generated C-Code:

        return f"{z} = (({y})>({x})? ({y}): " f'(({x})>=({y})? ({x}): nan("")));'

It tries to ignore nans, but by doing so it implicitly converts the intermediate values to floats.

I guess it might make sense to split the ScalarOps: One for floats (where we have an identity) and one for ints (where we don't). And then use the nan-check only for floats.

edit

This was introduced here: aesara-devs/aesara#297

ricardoV94 · 2024-05-14T19:19:35Z

The conversion story makes sense. Regarding infty, the code actually initializes those to zero when it's (u)integers.

The number thing was not a hard cutoff. It starts working with larger values and then fails again. In case that helps.

And yes this bug is likely there for a while but was hidden when the max/min could be constant folded, triggering the C implementation of MaxAndArgmax instead which just calls numpy C code and handles it correctly.

This is expected to fail, but has started to pass. I split out the example from the issue cited in the xfail to a new test, which fails as expected. However, the original `test_uint` (which isn't exactly the same as the example on Issue pymc-devs#770) is passing, while it is marked xfail.

I split this test up to test uint64 separately, since this is the case discussed in Issue pymc-devs#770. I also added a test for the exact example used in that issue. The uint dtypes with lower precision should pass. The uint64 case started passing for me locally on Mac OSX, but still fails on CI. I'm not sure why this is, but at least the test will be more specific now if it fails in the future.

I split this test up to test uint64 separately, since this is the case discussed in Issue #770. I also added a test for the exact example used in that issue. The uint dtypes with lower precision should pass. The uint64 case started passing for me locally on Mac OSX, but still fails on CI. I'm not sure why this is, but at least the test will be more specific now if it fails in the future.

I split this test up to test uint64 separately, since this is the case discussed in Issue pymc-devs#770. I also added a test for the exact example used in that issue. The uint dtypes with lower precision should pass. The uint64 case started passing for me locally on Mac OSX, but still fails on CI. I'm not sure why this is, but at least the test will be more specific now if it fails in the future.

Dhruvanshu-Joshi added the bug Something isn't working label May 14, 2024

Dhruvanshu-Joshi mentioned this issue May 16, 2024

Break MaxandArgmax Op to seperate TensorMax Op and Argmax Op #731

Merged

11 tasks

ricardoV94 added the C-backend label Jun 23, 2024

brendan-m-murphy mentioned this issue Feb 12, 2025

Make PyTensor compatible with numpy 2.0 #1194

Merged

11 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: <`uint64 dtype` is broken for `Max`> #770

BUG: <`uint64 dtype` is broken for `Max`> #770

Dhruvanshu-Joshi commented May 14, 2024

aseyboldt commented May 14, 2024

aseyboldt commented May 14, 2024 •

edited

Loading

ricardoV94 commented May 14, 2024 •

edited

Loading

BUG: <uint64 dtype is broken for Max> #770

BUG: <uint64 dtype is broken for Max> #770

Comments

Dhruvanshu-Joshi commented May 14, 2024

Describe the issue:

Reproducable code example:

Error message:

PyTensor version information:

Context for the issue:

aseyboldt commented May 14, 2024

aseyboldt commented May 14, 2024 • edited Loading

ricardoV94 commented May 14, 2024 • edited Loading

BUG: <`uint64 dtype` is broken for `Max`> #770

BUG: <`uint64 dtype` is broken for `Max`> #770

aseyboldt commented May 14, 2024 •

edited

Loading

ricardoV94 commented May 14, 2024 •

edited

Loading