Skip to content

Commit

Permalink
math/py-numba: update to 0.58.1
Browse files Browse the repository at this point in the history
This is the first version I tested with the re-vived py-llvmlite. This version
works with Pythons below 3.12 so far. Upstream changes since 0.55.2:


Version 0.58.1 (17 October 2023)

This is a maintenance release that adds support for NumPy 1.26 and fixes a bug.
NumPy Support
Support NumPy 1.26

Support for NumPy 1.26 is added.

(PR-#9227)
Bug Fixes
Fixed handling of float default arguments in inline closures

Float default arguments in inline closures would produce incorrect results since updates for Python 3.11 - these are now handled correctly again.

(PR-#9222)
Pull-Requests

    PR #9220: Support passing arbitrary flags to NVVM (gmarkall)

    PR #9227: Support NumPy 1.26 (PR aimed at review / merge) (Tialo gmarkall)

    PR #9228: Fix #9222 - Don’t replace . with _ in func arg names in inline closures (gmarkall)

Authors

    gmarkall

    Tialo


Version 0.58.0 (20 September 2023)

Table of Contents

    Version 0.58.0 (20 September 2023)

        Highlights

        New Features

        Improvements

        NumPy Support

        CUDA Changes

        Bug Fixes

        Changes

        Deprecations

        Pull-Requests

        Authors

This is a major Numba release. Numba now uses towncrier to create the release notes, so please find a summary of all noteworthy items below.
Highlights
Added towncrier

This PR adds towncrier as a GitHub workflow for checking release notes. From this PR onwards every PR made in Numba will require a appropriate release note associated with it. The reviewer may decide to skip adding release notes in smaller PRs with minimal impact by addition of a skip_release_notes label to the PR.

(PR-#8792)
The minimum supported NumPy version is 1.22.

Following NEP-0029, the minimum supported NumPy version is now 1.22.

(PR-#9093)
Add support for NumPy 1.25

Extend Numba to support new and changed features released in NumPy 1.25.

(PR-#9011)
Remove NVVM 3.4 and CTK 11.0 / 11.1 support

Support for CUDA toolkits < 11.2 is removed.

(PR-#9040)
Removal of Windows 32-bit Support

This release onwards, Numba has discontinued support for Windows 32-bit operating systems.

(PR-#9083)
The minimum llvmlite version is now 0.41.0.

The minimum required version of llvmlite is now version 0.41.0.

(PR-#8916)
Added RVSDG-frontend

This PR is a preliminary work on adding a RVSDG-frontend for processing bytecode. RVSDG (Regionalized Value-State Dependence Graph) allows us to have a dataflow-centric view instead of a traditional SSA-CFG view. This allows us to simplify the compiler in the future.

(PR-#9012)
New Features
numba.experimental.jitclass gains support for __*matmul__ methods.

numba.experimental.jitclass now has support for the following methods:

    __matmul__

    __imatmul__

    __rmatmul__

(PR-#8892)
numba.experimental.jitclass gains support for reflected “dunder” methods.

numba.experimental.jitclass now has support for the following methods:

    __radd__

    __rand_

    __rfloordiv__

    __rlshift__

    __ror_

    __rmod_

    __rmul_

    __rpow_

    __rrshift_

    __rsub_

    __rtruediv_

    __rxor_

(PR-#8906)
Add support for value max to NUMBA_OPT.

The optimisation level that Numba applies when compiling can be set through the environment variable NUMBA_OPT. This has historically been a value between 0 and 3 (inclusive). Support for the value max has now been added, this is a Numba-specific optimisation level which indicates that the user would like Numba to try running the most optimisation possible, potentially trading a longer compilation time for better run-time performance. In practice, use of the max level of optimisation may or may not benefit the run-time or compile-time performance of user code, but it has been added to present an easy to access option for users to try if they so wish.

(PR-#9094)
Improvements
Updates to numba.core.pythonapi.

Support for Python C-API functions PyBytes_AsString and PyBytes_AsStringAndSize is added to numba.core.pythonapi.PythonAPI as bytes_as_string and bytes_as_string_and_size methods respectively.

(PR-#8462)
Support for isinstance is now non-experimental.

Support for the isinstance built-in function has moved from being considered an experimental feature to a fully supported feature.

(PR-#8911)
NumPy Support
All modes are supported in numpy.correlate and numpy.convolve.

All values for the mode argument to numpy.correlate and numpy.convolve are now supported.

(PR-#7543)
@vectorize accommodates arguments implementing __array_ufunc__.

Universal functions (ufuncs) created with numba.vectorize will now respect arguments implementing __array_ufunc__ (NEP-13) to allow pre- and post-processing of arguments and return values when the ufunc is called from the interpreter.

(PR-#8995)
Added support for np.geomspace function.

This PR improves on #4074 by adding support for np.geomspace. The current implementation only supports scalar start and stop parameters.

(PR-#9068)
Added support for np.vsplit, np.hsplit, np.dsplit.

This PR improves on #4074 by adding support for np.vsplit, np.hsplit, and np.dsplit.

(PR-#9082)
Added support for np.row_stack function.

Support is added for numpy.row_stack.

(PR-#9085)
Added support for functions np.polynomial.polyutils.trimseq, as well as functions polyadd, polysub, polymul from np.polynomial.polynomial.

Support is added for np.polynomial.polyutils.trimseq, np.polynomial.polynomial.polyadd, np.polynomial.polynomial.polysub, np.polynomial.polynomial.polymul.

(PR-#9087)
Added support for np.diagflat function.

Support is added for numpy.diagflat.

(PR-#9113)
Added support for np.resize function.

Support is added for numpy.resize.

(PR-#9118)
Add np.trim_zeros

Support for np.trim_zeros() is added.

(PR-#9074)
CUDA Changes
Bitwise operation ufunc support for the CUDA target.

Support is added for some ufuncs associated with bitwise operation on the CUDA target. Namely:

    numpy.bitwise_and

    numpy.bitwise_or

    numpy.bitwise_not

    numpy.bitwise_xor

    numpy.invert

    numpy.left_shift

    numpy.right_shift

(PR-#8974)
Add support for the latest CUDA driver codes.

Support is added for the latest set of CUDA driver codes.

(PR-#8988)
Add NumPy comparison ufunc in CUDA

this PR adds support for comparison ufuncs for the CUDA target (eg. numpy.greater, numpy.greater_equal, numpy.less_equal, etc.).

(PR-#9007)
Report absolute path of libcuda.so on Linux

numba -s now reports the absolute path to libcuda.so on Linux, to aid troubleshooting driver issues, particularly on WSL2 where a Linux driver can incorrectly be installed in the environment.

(PR-#9034)
Add debuginfo support to nvdisasm output.

Support is added for debuginfo (source line and inlining information) in functions that make calls through nvdisasm. For example the CUDA dispatcher .inspect_sass method output is now augmented with this information.

(PR-#9035)
Add CUDA SASS CFG Support

This PR adds support for getting the SASS CFG in dot language format. It adds an inspect_sass_cfg() method to CUDADispatcher and the -cfg flag to the nvdisasm command line tool.

(PR-#9051)
Support NVRTC using the ctypes binding

NVRTC can now be used when the ctypes binding is in use, enabling float16, and linking CUDA C / C++ sources without needing the NVIDIA CUDA Python bindings.

(PR-#9086)
Fix CUDA atomics tests with toolkit 12.2

CUDA 12.2 generates slightly different PTX for some atomics, so the relevant tests are updated to look for the correct instructions when 12.2 is used.

(PR-#9088)
Bug Fixes
Handling of different sized unsigned integer indexes are fixed in numba.typed.List.

An issue with the order of truncation/extension and casting of unsigned integer indexes in numba.typed.List has been fixed.

(PR-#7262)
Prevent invalid fusion

This PR fixes an issue in which an array first read in a parfor and later written in the same parfor would only be classified as used in the parfor. When a subsequent parfor also used the same array then fusion of the parfors was happening which should have been forbidden given that that the first parfor was also writing to the array. This PR treats such arrays in a parfor as being both used and defined so that fusion will be prevented.

(PR-#7582)
The numpy.allclose implementation now correctly handles default arguments.

The implementation of numpy.allclose is corrected to use TypingError to report typing errors.

(PR-#8885)
Add type validation to numpy.isclose.

Type validation is added to the implementation of numpy.isclose.

(PR-#8944)
Fix support for overloading dispatcher with non-compatible first-class functions

Fixes an error caused by not handling compilation error during casting of Dispatcher objects into first-class functions. With the fix, users can now overload a dispatcher with non-compatible first-class functions. Refer to https://github.com/numba/numba/issues/9071 for details.

(PR-#9072)
Support dtype keyword argument in numpy.arange with parallel=True

Fixes parfors transformation to support the use of dtype keyword argument in numpy.arange(..., dtype=dtype).

(PR-#9095)
Fix all @overloads to use parameter names that match public APIs.

Some of the Numba @overloads for functions in NumPy and Python’s built-ins were written using parameter names that did not match those used in API they were overloading. The result of this being that calling a function with such a mismatch using the parameter names as key-word arguments at the call site would result in a compilation error. This has now been universally fixed throughout the code base and a unit test is running with a best-effort attempt to prevent reintroduction of similar mistakes in the future. Fixed functions include:

From Python built-ins:

    complex

From the Python random module:

    random.seed

    random.gauss

    random.normalvariate

    random.randrange

    random.randint

    random.uniform

    random.shuffle

From the numpy module:

    numpy.argmin

    numpy.argmax

    numpy.array_equal

    numpy.average

    numpy.count_nonzero

    numpy.flip

    numpy.fliplr

    numpy.flipud

    numpy.iinfo

    numpy.isscalar

    numpy.imag

    numpy.real

    numpy.reshape

    numpy.rot90

    numpy.swapaxes

    numpy.union1d

    numpy.unique

From the numpy.linalg module:

    numpy.linalg.norm

    numpy.linalg.cond

    numpy.linalg.matrix_rank

From the numpy.random module:

    numpy.random.beta

    numpy.random.chisquare

    numpy.random.f

    numpy.random.gamma

    numpy.random.hypergeometric

    numpy.random.lognormal

    numpy.random.pareto

    numpy.random.randint

    numpy.random.random_sample

    numpy.random.ranf

    numpy.random.rayleigh

    numpy.random.sample

    numpy.random.shuffle

    numpy.random.standard_gamma

    numpy.random.triangular

    numpy.random.weibull

(PR-#9099)
Changes
Support for @numba.extending.intrinsic(prefer_literal=True)

In the high level extension API, the prefer_literal option is added to the numba.extending.intrinsic decorator to prioritize the use of literal types when available. This has the same behavior as in the prefer_literal option in the numba.extending.overload decorator.

(PR-#6647)
Deprecations
Deprecation of old-style NUMBA_CAPTURED_ERRORS

Added deprecation schedule of NUMBA_CAPTURED_ERRORS=old_style. NUMBA_CAPTURED_ERRORS=new_style will become the default in future releases. Details are documented at https://numba.readthedocs.io/en/stable/reference/deprecation.html#deprecation-of-old-style-numba-captured-errors

(PR-#9090)
Pull-Requests

    PR #6647: Support prefer_literal option for intrinsic decorator (ashutoshvarma sklam)

    PR #7262: fix order of handling and casting (esc)

    PR #7543: Support for all modes in np.correlate and np.convolve (jeertmans)

    PR #7582: Use get_parfor_writes to detect illegal array access that prevents fusion. (DrTodd13)

    PR #8371: Added binomial distribution (esc kc611)

    PR #8462: Add PyBytes_AsString and PyBytes_AsStringAndSize (ianna)

    PR #8633: DOC: Convert vectorize and guvectorize examples to doctests (Matt711)

    PR #8730: Update dev-docs (sgbaird esc)

    PR #8792: Added towncrier as a github workflow (kc611)

    PR #8854: Updated mk_alloc to support Numba-Dpex compute follows data. (mingjie-intel)

    PR #8861: CUDA: Don’t add device kwarg for jit registry (gmarkall)

    PR #8871: Don’t return the function in CallConv.decorate_function() (gmarkall)

    PR #8885: Fix np.allclose not handling default args (guilhermeleobas)

    PR #8892: Add support for __*matmul__ methods in jitclass (louisamand)

    PR #8895: CUDA: Enable caching functions that use CG (gmarkall)

    PR #8906: Add support for reflected dunder methods in jitclass (louisamand)

    PR #8911: Remove isinstance experimental feature warning (guilhermeleobas)

    PR #8916: Bump llvmlite requirement to 0.41.0dev0 (sklam)

    PR #8925: Update release checklist template (sklam)

    PR #8937: Remove old Website development documentation (esc gmarkall)

    PR #8944: Add exceptions to np.isclose (guilhermeleobas)

    PR #8974: CUDA: Add binary ufunc support (Matt711)

    PR #8976: Fix index URL for ptxcompiler/cubinlinker packages. (bdice)

    PR #8978: Import MVC packages when using MVCLinker. (bdice)

    PR #8983: Fix typo in deprecation.rst (dsgibbons)

    PR #8988: support for latest CUDA driver codes #8363 (s1Sharp)

    PR #8995: Allow libraries that implement __array_ufunc__ to override DUFunc.__c… (jpivarski)

    PR #9007: CUDA: Add comparison ufunc support (Matt711)

    PR #9012: RVSDG-frontend (sklam)

    PR #9021: update the release checklist following 0.57.1rc1 (esc)

    PR #9022: fix: update the C++ ABI repo reference (emmanuel-ferdman)

    PR #9028: Replace use of imp module removed in 3.12 (hauntsaninja)

    PR #9034: CUDA libs test: Report the absolute path of the loaded libcuda.so on Linux, + other improvements (gmarkall)

    PR #9035: CUDA: Allow for debuginfo in nvdisasm output (Matt711)

    PR #9037: Recognize additional functions as being pure or not having side effects. (DrTodd13)

    PR #9039: Correct git clone link in installation instructions. (ellifteria)

    PR #9040: Remove NVVM 3.4 and CTK 11.0 / 11.1 support (gmarkall)

    PR #9046: copy the change log changes for 0.57.1 to main (esc)

    PR #9050: Update CODEOWNERS (sklam)

    PR #9051: Add CUDA CFG support (Matt711)

    PR #9056: adding weekly meeting notes script (esc)

    PR #9068: Adding np.geomspace (KrisMinchev)

    PR #9069: Fix towncrier error due to importlib_resources upgrade (sklam)

    PR #9072: Fix support for overloading dispatcher with non-compatible first-class functions (gmarkall sklam)

    PR #9074: Add np.trim_zeros (sungraek guilhermeleobas)

    PR #9082: Add np.vsplit, np.hsplit, and np.dsplit (KrisMinchev)

    PR #9083: Removed windows 32 references from code and documentation (kc611)

    PR #9085: Add tests for np.row_stack (KrisMinchev)

    PR #9086: Support NVRTC using ctypes binding (testhound gmarkall)

    PR #9087: Add trimseq from np.polynomial.polyutils and polyadd, polysub, polymul from np.polynomial.polynomial (KrisMinchev)

    PR #9088: Fix: Issue 9063 - CUDA atomics tests failing with CUDA 12.2 (gmarkall)

    PR #9090: Add deprecation notice for old_style error capturing. (esc sklam)

    PR #9094: Add support for a ‘max’ level to NUMBA_OPT environment variable. (stuartarchibald)

    PR #9095: Support dtype keyword in arange_parallel_impl (DrTodd13 sklam)

    PR #9105: NumPy 1.25 support (PR #9011) continued (gmarkall apmasell)

    PR #9111: Fixes ReST syntax error in PR#9099 (stuartarchibald gmarkall sklam apmasell)

    PR #9112: Fixups for PR#9100 (stuartarchibald sklam)

    PR #9113: Add support for np.diagflat (KrisMinchev)

    PR #9114: update np min to 122 (stuartarchibald esc)

    PR #9117: Fixed towncrier template rendering (kc611)

    PR #9118: Add support for np.resize() (KrisMinchev)

    PR #9120: Update conda-recipe for numba-rvsdg (sklam)

    PR #9127: Fix accidental cffi test deps, refactor cffi skipping (gmarkall)

    PR #9128: Merge rvsdg_frontend branch to main (esc sklam)

    PR #9152: Fix old_style error capturing deprecation warnings (sklam)

    PR #9159: Fix uncaught exception in find_file() (gmarkall)

    PR #9173: Towncrier fixups (Continue #9158 and retarget to main branch) (sklam)

    PR #9181: Remove extra decrefs in RNG (sklam)

    PR #9190: Fix issue with incompatible multiprocessing context in test. (stuartarchibald)

Authors

    apmasell

    ashutoshvarma

    bdice

    DrTodd13

    dsgibbons

    ellifteria

    emmanuel-ferdman

    esc

    gmarkall

    guilhermeleobas

    hauntsaninja

    ianna

    jeertmans

    jpivarski

    jtilly

    kc611

    KrisMinchev

    louisamand

    Matt711

    mingjie-intel

    s1Sharp

    sgbaird

    sklam

    stuartarchibald

    sungraek

    testhound




Version 0.57.1 (21 June, 2023)

Pull-Requests:

    PR #8964: fix missing nopython keyword in cuda random module (esc)

    PR #8965: fix return dtype for np.angle (guilhermeleobas esc)

    PR #8982: Don’t do the parfor diagnostics pass for the parfor gufunc. (DrTodd13)

    PR #8996: adding a test for 8940 (esc)

    PR #8958: resurrect the import, this time in the registry initialization (esc)

    PR #8947: Introduce internal _isinstance_no_warn (guilhermeleobas esc)

    PR #8998: Fix 8939 (second attempt) (esc)

    PR #8978: Import MVC packages when using MVCLinker. (bdice)

    PR #8895: CUDA: Enable caching functions that use CG (gmarkall)

    PR #8976: Fix index URL for ptxcompiler/cubinlinker packages. (bdice)

    PR #9004: Skip MVC test when libraries unavailable (gmarkall esc)

    PR #9006: link to version support table instead of using explicit versions (esc)

    PR #9005: Fix: Issue #8923 - avoid spurious device-to-host transfers in CUDA ufuncs (gmarkall)

Authors:

    bdice

    DrTodd13

    esc

    gmarkall

Version 0.57.0 (1 May, 2023)

This release continues to add new features, bug fixes and stability improvements to Numba. Please note that this release contains a significant number of both deprecation and pending-deprecation notices with view of making it easier to develop new technology for Numba in the future. Also note that this will be the last release to support Windows 32-bit packages produced by the Numba team.

Highlights of core dependency upgrades:

    Support for Python 3.11 (minimum is moved to 3.8)

    Support for NumPy 1.24 (minimum is moved to 1.21)

Python language support enhancements:

    Exception classes now support arguments that are not compile time constant.

    The built-in functions hasattr and getattr are supported for compile time constant attributes.

    The built-in functions str and repr are now implemented similarly to their Python implementations. Custom __str__ and __repr__ functions can be associated with types and work as expected.

    Numba’s unicode functionality in str.startswith now supports kwargs start and end.

    min and max now support boolean types.

    Support is added for the dict(iterable) constructor.

NumPy features/enhancements:

    The largest set of new features is within the numpy.random.Generator support, the vast majority of commonly used distributions are now supported. Namely:

        Generator.beta

        Generator.chisquare

        Generator.exponential

        Generator.f

        Generator.gamma

        Generator.geometric

        Generator.integers

        Generator.laplace

        Generator.logistic

        Generator.lognormal

        Generator.logseries

        Generator.negative_binomial

        Generator.noncentral_chisquare

        Generator.noncentral_f

        Generator.normal

        Generator.pareto

        Generator.permutation

        Generator.poisson

        Generator.power

        Generator.random

        Generator.rayleigh

        Generator.shuffle

        Generator.standard_cauchy

        Generator.standard_exponential

        Generator.standard_gamma

        Generator.standard_normal

        Generator.standard_t

        Generator.triangular

        Generator.uniform

        Generator.wald

        Generator.weibull

        Generator.zipf

    The nbytes property on NumPy ndarray types is implemented.

    Nesting of nested-array types is now supported.

    datetime and timedelta types can be cast to int.

    F-order iteration is supported in ufunc generation for increased performance when using combinations of predominantly F-order arrays.

    The following functions are also now supported:

        np.argpartition

        np.isclose

        np.nan_to_num

        np.new_axis

        np.union1d

Highlights of core changes:

    A large amount of refactoring has taken place to convert many of Numba’s internal implementations, of both Python and NumPy functions, from the low-level extension API to the high-level extension API (numba.extending).

    The __repr__ method is supported for Numba types.

    The default target for applicable functions in the extension API (numba.extending) is now "generic". This means that @overload* and @intrinsic functions will by default be accepted by both the CPU and CUDA targets.

    The use of __getitem__ on Numba types is now supported in compiled code. i.e. types.float64[:, ::1] is now compilable.

Performance:

    The performance of str.find() and str.rfind() has been improved.

    Unicode support for __getitem__ now avoids allocation and returns a view.

    The numba.typed.Dict dictionary now accepts an n_keys option to enable allocating the dictionary instance to a predetermined initial size (useful to avoid resizes!).

    The Numba Run-time (NRT) has been improved in terms of performance and safety:

        The NRT internal statistics counters are now off by default (removes atomic lock contentions).

        Debug cache line filling is off by default.

        The NRT is only compiled once a compilation starts opposed to at function decoration time, this improves import speed.

        The NRT allocation calls are all made through a “checked” layer by default.

CUDA:

    New NVIDIA hardware and software compatibility / support:

        Toolkits: CUDA 11.8 and 12, with Minor Version Compatibility for 11.x.

        Packaging: NVIDIA-packaged CUDA toolkit conda packages.

        Hardware: Hopper, Ada Lovelace, and AGX Orin.

    float16 support:

        Arithmetic operations are now fully supported.

        A new method, is_fp16_supported(), and device property, supports_float16, for checking the availability of float16 support.

    Functionality:

        The high-level extension API is now fully-supported in the CUDA target.

        Eager compilation of multiple signatures, multiple outputs from generalized ufuncs, and specifying the return type of ufuncs are now supported.

        A limited set of NumPy ufuncs (trigonometric functions) can now be called inside kernels.

    Lineinfo quality improvement: enabling lineinfo no longer results in any changes to generated code.

Deprecations:

    The numba.pycc module and everything in it is now pending deprecation.

    The long awaited full deprecation of object mode fall-back is underway. This change means @jit with no keyword arguments will eventually alias @njit.

    The @generated_jit decorator is deprecated as the Numba extension API provides a better supported superset of the same functionality, particularly through @numba.extending.overload.

Version support/dependency changes:

    The setuptools package is now an optional run-time dependency opposed to a required run-time dependency.

    The TBB threading-layer now requires version 2021.6 or later.

    LLVM 14 is now supported on all platforms via llvmlite.

Pull-Requests:

    PR #5113: Fix error handling in the Interval extending example (esc eric-wieser)

    PR #5544: Add support for np.union1d (shangbol gmarkall)

    PR #7009: Add writable args (dmbelov)

    PR #7067: Implement np.isclose (guilhermeleobas)

    PR #7255: CUDA: Support CUDA Toolkit conda packages from NVIDIA (gmarkall)

    PR #7622: Support fortran loop ordering for ufunc generation (sklam)

    PR #7733: fix for /tmp/tmp access issues (ChiCheng45)

    PR #7884: Implement getattr builtin. (stuartarchibald)

    PR #7885: Adds CUDA FP16 arithmetic operators (testhound)

    PR #7920: Drop pre-3.7 code path (CPU only) (sklam)

    PR #8001: CUDA fp16 math functions (testhound gmarkall)

    PR #8010: Add support for fp16 comparison native operators (testhound)

    PR #8024: Allow converting NumPy datetimes to int (apmasell)

    PR #8038: Support for Numpy BitGenerators PR#2: Standard Distributions support (kc611)

    PR #8040: Support for Numpy BitGenerators PR#3: Advanced Distributions Support. (kc611)

    PR #8041: Support for Numpy BitGenerators PR#4: Generator().integers() Support. (kc611)

    PR #8042: Support for NumPy BitGenerators PR#5: Generator Shuffling Methods. (kc611)

    PR #8061: Migrate random glue_lowering to overload where easy (apmasell)

    PR #8106: Remove injection of atomic JIT functions into NRT memsys. (stuartarchibald)

    PR #8120: Support nesting of nested array types (gmarkall)

    PR #8134: Support non-constant exception values in JIT (guilhermeleobas sklam)

    PR #8147: Adds size variable at runtime for arrays that cannot be inferred (njriasan)

    PR #8154: Testhound/native cast 8138 (testhound)

    PR #8158: adding -pthread for linux-ppc64le in setup.py (esc)

    PR #8164: remove myself from automatic reviewer assignment (esc)

    PR #8167: CUDA: Facilitate and document passing arrays / pointers to foreign functions (gmarkall)

    PR #8180: CUDA: Initial support for Minor Version Compatibility (gmarkall)

    PR #8183: Add n_keys option to Dict.empty() (stefanfed gmarkall)

    PR #8198: Update the release template to include updating the version table. (stuartarchibald)

    PR #8200: Make the NRT use the “unsafe” allocation API by default. (stuartarchibald)

    PR #8201: Bump llvmlite dependency to 0.40.dev0 for Numba 0.57.0dev0 (stuartarchibald)

    PR #8207: development tag should be in monofont (esc)

    PR #8212: release checklist: include a note to ping @RC_testers on discourse (esc)

    PR #8216: chore: Set permissions for GitHub actions (naveensrinivasan)

    PR #8217: Fix syntax in docs (jorgepiloto)

    PR #8220: Added the interval example as doctest (kc611)

    PR #8221: CUDA stubs docstring: Replace illegal escape sequence (gmarkall)

    PR #8228: Fix typo in @vectorize docstring and a NumPy spelling. (stuartarchibald)

    PR #8229: Remove mk_unique_var in inline_closurecall.py (sklam)

    PR #8234: Replace @overload_glue by @overload for 20 NumPy functions (guilhermeleobas)

    PR #8235: Make the NRT stats counters optional. (stuartarchibald)

    PR #8238: Advanced Indexing Support #1 (kc611)

    PR #8240: Add get_shared_mem_per_block method to Dispatcher (testhound)

    PR #8241: Reorder typeof checks to avoid infinite loops on StructrefProxy __hash__ (DannyWeitekamp)

    PR #8243: Add a note to reference/numpysupported.rst ()

    PR #8245: Fix links in CONTRIBUTING.md ()

    PR #8247: Fix issue 8127 (bszollosinagy)

    PR #8250: Fix issue 8161 (bszollosinagy)

    PR #8253: CUDA: Verify NVVM IR prior to compilation (gmarkall)

    PR #8255: CUDA: Make numba.cuda.tests.doc_examples.ffi a module to fix #8252 (gmarkall)

    PR #8256: Migrate linear algebra functions from glue_lowering (apmasell)

    PR #8258: refactor np.where to use overload (guilhermeleobas)

    PR #8259: Add np.broadcast_to(scalar_array, ()) (guilhermeleobas)

    PR #8264: remove mk_unique_var from parfor_lowering_utils.py (guilhermeleobas)

    PR #8265: Remove mk_unique_var from array_analysis.py (guilhermeleobas)

    PR #8266: Remove mk_unique_var in untyped_passes.py (guilhermeleobas)

    PR #8267: Fix segfault for invalid axes in np.split (aseyboldt)

    PR #8271: Implement some CUDA intrinsics with @overload, @overload_attribute, and @intrinsic (gmarkall)

    PR #8274: Update version support table doc for 0.56. (stuartarchibald)

    PR #8275: Update CHANGE_LOG for 0.56.0 final (stuartarchibald)

    PR #8283: Clean up / remove support for old NumPy versions (gmarkall)

    PR #8287: Drop CUDA 10.2 (gmarkall)

    PR #8289: Revert #8265. (stuartarchibald)

    PR #8290: CUDA: Replace use of deprecated NVVM IR features, questionable constructs (gmarkall)

    PR #8292: update checklist (esc)

    PR #8294: CUDA: Add trig ufunc support (gmarkall)

    PR #8295: Add get_const_mem_size method to Dispatcher (testhound gmarkall)

    PR #8297: Add __name__ attribute to CUDAUFuncDispatcher and test case (testhound)

    PR #8299: Fix build for mingw toolchain (Biswa96)

    PR #8302: CUDA: Revert numba_nvvm intrinsic name workaround (gmarkall)

    PR #8308: CUDA: Support for multiple signatures (gmarkall)

    PR #8315: Add get_local_mem_per_thread method to Dispatcher (testhound)

    PR #8319: Bump minimum supported Python version to 3.8 (esc stuartarchibald jamesobutler)

    PR #8320: Add __name__ support for GUFuncs (testhound)

    PR #8321: Fix literal_unroll pass erroneously exiting on non-conformant loop. (stuartarchibald)

    PR #8325: Remove use of mk_unique_var in stencil.py (bszollosinagy)

    PR #8326: Remove mk_unique_var from parfor_lowering.py (guilhermeleobas)

    PR #8331: Extend docs with info on how to call C functions from Numba (guilhermeleobas)

    PR #8334: Add dict(*iterable) constructor (guilhermeleobas)

    PR #8335: Remove deprecated pycc script and related source. (stuartarchibald)

    PR #8336: Fix typos of “Generalized” in GUFunc-related code (gmarkall)

    PR #8338: Calculate reductions before fusion so that use of reduction vars can stop fusion. (DrTodd13)

    PR #8339: Fix #8291 parfor leak of redtoset variable (sklam)

    PR #8341: CUDA: Support multiple outputs for Generalized Ufuncs (gmarkall)

    PR #8343: Eliminate references to type annotation in compile_ptx (testhound)

    PR #8348: Add get_max_threads_per_block method to Dispatcher (testhound)

    PR #8354: pin setuptools to < 65 and switch from mamba to conda on RTD (esc gmarkall)

    PR #8357: Clean up the buildscripts directory. (stuartarchibald)

    PR #8359: adding warnings about cache behaviour (luk-f-a)

    PR #8368: Remove glue_lowering in random math that requires IR (apmasell)

    PR #8376: Fix issue 8370 (bszollosinagy)

    PR #8387: Add support for compute capability in IR Lowering (testhound)

    PR #8388: Remove more references to the pycc binary. (stuartarchibald)

    PR #8389: Make C++ extensions compile with correct compiler (apmasell)

    PR #8390: Use NumPy logic for lessthan in sort to move NaNs to the back. (sklam)

    PR #8401: Remove Cuda toolkit version check (testhound)

    PR #8415: Refactor numba.np.arraymath methods from lower_builtins to overloads (kc611)

    PR #8418: Fixes ravel failure on 1d arrays (#5229) (cako)

    PR #8421: Update release checklist: add a task to check dependency pinnings on subsequent releases (e.g. PATCH) (esc)

    PR #8422: Switch public CI builds to use gdb from conda packages. (stuartarchibald)

    PR #8423: Remove public facing and CI references to 32 bit linux support. (stuartarchibald, in addition, we are grateful for the contribution of jamesobutler towards a similar goal in PR #8319)

    PR #8425: Post 0.56.2 cleanup (esc)

    PR #8427: Shorten the time to verify test discovery. (stuartarchibald)

    PR #8429: changelog generator script (esc)

    PR #8431: Replace @overload_glue by @overload for np.linspace and np.take (guilhermeleobas)

    PR #8432: Refactor carray/farray to use @overload (guilhermeleobas)

    PR #8435: Migrate np.atleast_? functions from glue_lowering to overload (apmasell)

    PR #8438: Make the initialisation of the NRT more lazy for the njit decorator. (stuartarchibald)

    PR #8439: Update the contributing docs to include a policy on formatting changes. (stuartarchibald)

    PR #8440: [DOC]: Replaces icc_rt with intel-cmplr-lib-rt (oleksandr-pavlyk)

    PR #8442: Implement hasattr(), str() and repr(). (stuartarchibald)

    PR #8446: add version info in ImportError’s (raybellwaves)

    PR #8450: remove GitHub username from changelog generation script (esc)

    PR #8467: Convert implementations using generated_jit to overload (gmarkall)

    PR #8468: Reference test suite in installation documentation (apmasell)

    PR #8469: Correctly handle optional types in parfors lowering (apmasell)

    PR #8473: change the include style in _pymodule.h and remove unused or duplicate headers in two header files ()

    PR #8476: Make setuptools optional at runtime. (stuartarchibald)

    PR #8490: Restore installing SciPy from defaults instead of conda-forge on public CI (esc)

    PR #8494: Remove context.compile_internal where easy on numba/cpython/cmathimpl.py (guilhermeleobas)

    PR #8495: Removes context.compile_internal where easy on numba/cpython/listobj.py (guilhermeleobas)

    PR #8496: Rewrite most of the set API to use overloads (guilhermeleobas)

    PR #8499: Deprecate numba.generated_jit (stuartarchibald)

    PR #8508: This updates the release checklists to capture some more checks. (stuartarchibald)

    PR #8513: Added support for numpy.newaxis (kc611)

    PR #8517: make some typedlist C-APIs public ()

    PR #8518: Adjust stencil tests to use hardcoded python source opposed to AST. (stuartarchibald)

    PR #8520: Added noncentral-chisquared, noncentral-f and logseries distributions (kc611)

    PR #8522: Import jitclass from numba.experimental in jitclass documentation (armgabrielyan)

    PR #8524: Fix grammar in stencil.rst (armgabrielyan)

    PR #8525: Making CUDA specific datamodel manager (sklam)

    PR #8526: Fix broken url (Nimrod0901)

    PR #8527: Fix grammar in troubleshoot.rst (armgabrielyan)

    PR #8532: Vary NumPy version on gpuCI (gmarkall)

    PR #8535: LLVM14 (apmasell)

    PR #8536: Fix fusion bug. (DrTodd13)

    PR #8539: Fix #8534, np.broadcast_to should update array size attr. (stuartarchibald)

    PR #8541: Remove restoration of “free” channel in Azure CI windows builds. (stuartarchibald)

    PR #8542: CUDA: Make arg optional for Stream.add_callback() (gmarkall)

    PR #8544: Remove reliance on npy_<impl> ufunc loops. (stuartarchibald)

    PR #8545: Py3.11 basic support (esc sklam)

    PR #8547: [Unicode] Add more string view usages for unicode operations ()

    PR #8549: Fix rstcheck in Azure CI builds, update sphinx dep and docs to match (stuartarchibald)

    PR #8550: Changes how tests are split between test instances (apmasell)

    PR #8554: Make target for @overload have ‘generic’ as default. (stuartarchibald gmarkall)

    PR #8557: [Unicode] support startswith with args, start and end. ()

    PR #8566: Update workqueue abort message on concurrent access. (stuartarchibald)

    PR #8572: CUDA: Reduce memory pressure from local memory tests (gmarkall)

    PR #8579: CUDA: Add CUDA 11.8 / Hopper support and required fixes (gmarkall)

    PR #8580: adding note about doing a wheel test build prior to tagging (esc)

    PR #8583: Skip tests that contribute to M1 RuntimeDyLd Assertion error (sklam)

    PR #8587: Remove unused refcount removal code, clean core/cpu.py module. (stuartarchibald)

    PR #8588: Remove lowering extension hooks, replace with pass infrastructure. (stuartarchibald)

    PR #8590: Py3.11 support continues (sklam)

    PR #8592: fix failure of test_cache_invalidate due to read-only install (tpwrules)

    PR #8593: Adjusted ULP precesion for noncentral distribution test (kc611)

    PR #8594: Fix various CUDA lineinfo issues (gmarkall)

    PR #8597: Prevent use of NumPy’s MaskedArray. (stuartarchibald)

    PR #8598: Setup Azure CI to test py3.11 (sklam)

    PR #8600: Chrome trace timestamp should be in microseconds not seconds. (sklam)

    PR #8602: Throw error for unsupported dunder methods (apmasell)

    PR #8605: Support for CUDA fp16 math functions (part 1) (testhound)

    PR #8606: [Doc] Make the RewriteArrayExprs doc more precise ()

    PR #8619: Added flat iteration logic for random distributions (kc611)

    PR #8623: Adds support for np.nan_to_num (thomasjpfan)

    PR #8624: DOC: Add guvectorize scalar return example (Matt711)

    PR #8625: Refactor test_ufuncs (gmarkall)

    PR #8626: [unicode-PERF]: use optmized BM algorithm to replace the brute-force finder (dlee992)

    PR #8630: Fix #8628: Don’t test math.trunc with non-float64 NumPy scalars (gmarkall)

    PR #8634: Add new method is_fp16_supported (testhound)

    PR #8636: CUDA: Skip test_ptds on Windows (gmarkall)

    PR #8639: Python 3.11 - fix majority of remaining test failures. (stuartarchibald)

    PR #8644: Fix bare reraise support (sklam)

    PR #8649: Remove numba.core.overload_glue module. (apmasell)

    PR #8659: Preserve module name of jitted class (neilflood)

    PR #8661: Make external compiler discovery lazy in the test suite. (stuartarchibald)

    PR #8662: Add support for .nbytes accessor for numpy arrays (alanhdu)

    PR #8666: Updates for Python 3.8 baseline/Python 3.11 migration (stuartarchibald)

    PR #8673: Enable the CUDA simulator tests on Windows builds in Azure CI. (stuartarchibald)

    PR #8675: Make always_run test decorator a tag and improve shard tests. (stuartarchibald)

    PR #8677: Add support for min and max on boolean types. (DrTodd13)

    PR #8680: Adjust flake8 config to be compatible with flake8=6.0.0 (thomasjpfan)

    PR #8685: Implement __repr__ for numba types (luk-f-a)

    PR #8691: NumPy 1.24 (gmarkall)

    PR #8697: Close stale issues after 7 days (gmarkall)

    PR #8701: Relaxed ULP testing precision for NumPy Generator tests across all systems (kc611)

    PR #8702: Supply concrete timeline for objmode fallback deprecation/removal. (stuartarchibald)

    PR #8706: Fix doctest for @vectorize (sklam)

    PR #8711: Python 3.11 tracing support (continuation of #8670). (AndrewVallette sklam)

    PR #8716: CI: Use set -e in “Before Install” step and fix install (gmarkall)

    PR #8720: Enable coverage for subprocess testing (sklam)

    PR #8723: Check for void return type in cuda.compile_ptx (brandonwillard)

    PR #8726: Make Numba dependency check run ahead of Numba internal imports. (stuartarchibald)

    PR #8728: Fix flake8 checks since upgrade to flake8=6.x (stuartarchibald)

    PR #8729: Run flake8 CI step in multiple processes. (stuartarchibald)

    PR #8732: Add numpy argpartition function support ()

    PR #8735: Update bot to close PRs waiting on authors for more than 3 months (guilhermeleobas)

    PR #8736: Implement np.lib.stride_tricks.sliding_window_view ()

    PR #8744: Update CtypesLinker::add_cu error message to include fp16 usage (testhound gmarkall)

    PR #8746: Fix failing test_dispatcher test case (testhound)

    PR #8748: Suppress known test failures for py3.11 (sklam)

    PR #8751: Recycle test runners more aggressively (apmasell)

    PR #8752: Flake8 fixes for py311 branch (esc sklam)

    PR #8760: Bump llvmlite PR in py3.11 branch testing (sklam)

    PR #8764: CUDA tidy-up: remove some unneeded methods (gmarkall)

    PR #8765: BLD: remove distutils (fangchenli)

    PR #8766: Stale bot: Use abandoned - stale label for closed PRs (gmarkall)

    PR #8771: Update vendored Versioneer from 0.14 to 0.28 (oscargus gmarkall)

    PR #8775: Revert PR#8751 for buildfarm stability (sklam)

    PR #8780: Improved documentation for Atomic CAS (MiloniAtal)

    PR #8781: Ensure gc.collect() is called before checking refcount in tests. (sklam)

    PR #8782: Changed wording of the escape error (MiloniAtal)

    PR #8786: Upgrade stale GitHub action (apmasell)

    PR #8788: CUDA: Fix returned dtype of vectorized functions (Issue #8400) (gmarkall)

    PR #8790: CUDA compare and swap with index (ianthomas23)

    PR #8795: Add pending-deprecation warnings for numba.pycc (stuartarchibald)

    PR #8802: Move the minimum supported NumPy version to 1.21 (stuartarchibald)

    PR #8803: Attempted fix to #8789 by changing compile_ptx to accept a signature instead of argument tuple (KyanCheung)

    PR #8804: Split parfor pass into 3 parts (DrTodd13)

    PR #8809: Update LLVM versions for 0.57 release (apmasell)

    PR #8810: Fix llvmlite dependency in meta.yaml (sklam)

    PR #8816: Fix some buildfarm test failures (sklam)

    PR #8819: Support “static” __getitem__ on Numba types in @njit code. (stuartarchibald)

    PR #8822: Merge py3.11 branch to main (esc AndrewVallette stuartarchibald sklam)

    PR #8826: CUDA CFFI test: conditionally require cffi module (gmarkall)

    PR #8831: Redo py3.11 sync branch with main (sklam)

    PR #8833: Fix typeguard import hook location. (stuartarchibald)

    PR #8836: Fix failing typeguard test. (stuartarchibald)

    PR #8837: Update AzureCI matrix for Python 3.11/NumPy 1.21..1.24 (stuartarchibald)

    PR #8839: Add Dynamic Shared Memory example. (k1m190r)

    PR #8842: Fix buildscripts, setup.py, docs for setuptools becoming optional. (stuartarchibald)

    PR #8843: Pin typeguard to 3.0.1 in AzureCI. (stuartarchibald)

    PR #8848: added lifted loops to glossary term (cherieliu)

    PR #8852: Disable SLP vectorisation due to miscompilations. (stuartarchibald)

    PR #8855: DOC: pip into double backticks in installing.rst (F3eQnxN3RriK)

    PR #8856: Update TBB to use >= 2021.6 by default. (kozlov-alexey stuartarchibald)

    PR #8858: Update deprecation notice for objmode fallback RE @jit use. (stuartarchibald)

    PR #8864: Remove obsolete deprecation notices (gmarkall)

    PR #8866: Revise CUDA deprecation notices (gmarkall)

    PR #8869: Update CHANGE_LOG for 0.57.0rc1 (stuartarchibald esc gmarkall)

    PR #8870: Fix opcode “spelling” change since Python 3.11 in CUDA debug test. (stuartarchibald)

    PR #8879: Remove use of compile_isolated from generator tests. (stuartarchibald)

    PR #8880: Fix missing dependency guard on pyyaml in test_azure_config. (stuartarchibald)

    PR #8881: Replace use of compile_isolated in test_obj_lifetime (sklam)

    PR #8884: Pin llvmlite and NumPy on release branch (sklam)

    PR #8887: Update PyPI supported version tags (bryant1410)

    PR #8896: Remove codecov install (now deleted from PyPI) (gmarkall)

    PR #8902: Enable CALL_FUNCTION_EX fix for py3.11 (sklam)

    PR #8907: Work around issue #8898. Defer exp2 (and log2) calls to Numba internal symbols. (stuartarchibald)

    PR #8909: Fix #8903. NumbaDeprecationWarning``s raised from ``@{gu,}vectorize. (stuartarchibald)

    PR #8929: Update CHANGE_LOG for 0.57.0 final. (stuartarchibald)

    PR #8930: Fix year in change log (jtilly)

    PR #8932: Fix 0.57 release changelog (sklam)

Authors:

    alanhdu

    AndrewVallette

    apmasell

    armgabrielyan

    aseyboldt

    Biswa96

    brandonwillard

    bryant1410

    bszollosinagy

    cako

    cherieliu

    ChiCheng45

    DannyWeitekamp

    dlee992

    dmbelov

    DrTodd13

    eric-wieser

    esc

    F3eQnxN3RriK

    fangchenli

    gmarkall

    guilhermeleobas

    ianthomas23

    jamesobutler

    jorgepiloto

    jtilly

    k1m190r

    kc611

    kozlov-alexey

    KyanCheung

    luk-f-a

    Matt711

    MiloniAtal

    naveensrinivasan

    neilflood

    Nimrod0901

    njriasan

    oleksandr-pavlyk

    oscargus

    raybellwaves

    shangbol

    sklam

    stefanfed

    stuartarchibald

    testhound

    thomasjpfan

    tpwrules

Version 0.56.4 (3 November, 2022)

This is a bugfix release to fix a regression in the CUDA target in relation to the .view() method on CUDA device arrays that is present when using NumPy version 1.23.0 or later.

Pull-Requests:

    PR #8537: Make ol_compatible_view accessible on all targets (gmarkall)

    PR #8552: Update version support table for 0.56.4. (stuartarchibald)

    PR #8553: Update CHANGE_LOG for 0.56.4 (stuartarchibald)

    PR #8570: Release 0.56 branch: Fix overloads with target="generic" for CUDA (gmarkall)

    PR #8571: Additional update to CHANGE_LOG for 0.56.4 (stuartarchibald)

Authors:

    gmarkall

    stuartarchibald

Version 0.56.3 (13 October, 2022)

This is a bugfix release to remove the version restriction applied to the setuptools package and to fix a bug in the CUDA target in relation to copying zero length device arrays to zero length host arrays.

Pull-Requests:

    PR #8475: Remove setuptools version pin (gmarkall)

    PR #8482: Fix #8477: Allow copies with different strides for 0-length data (gmarkall)

    PR #8486: Restrict the TBB development package to supported version in Azure. (stuartarchibald)

    PR #8503: Update version support table for 0.56.3 (stuartarchibald)

    PR #8504: Update CHANGE_LOG for 0.56.3 (stuartarchibald)

Authors:

    gmarkall

    stuartarchibald

Version 0.56.2 (1 September, 2022)

This is a bugfix release that supports NumPy 1.23 and fixes CUDA function caching.

Pull-Requests:

    PR #8239: Add decorator to run a test in a subprocess (stuartarchibald)

    PR #8276: Move Azure to use macos-11 (stuartarchibald)

    PR #8310: CUDA: Fix Issue #8309 - atomics don’t work on complex components (Graham Markall)

    PR #8342: Upgrade to ubuntu-20.04 for azure pipeline CI (jamesobutler)

    PR #8356: Update setup.py, buildscripts, CI and docs to require setuptools<60 (stuartarchibald)

    PR #8374: Don’t pickle LLVM IR for CUDA code libraries (Graham Markall)

    PR #8377: Add support for NumPy 1.23 (stuartarchibald)

    PR #8384: Move strace() check into tests that actually need it (stuartarchibald)

    PR #8386: Fix the docs for numba.get_thread_id (stuartarchibald)

    PR #8407: Pin NumPy version to 1.18-1.24 (Andre Masella)

    PR #8411: update version support table for 0.56.1 (esc)

    PR #8412: Create changelog for 0.56.1 (Andre Masella)

    PR #8413: Fix Azure CI for NumPy 1.23 and use conda-forge scipy (Siu Kwan Lam)

    PR #8414: Hotfix for 0.56.2 (Siu Kwan Lam)

Authors:

    Andre Masella

    esc

    Graham Markall

    jamesobutler

    Siu Kwan Lam

    stuartarchibald

Version 0.56.1 (NO RELEASE)

The release was skipped due to issues during the release process.
Version 0.56.0 (25 July, 2022)

This release continues to add new features, bug fixes and stability improvements to Numba. Please note that this will be the last release that has support for Python 3.7 as the next release series (Numba 0.57) will support Python 3.11! Also note that, this will be the last release to support linux-32 packages produced by the Numba team.

Python language support enhancements:

    Previously missing support for large, in-line dictionaries and internal calls to functions with large numbers of keyword arguments in Python 3.10 has been added.

    operator.mul now works for list s.

    Literal slices, e.g. slice(1, 10, 2) can be returned from nopython mode functions.

    The len function now works on dict_keys, dict_values and dict_items .

    Numba’s set implementation now supports reference counted items e.g. strings.

Numba specific feature enhancements:

    The experimental jitclass feature gains support for a large number of builtin methods e.g. declaring __hash__ or __getitem__ for a jitclass type.

    It’s now possible to use @vectorize on an already @jit family decorated function.

    Name mangling has been updated to emit compiled function names that exactly match the function name in Python. This means debuggers, like GDB, can be set to break directly on Python function names.

    A GDB “pretty printing” support module has been added, when loaded into GDB Numba’s internal representations of Python/NumPy types are rendered inside GDB as they would be in Python.

    An experimental option is added to the @jit family decorators to entirely turn off LLVM’s optimisation passes for a given function (see _dbg_optnone kwarg in the @jit decorator family).

    A new environment variable is added NUMBA_EXTEND_VARIABLE_LIFETIMES, which if set will extend the lifetime of variables to the end of their basic block, this to permit a debugging experience in GDB similar to that found in compiled C/C++/Fortran code.

NumPy features/enhancements:

    Initial support for passing, using and returning numpy.random.Generator instances has been added, this currently includes support for the random distribution.

    The broadcasting functions np.broadcast_shapes and np.broadcast_arrays are now supported.

    The min and max functions now work with np.timedelta64 and np.datetime64 types.

    Sorting multi-dimensional arrays along the last axis is now supported in np.sort().

    The np.clip function is updated to accept NumPy arrays for the a_min and a_max arguments.

    The NumPy allocation routines (np.empty , np.ones etc.) support shape arguments specified using members of enum.IntEnum s.

    The function np.random.noncentral_chisquare is now supported.

    The performance of functions np.full and np.ones has been improved.

Parallel Accelerator enhancements:

    The parallel=True functionality is enhanced through the addition of the functions numba.set_parallel_chunksize and numba.get_parallel_chunksize to permit a more fine grained scheduling of work defined in a parallel region. There is also support for adjusting the chunksize via a context manager.

    The ID of a thread is now defined to be predictable and within a known range, it is available through calling the function numba.get_thread_id.

    The performance of @stencil s has been improved in both serial and parallel execution.

CUDA enhancements:

    New functionality:

        Self-recursive device functions.

        Vector type support (float4, int2, etc.).

        Shared / local arrays of extension types can now be created.

        Support for linking CUDA C / C++ device functions into Python kernels.

        PTX generation for Compute Capabilities 8.6 and 8.7 - e.g. RTX A series, GTX 3000 series.

        Comparison operations for float16 types.

    Performance improvements:

        Context queries are no longer made during launch configuration.

        Launch configurations are now LRU cached.

        On-disk caching of CUDA kernels is now supported.

    Documentation: many new examples added.

Docs:

    Numba now has an official “mission statement”.

    There’s now a “version support table” in the documentation to act as an easy to use, single reference point, for looking up information about Numba releases and their required/supported dependencies.

General Enhancements:

    Numba imports more quickly in environments with large numbers of packages as it now uses importlib-metadata for querying other packages.

    Emission of chrome tracing output is now supported for the internal compilation event handling system.

    This release is tested and known to work when using the Pyston Python interpreter.

Pull-Requests:

    PR #5209: Use importlib to load numba extensions (Stepan Rakitin Graham Markall stuartarchibald)

    PR #5877: Jitclass builtin methods (Ethan Pronovost Graham Markall)

    PR #6490: Stencil output allocated with np.empty now and new code to initialize the borders. (Todd A. Anderson)

    PR #7005: Make numpy.searchsorted match NumPy when first argument is unsorted (Brandon T. Willard)

    PR #7363: Update cuda.local.array to clarify “simple constant expression” (e.g. no NumPy ints) (Sterling Baird)

    PR #7364: Removes an instance of signed integer overflow undefined behaviour. (Tobias Sargeant)

    PR #7537: Add chrome tracing (Hadia Ahmed Siu Kwan Lam)

    PR #7556: Testhound/fp16 comparison (Michael Collison Graham Markall)

    PR #7586: Support for len on dict.keys, dict.values, and dict.items (Nick Riasanovsky)

    PR #7617: Numba gdb-python extension for printing (stuartarchibald)

    PR #7619: CUDA: Fix linking with PTX when compiling lazily (Graham Markall)

    PR #7621: Add support for linking CUDA C / C++ with @cuda.jit kernels (Graham Markall)

    PR #7625: Combined parfor chunking and caching PRs. (stuartarchibald Todd A. Anderson Siu Kwan Lam)

    PR #7651: DOC: pypi and conda-forge badges (Ray Bell)

    PR #7660: Add support for np.broadcast_arrays (Guilherme Leobas)

    PR #7664: Flatten mangling dicts into a single dict (Graham Markall)

    PR #7680: CUDA Docs: include example calling slow matmul (Graham Markall)

    PR #7682: performance improvements to np.full and np.ones (Rishi Kulkarni)

    PR #7684: DOC: remove incorrect warning in np.random reference (Rishi Kulkarni)

    PR #7685: Don’t convert setitems that have dimension mismatches to parfors. (Todd A. Anderson)

    PR #7690: Implemented np.random.noncentral_chisquare for all size arguments (Rishi Kulkarni)

    PR #7695: IntEnumMember support for np.empty, np.zeros, and np.ones (Benjamin Graham)

    PR #7699: CUDA: Provide helpful error if the return type is missing for declare_device (Graham Markall)

    PR #7700: Support for scalar arguments in Np.ascontiguousarray (Dhruv Patel)

    PR #7703: Ignore unsupported types in ShapeEquivSet._getnames() (Benjamin Graham)

    PR #7704: Move the type annotation pass to post legalization. (stuartarchibald)

    PR #7709: CUDA: Fixes missing type annotation pass following #7704 (stuartarchibald)

    PR #7712: Fixing issue 7693 (stuartarchibald Graham Markall luk-f-a)

    PR #7714: Support for boxing SliceLiteral type (Nick Riasanovsky)

    PR #7718: Bump llvmlite dependency to 0.39.0dev0 for Numba 0.56.0dev0 (stuartarchibald)

    PR #7724: Update URLs in error messages to refer to RTD docs. (stuartarchibald)

    PR #7728: Document that AOT-compiled functions do not check arg types (Graham Markall)

    PR #7729: Handle Omitted/OmittedArgDataModel in DI generation. (stuartarchibald)

    PR #7732: update release checklist following 0.55.0 RC1 (esc)

    PR #7736: Update CHANGE_LOG for 0.55.0 final. (stuartarchibald)

    PR #7740: CUDA Python 11.6 support (Graham Markall)

    PR #7744: Fix issues with locating/parsing source during DebugInfo emission. (stuartarchibald)

    PR #7745: Fix the release year for Numba 0.55 change log entry. (stuartarchibald)

    PR #7748: Fix #7713: Ensure _prng_random_hash return has correct bitwidth (Graham Markall)

    PR #7749: Refactor threading layer priority tests to not use stdout/stderr (stuartarchibald)

    PR #7752: Fix #7751: Use original filename for array exprs (Graham Markall)

    PR #7755: CUDA: Deprecate support for CC < 5.3 and CTK < 10.2 (Graham Markall)

    PR #7763: Update Read the Docs configuration (automatic) (readthedocs-assistant)

    PR #7764: Add dbg_optnone and dbg_extend_lifetimes flags (Siu Kwan Lam)

    PR #7771: Move function unique ID to abi-tags (stuartarchibald Siu Kwan Lam)

    PR #7772: CUDA: Add Support to Creating StructModel Array (Michael Wang)

    PR #7776: Updates coverage.py config (stuartarchibald)

    PR #7777: Remove reference existing issue from GH template. (stuartarchibald)

    PR #7778: Remove long deprecated flags from the CLI. (stuartarchibald)

    PR #7780: Fix sets with reference counted items (Benjamin Graham)

    PR #7782: adding reminder to check on deprecations (esc)

    PR #7783: remove upper limit on Python version (esc)

    PR #7786: Remove dependency on intel-openmp for OSX (stuartarchibald)

    PR #7788: Avoid issue with DI gen for arrayexprs. (stuartarchibald)

    PR #7796: update change-log for 0.55.1 (esc)

    PR #7797: prune README (esc)

    PR #7799: update the release checklist post 0.55.1 (esc)

    PR #7801: add sdist command and umask reminder (esc)

    PR #7804: update local references from master -> main (esc)

    PR #7805: Enhance source line finding logic for debuginfo (Siu Kwan Lam)

    PR #7809: Updates the gdb configuration to accept a binary name or a path. (stuartarchibald)

    PR #7813: Extend parfors test timeout for aarch64. (stuartarchibald)

    PR #7814: CUDA Dispatcher refactor (Graham Markall)

    PR #7815: CUDA Dispatcher refactor 2: inherit from dispatcher.Dispatcher (Graham Markall)

    PR #7817: Update intersphinx URLs for NumPy and llvmlite. (stuartarchibald)

    PR #7823: Add renamed vars to callee scope such that it is self consistent. (stuartarchibald)

    PR #7829: CUDA: Support Enum/IntEnum in Kernel (Michael Wang)

    PR #7833: Add version support information table to docs. (stuartarchibald)

    PR #7835: Fix pickling error when module cannot be imported (idorrington)

    PR #7836: min() and max() support for np.datetime and np.timedelta (Benjamin Graham)

    PR #7837: Initial refactoring of parfor reduction lowering (Siu Kwan Lam)

    PR #7845: change time.time() to time.perf_counter() in docs (Nopileos2)

    PR #7846: Fix CUDA enum vectorize test on Windows (Graham Markall)

    PR #7848: Support for int * list (Nick Riasanovsky)

    PR #7850: CUDA: Pass fastmath compiler flag down to compile_ptx and compile_device; Improve fastmath tests (Michael Wang)

    PR #7855: Ensure np.argmin/no.argmax return type is intp (stuartarchibald)

    PR #7858: CUDA: Deprecate ptx Attribute and Update Tests (Graham Markall Michael Wang)

    PR #7861: Fix a spelling mistake in README (Zizheng Guo)

    PR #7864: Fix cross_iter_dep check. (Todd A. Anderson)

    PR #7865: Remove add_user_function (Graham Markall)

    PR #7866: Support for large numbers of args/kws with Python 3.10 (Nick Riasanovsky)

    PR #7878: CUDA: Remove some deprecated support, add CC 8.6 and 8.7 (Graham Markall)

    PR #7893: Use uuid.uuid4() as the key in serialization. (stuartarchibald)

    PR #7895: Remove use of llvmlite.llvmpy (Andre Masella)

    PR #7898: Skip test_ptds under cuda-memcheck (Graham Markall)

    PR #7901: Pyston compatibility for the test suite (Kevin Modzelewski)

    PR #7904: Support m1 (esc)

    PR #7911: added sys import (Nightfurex)

    PR #7915: CUDA: Fix test checking debug info rendering. (stuartarchibald)

    PR #7918: Add JIT examples to CUDA docs (brandon-b-miller Graham Markall)

    PR #7919: Disallow //= reductions in pranges. (Todd A. Anderson)

    PR #7924: Retain non-modified index tuple components. (Todd A. Anderson)

    PR #7939: Fix rendering in feature request template. (stuartarchibald)

    PR #7940: Implemented np.allclose in numba/np/arraymath.py (Gagandeep Singh)

    PR #7941: Remove debug dump output from closure inlining pass. (stuartarchibald)

    PR #7946: instructions for creating a build environment were outdated (esc)

    PR #7949: Add Cuda Vector Types (Michael Wang)

    PR #7950: mission statement (esc)

    PR #7956: Stop using pip for 3.10 on public ci (Revert “start testing Python 3.10 on public CI”) (esc)

    PR #7957: Use cloudpickle for disk caches (Siu Kwan Lam)

    PR #7958: numpy.clip accept numpy.array for a_min, a_max (Gagandeep Singh)

    PR #7959: Permit a new array model to have a super set of array model fields. (stuartarchibald)

    PR #7961: numba.typed.typeddict.Dict.get uses castedkey to avoid returning default value even if the key is present (Gagandeep Singh)

    PR #7963: remove the roadmap from the sphinx based docs (esc)

    PR #7964: Support for large constant dictionaries in Python 3.10 (Nick Riasanovsky)

    PR #7965: Use uuid4 instead of PID in cache temp name to prevent collisions. (stuartarchibald)

    PR #7971: lru cache for configure call (Tingkai Liu)

    PR #7972: Fix fp16 support for cuda shared array (Michael Collison Graham Markall)

    PR #7986: Small caching refactor to support target cache implementations (Graham Markall)

    PR #7994: Supporting multidimensional arrays in quick sort (Gagandeep Singh Siu Kwan Lam)

    PR #7996: Fix binding logic in @overload_glue. (stuartarchibald)

    PR #7999: Remove @overload_glue for NumPy allocators. (stuartarchibald)

    PR #8003: Add np.broadcast_shapes (Guilherme Leobas)

    PR #8004: CUDA fixes for Windows (Graham Markall)

    PR #8014: Fix support for {real,imag} array attrs in Parfors. (stuartarchibald)

    PR #8016: [Docs] [Very Minor] Make numba.jit boundscheck doc line consistent (Kyle Martin)

    PR #8017: Update FAQ to include details about using debug-only option (Guilherme Leobas)

    PR #8027: Support for NumPy 1.22 (stuartarchibald)

    PR #8031: Support for Numpy BitGenerators PR#1 - Core Generator Support (Kaustubh)

    PR #8035: Fix a couple of typos RE implementation (stuartarchibald)

    PR #8037: CUDA self-recursion tests (Graham Markall)

    PR #8044: Make Python 3.10 kwarg peephole less restrictive (Nick Riasanovsky)

    PR #8046: Fix caching test failures (Siu Kwan Lam)

    PR #8049: support str(bool) syntax (LI Da)

    PR #8052: Ensure pthread is linked in when building for ppc64le. (Siu Kwan Lam)

    PR #8056: Move caching tests from test_dispatcher to test_caching (Graham Markall)

    PR #8057: Fix coverage checking (Graham Markall)

    PR #8064: Rename “nb:run_pass” to “numba:run_pass” and document it. (Siu Kwan Lam)

    PR #8065: Fix PyLowering mishandling starargs (Siu Kwan Lam)

    PR #8068: update changelog for 0.55.2 (esc)

    PR #8077: change return type of np.broadcast_shapes to a tuple (Guilherme Leobas)

    PR #8080: Fix windows test failure due to timeout when the machine is slow poss… (Siu Kwan Lam)

    PR #8081: Fix erroneous array count in parallel gufunc kernel generation. (stuartarchibald)

    PR #8089: Support on-disk caching in the CUDA target (Graham Markall)

    PR #8097: Exclude libopenblas 0.3.20 on osx-arm64 (esc)

    PR #8099: Fix Py_DECREF use in case of error state (for devicearray). (stuartarchibald)

    PR #8102: Combine numpy run_constrained in meta.yaml to the run requirements (Siu Kwan Lam)

    PR #8109: Pin TBB support with respect to incompatible 2021.6 API. (stuartarchibald)

    PR #8118: Update release checklists post 0.55.2 (esc)

    PR #8123: Fix CUDA print tests on Windows (Graham Markall)

    PR #8124: Add explicit checks to all allocators in the NRT. (stuartarchibald)

    PR #8126: Mark gufuncs as having mutable outputs (Andre Masella)

    PR #8133: Fix #8132. Regression in Record.make_c_struct for handling nestedarray (Siu Kwan Lam)

    PR #8137: CUDA: Fix #7806, Division by zero stops the kernel (Graham Markall)

    PR #8142: CUDA: Fix some missed changes from dropping 9.2 (Graham Markall)

    PR #8144: Fix NumPy capitalisation in docs. (stuartarchibald)

    PR #8145: Allow ufunc builder to use previously JITed function (Andre Masella)

    PR #8151: pin NumPy to build 0 of 1.19.2 on public CI (esc)

    PR #8163: CUDA: Remove context query in launch config (Graham Markall)

    PR #8165: Restrict strace based tests to be linux only via support feature. (stuartarchibald)

    PR #8170: CUDA: Fix missing space in low occupancy warning (Graham Markall)

    PR #8175: make build and upload order consistent (esc)

    PR #8181: Fix various typos (luzpaz)

    PR #8187: Update CHANGE_LOG for 0.55.2 (stuartarchibald esc)

    PR #8189: updated version support information for 0.55.2/0.57 (esc)

    PR #8191: CUDA: Update deprecation notes for 0.56. (Graham Markall)

    PR #8192: Update CHANGE_LOG for 0.56.0 (stuartarchibald esc Siu Kwan Lam)

    PR #8195: Make the workqueue threading backend once again fork safe. (stuartarchibald)

    PR #8196: Fix numerical tolerance in parfors caching test. (stuartarchibald)

    PR #8197: Fix isinstance warning check test. (stuartarchibald)

    PR #8203: pin llvmlite 0.39 for public CI builds (esc)

    PR #8255: CUDA: Make numba.cuda.tests.doc_examples.ffi a module to fix #8252 (Graham Markall)

    PR #8274: Update version support table doc for 0.56. (stuartarchibald)

    PR #8275: Update CHANGE_LOG for 0.56.0 final (stuartarchibald)

Authors:

    Andre Masella

    Benjamin Graham

    brandon-b-miller

    Brandon T. Willard

    Gagandeep Singh

    Dhruv Patel

    LI Da

    Todd A. Anderson

    Ethan Pronovost

    esc

    Tobias Sargeant

    Graham Markall

    Guilherme Leobas

    Zizheng Guo

    Hadia Ahmed

    idorrington

    Michael Wang

    Kaustubh

    Kevin Modzelewski

    luk-f-a

    luzpaz

    Kyle Martin

    Nightfurex

    Nick Riasanovsky

    Nopileos2

    Ray Bell

    readthedocs-assistant

    Rishi Kulkarni

    Sterling Baird

    Siu Kwan Lam

    stuartarchibald

    Stepan Rakitin

    Michael Collison

    Tingkai Liu
  • Loading branch information
thor committed Jan 24, 2024
1 parent a603a74 commit df45422
Show file tree
Hide file tree
Showing 3 changed files with 211 additions and 33 deletions.
10 changes: 5 additions & 5 deletions math/py-numba/Makefile
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# $NetBSD: Makefile,v 1.33 2023/08/01 23:20:47 wiz Exp $
# $NetBSD: Makefile,v 1.34 2024/01/24 16:22:36 thor Exp $

DISTNAME= numba-0.55.2
DISTNAME= numba-0.58.1
PKGNAME= ${PYPKGPREFIX}-${DISTNAME}
CATEGORIES= math python
MASTER_SITES= ${MASTER_SITE_PYPI:=n/numba/}
Expand All @@ -10,15 +10,15 @@ HOMEPAGE= https://numba.pydata.org/
COMMENT= NumPy aware dynamic Python compiler using LLVM
LICENSE= 2-clause-bsd

DEPENDS+= ${PYPKGPREFIX}-llvmlite>=0.38.0:../../devel/py-llvmlite
DEPENDS+= ${PYPKGPREFIX}-llvmlite>=0.41.0:../../devel/py-llvmlite
DEPENDS+= ${PYPKGPREFIX}-setuptools-[0-9]*:../../devel/py-setuptools

# OpenMP is not portable
# TODO: enable OpenMP properly, like other packages.
MAKE_ENV+= NUMBA_DISABLE_OPENMP=1

USE_LANGUAGES= c c++

PYTHON_VERSIONS_INCOMPATIBLE= 27 38
PYTHON_VERSIONS_INCOMPATIBLE= 27 312

USE_PKG_RESOURCES= yes

Expand Down
Loading

0 comments on commit df45422

Please sign in to comment.