Glyph rendering optimization using variable length argument expansion #780

jonmmease · 2019-08-15T13:24:11Z

Background

This PR is motivated by the performance characteristics discovered while developing the quadmesh glyph in #779. In particular, see the discussion of fixed vs variable length arguments in #779 (comment).

The key insight is that numba can generate substantially faster glyph rendering code when the functions it compiles take a fixed number of arguments rather than a variable length argument (e.g. *args).

In the glyph rendering code, this variable length argument is usually called *aggs_and_cols and it contains a list of the aggregate arrays that are being populated and the columns that are used as input to the reduction calculations. The length of this argument varies depending on the chosen reduction operation, but the length will remain the same for every render call for a given operation.

Implementation

expand_varargs

This PR adds a datashader.macros module that provides the expand_varargs decorator builder. The builder takes the desired number of arguments to expand to and returns a function decorator. That decorator transforms the AST of the wrapped function, replacing the variable length argument with a fixed number of arguments/variables.

This only makes sense when the only thing the function does with the variable length argument is pass it along to other functions in star-form. For example, applying the decorator like this...

@expand_varargs(2)
def example_fn(a, b, *args):
    print(a, b)
    other_fn(a, b, *args)

would transform the function AST into a function equivalent to

def example_fn(a, b, _0, _1):
    print(a, b)
    other_fn(a, b, _0, _1)

If the variable length argument is used in any other context, an error is raised. For example, an error is raised if example_fn looks inside args:

@expand_varargs(2)
def example_fn(a, b, *args):
    print(a, b, args[0])
    other_fn(a, b, *args)

⬆️ would raise a ValueError.

The whole point of doing this is to transform the input function before it is passed to numba's jit compilation decorator.
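To make the transformation described above concrete, here is a minimal sketch built on the standard-library ast module. It is not the datashader.macros implementation: the helper names expand_varargs_source and _StarargExpander are hypothetical, and for simplicity the sketch works on a source string rather than decorating a live function. It rewrites the signature, replaces star-form uses in calls, and raises a ValueError for any other use of the variable length argument.

```python
import ast
import inspect


class _StarargExpander(ast.NodeTransformer):
    """Replace `*name` in call arguments with a fixed list of names."""

    def __init__(self, vararg_name, new_names):
        self.vararg_name = vararg_name
        self.new_names = new_names

    def visit_Call(self, node):
        self.generic_visit(node)  # rewrite nested calls first
        new_args = []
        for arg in node.args:
            is_target = (
                isinstance(arg, ast.Starred)
                and isinstance(arg.value, ast.Name)
                and arg.value.id == self.vararg_name
            )
            if is_target:
                new_args.extend(
                    ast.Name(id=name, ctx=ast.Load()) for name in self.new_names
                )
            else:
                new_args.append(arg)
        node.args = new_args
        return node


def expand_varargs_source(source, fn_name, n, namespace=None):
    """Hypothetical helper: compile `fn_name` from `source` with *args expanded to n names."""
    tree = ast.parse(source)
    fn_def = next(
        node for node in tree.body
        if isinstance(node, ast.FunctionDef) and node.name == fn_name
    )
    if fn_def.args.vararg is None:
        raise ValueError("function has no variable length argument")
    vararg_name = fn_def.args.vararg.arg
    new_names = ["_%d" % i for i in range(n)]

    # Signature: drop *args and append _0 .. _{n-1} as positional parameters.
    fn_def.args.vararg = None
    fn_def.args.args.extend(
        ast.arg(arg=name, annotation=None) for name in new_names
    )

    # Body: replace every star-form use inside a call.
    _StarargExpander(vararg_name, new_names).visit(fn_def)

    # Any remaining reference means the argument was used in another context.
    for node in ast.walk(fn_def):
        if isinstance(node, ast.Name) and node.id == vararg_name:
            raise ValueError(
                "*%s may only be passed along in star-form" % vararg_name
            )

    ast.fix_missing_locations(tree)
    scope = dict(namespace or {})
    exec(compile(tree, "<expand_varargs_sketch>", "exec"), scope)
    return scope[fn_name]


SOURCE = """
def example_fn(a, b, *args):
    return other_fn(a, b, *args)
"""


def other_fn(*values):
    return sum(values)


expanded = expand_varargs_source(SOURCE, "example_fn", 2, {"other_fn": other_fn})
print(list(inspect.signature(expanded).parameters))
print(expanded(1, 2, 3, 4))
```

The sketch ignores default values, keyword-only arguments, and decorators on the source function, all of which a real implementation would have to handle.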

Glyph.expand_aggs_and_cols

A new expand_aggs_and_cols method has been added to the Glyph base class. This method takes the append function that will perform the reduction operation and returns an expand_varargs decorator configured to expand the *aggs_and_cols argument to the correct number of fixed arguments.
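One plausible way to derive that number is to inspect the append function's signature. The sketch below assumes (this is not confirmed by the PR text) that an append function receives a fixed number of leading coordinate arguments followed by the aggregates and columns. The helper count_expanded_args, its n_leading parameter, and the two append functions are hypothetical.

```python
import inspect


def count_expanded_args(append, n_leading=2):
    """Hypothetical helper: how many arguments *aggs_and_cols must expand to.

    Assumes `append` takes `n_leading` coordinate arguments followed by the
    aggregate arrays and input columns.
    """
    positional_kinds = (
        inspect.Parameter.POSITIONAL_ONLY,
        inspect.Parameter.POSITIONAL_OR_KEYWORD,
    )
    params = inspect.signature(append).parameters.values()
    n_positional = sum(1 for p in params if p.kind in positional_kinds)
    n_expanded = n_positional - n_leading
    if n_expanded < 0:
        raise ValueError("append takes fewer arguments than n_leading")
    return n_expanded


# Hypothetical append functions for two reductions.
def append_count(x, y, agg):
    agg[y][x] += 1


def append_sum(x, y, agg, field):
    agg[y][x] += field


print(count_expanded_args(append_count))
print(count_expanded_args(append_sum))
```

A numba-compiled append function would likely need to be unwrapped first (for example through the dispatcher's py_func attribute) before its signature can be inspected this way.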

Glyph updates

The new decorator has been added to all applicable rendering functions in the points, line, and area glyphs. I think Trimesh could also benefit from this technique, but it doesn't quite follow the same pattern of passing around the *aggs_and_cols argument, so that will take some refactoring. raster doesn't currently use the same glyph/aggregation framework.

Performance results

Here are some benchmark comparisons of this branch with master.

Notebooks:

Before: https://anaconda.org/jonmmease/expand_varargs_before/notebook
After: https://anaconda.org/jonmmease/expand_varargs_after/notebook

I tested points, line, and area glyphs using the count, sum, mean, and std reduction operations. Here is a plot of the results:

Note that the y-axis is time in seconds.

Notice how the improvements grow more significant as the reduction operations get more complex. This corresponds to the *aggs_and_cols argument getting longer.

The speedups for the coming quadmesh glyph should be even more significant than the gains for the area glyph above.

@jbednar @philippjfr


jonmmease · 2019-08-15T16:50:46Z

Update: the Glyph.expand_aggs_and_cols method now returns an identity decorator if the NUMBA_DISABLE_JIT environment variable is set.

This environment variable is used by numba to disable JIT compilation. With this change, functions that are wrapped with both @jit and @expand_varargs will remain unchanged when NUMBA_DISABLE_JIT is set. This makes it possible to debug these functions with a graphical debugger.
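The behavior described above can be sketched as a small wrapper that swaps in an identity decorator. This is an illustration only: jit_aware, identity, and the environ parameter are hypothetical names, and treating any non-empty value as "set" is a simplifying assumption.

```python
import os


def identity(fn):
    """Decorator that returns the function unchanged."""
    return fn


def jit_aware(decorator, environ=None):
    """Hypothetical helper: skip `decorator` when NUMBA_DISABLE_JIT is set.

    `environ` defaults to os.environ and exists only to make the sketch
    easy to exercise without changing the real environment.
    """
    environ = os.environ if environ is None else environ
    if environ.get("NUMBA_DISABLE_JIT"):
        return identity
    return decorator


def tag_expanded(fn):
    """Stand-in for an AST-rewriting decorator such as expand_varargs(n)."""
    def wrapper(*args, **kwargs):
        return ("expanded", fn(*args, **kwargs))
    return wrapper


def render(x):
    return x


debuggable = jit_aware(tag_expanded, {"NUMBA_DISABLE_JIT": "1"})(render)
optimized = jit_aware(tag_expanded, {})(render)
print(debuggable is render)
print(optimized(3))
```

Because the function object is returned untouched in the first case, its source lines still match what a debugger displays.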

jonmmease · 2019-08-15T17:19:28Z

@philippjfr this is ready to go apart from that one Travis test flaking out again. Could you restart it?

Update: Never mind, looks like I can restart it myself.

jonmmease · 2019-08-15T19:40:32Z

all green. merging

jonmmease added 6 commits August 15, 2019 05:30

add expand_varargs macro and tests

4285e46

This is a decorator that operates on functions that input a variable length argument (e.g. *args), and transforms the function's AST into a function that inputs a fixed number of arguments instead.

Add expand_aggs_and_cols glyph method

49726b2

Given an append function, this builds an expand_varargs decorator appropriate for expanding the *aggs_and_cols argument that gets passed through various glyph functions.

Use expand_varargs in points, area, and line glyphs

253b3e4

This gives numba more information to optimize on, which can significantly improve performance when many aggregates are present.

Python 2 compatibility

0ac3326

Add note that this new decorator can be removed in the future if needed

70a6c56

Don't perform expansion if the NUMBA_DISABLE_JIT environment variable…

a2fb052

… is set. The @jit decorator already follows this environment variable. Now, setting NUMBA_DISABLE_JIT=1 makes it possible to use a debugger inside functions that are decorated with @jit and @expand_varargs.

jonmmease merged commit ce9ca6e into master Aug 15, 2019

jonmmease mentioned this pull request Aug 16, 2019

Add quadmesh glyph with rectilinear and curvilinear support #779

Merged

maximlt deleted the argexpansion branch December 25, 2021 17:21
