more inlining #3796

vtjnash · 2013-07-23T06:48:41Z

This pull requests adds inlining and improved call site type specialization. It separates function calls of type unions and provides support for multi-line inlining. (It also attempts to inline Union, but that part is causing the tests to fail)

While the standard inlining heuristic has not been updated, you can test the multiline inlining by forcing a function to be inlined by adding an :inline keyword:

function abs(x) :inline
    if x > 0
        x
    else
        -x
    end
end

timholy · 2013-07-23T09:26:11Z

Fantastic! This is something I've been wanting for a long time. Amazing work.

This will fix both #3030 and #1106.

lindahua · 2013-07-23T12:06:17Z

This is awesome! It may hopefully address many of the challenges we have been facing (e.g. make map, broadcast and reduce much more performant). I am looking forward to this.

quinnj · 2013-07-23T12:12:52Z

How about going C99 style and making inline a full keyword to be used in place of function?

This is exciting stuff!

StefanKarpinski · 2013-07-23T16:45:57Z

I think the :inline thing is just a temporary hack to avoid messing with the parser for now. I've also brought up the possibility of call-site inline annotation in addition to the traditional definition-site inline annotation.

staticfloat · 2013-07-23T17:03:37Z

Is there any documentation on the :inline thing? I'd like to play around with attaching data to functions to see if it would be as natural as I'd want it to be. (Useful for documentation, descriptions of tests for codespeed, etc....)

JeffBezanson · 2013-07-23T17:56:46Z

I suggest using macro syntax instead of adding new keywords. This allows @inline function f() to declare a function as inline, and @inline f(x) to do call site inlining. The call site version will require more work, since we'd need something like an inline expression wrapping the call. For the function def version, I'd like to add a meta expression head, for adding extra declarations and info to functions more generally. Then the @inline macro can simply insert (meta inline) as one of the first statements in the function.

vtjnash · 2013-07-23T18:38:25Z

@JeffBezanson The @inline macro sounds like it is doing the same as the current implementation, other than wrapping it in an Expr type?

How about:
@inline f(y) = @inline g(y)

becomes
Expr(:function, Expr(:call, :f, :y), Expr(:block, Expr(:meta, :inline), Expr(:call_inline, :g, :y)))

JeffBezanson · 2013-07-23T18:52:03Z

Yes. But the value of wrapping it in the meta expr is you can easily identify all such non-code things. I think eventually we should use it for line numbers too, but that's too disruptive right now.

Adding :call_inline is not workable since so, so many things look for call exprs. Expr(:inline, Expr(:call, :g, :y)) or Expr(:withmeta, Expr(:call, :g, :y), :inline) is probably easier to work with. Note it should not use meta, since those should never contain executing code.

vtjnash · 2013-07-23T20:02:01Z

I like the :withmeta syntax. And it should be a lot less work than trying to fixup all locations of :call, while being more general.

Perhaps meta should be part of an Expr: then we could write the following:
Expr(:call, :g, :y; meta=(:inline,))

Expr(:function, Expr(:call, :f, :y; meta=(:inline,)), Expr(:block, Expr(:call, :g, :y; meta=(:inline,))))

vtjnash · 2013-08-15T04:19:36Z

@JeffBezanson bump. should I add a meta field to every Expr? It could probably have any of the following layouts:
(:line, 123, (:file, "hello world.jl", (:inline, :always)))
{(line,123),(file,"hello world.jl"),(:inline,:always))
{:line=>123, :file=>"hello world.jl", :inline=>:always}

Or how would you prefer to handle this?

JeffBezanson · 2013-08-15T05:23:02Z

For now, let's remove the :inline thing, deal with declarations later, and separate the inlining heuristic into its own function. Then we can experiment with tweaking it. (after all, inline declarations should not be required to get good performance)
Then we can probably merge the multi-line inlining part pretty easily.

StefanKarpinski · 2013-08-15T13:10:53Z

^^ pro move: break the change into smaller, less disruptive pieces.

On Thursday, August 15, 2013, Jeff Bezanson wrote:

For now, let's remove the :inline thing, deal with declarations later,
and separate the inlining heuristic into its own function. Then we can
experiment with tweaking it. (after all, inline declarations should not be
required to get good performance)
Then we can probably merge the multi-line inlining part pretty easily.

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/3796#issuecomment-22685797
.

Keno · 2013-10-02T21:03:26Z

@vtjnash Would you mind rebasing this? I wanted to play with inling a bit, but didn't want to duplicate any work.

vtjnash · 2013-11-19T05:05:47Z

@loladiro @JeffBezanson this has been rebased. please test and merge when you are happy

JeffBezanson · 2013-11-19T16:35:17Z

Currently this patch undoes my optimization for many-argument operators like + (issue #4374). This probably happened as part of the rebase. This has to be fixed.

Ideally the multi-statement inlining and union-splitting would be separate patches, as well.

vtjnash · 2013-11-20T07:03:16Z

Yeah, it was partially that, and partially that I was being too aggressive at creating local variables while inlining so that optimization could no longer be applied. I fixed that stuff, and reverted the inlining_pass function for simplicity (and because it seems the order of the expanding apply vs calling inlineable really matters).

I also found that I was accidentally inlining some rather large functions, which is why the test were so slow. That's fixed now also.

vtjnash · 2013-11-28T03:51:41Z

@JeffBezanson bump. i think i've removed all the controversial stuff. now it just does inlining based on call1 and inline_worthy, nothing more.

OK to merge? or do you have other suggestions first?

ViralBShah · 2013-11-28T05:27:20Z

Why does github not like automatically merging this? Some rebasing required?

vtjnash · 2013-11-28T07:18:58Z

Probably not. I think github is just refusing to do the file merge caused by b4fa861

timholy · 2014-03-13T09:59:09Z

I'm assuming that at this point this is 0.4 material. When it does come to the fore, here's one vote for considering restoring the :inline keyword. The reasons are an elaboration of my suspicion about notions of "big functions" and "little functions" (see above):

The heuristic is very crude. For example, my version that did get inlined presumably was compiled by LLVM to the same thing as for the version that wasn't being inlined. The distinction was that the original version had (due to auto-generation) lots of statements like if 1 == 3; nothing ; elseif 2 == 3; nothing; elseif 3 == 3; # do something; end. Those add to the expression-count, and hence make it look "big" by the heuristic that's in place, but they should disappear once LLVM has had a chance to do its magic. I was able to add a feature to Cartesian that allowed it to generate prettier and more compact expressions---which is not a bad thing---but it's important to recognize that fundamentally this changed nothing, yet had a dramatic improvement on performance due to the decision about whether to inline.

One might improve the heuristic, but at what point does it simply become its own compiler?
In general, for the vast majority of iterators I suspect that next and done should probably be inlined. How would you feel if we didn't inline them in for i = 1:1000; # do something; end? For a multidimensional iterator, usually only the inner-loop index will need to be incremented, so it's exactly analogous to this case. Moreover, this is just as true for a 6d iterator as it is for a 2d iterator. But iteration in 6d will necessitate a next function body at least 3x as long as a 2d iterator, and might easily cross threshold for not being inlined by whatever heuristic we use. Nevertheless, in reality it's no less worthy of inlining.

I suspect there's some justifiable reluctance to give users too much control over inlining: witness the number of recommendations floating around to avoid overusing inline in C. I agree that you don't want to inline everything into everything. But it's probably also fair to say that one could view Cartesian partially as an exercise in circumventing the problems that arise from not being able to inline things that you really, really need to inline. Some things about the language will be a lot cleaner if we don't have to worry about whether something gets inlined.

All that said, in practice many of our current problems will be solved even by this version. I'd much rather see this merged than to have it held up by a lack of consensus about an :inline keyword; please don't misinterpret these comments as suggesting otherwise.

lindahua · 2014-03-13T13:31:32Z

+1 for @timholy's argument.

I have often found it quite frustrated to see the performance penalty caused by failure of inlining. In performance-demanding settings (which is not uncommon in scientific computing), this virtually forces me to use macros or more verbose codes instead of more elegant abstractions (e.g. iterators).

I support merging this soon and then debating how we may provide ways for developers to specify what to inline.

JeffBezanson · 2014-03-26T17:29:27Z

base/inference.jl

    if is_known_call(e, Core.Intrinsics.ccall, sv)
-        i0 = 3
+        i0 = 5


The return type and param type arguments don't run in enclosing_ast as
expected by inlineable, so we need to skip them also

On Wednesday, March 26, 2014, Jeff Bezanson notifications@github.com
wrote:

In base/inference.jl:

if is_known_call(e, Core.Intrinsics.ccall, sv)

i0 = 3

i0 = 5

Why 5?

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/3796/files#r10987984
.

multi-line inlining is now supported (currently triggered by various simple heuristics)

… variable name

…able in a different module, since `Mod.X=V` isn't allowed

… local in the enclosing ast

…aluation of its arguments if the argument was pure

…list

…ction argument list

…piling a function

…should be fdwatcher_init.

JeffBezanson · 2014-04-02T19:21:15Z

Did I just rebase this? Yes. What can that possibly mean? Who knows.

timholy · 2014-04-02T19:46:59Z

Don't mind me, I'll just stand here in the corner whistling, peering over my shoulder occasionally.

more inlining

jiahao · 2014-04-02T23:27:05Z

🍬

tknopp · 2014-04-03T12:34:12Z

A big "thank you" for this wonderful PR.

timholy mentioned this pull request Jul 24, 2013

RFC: Implement a Counter type for Base.Collections #3702

Closed

vtjnash mentioned this pull request Jul 28, 2013

Return value ignored #3821

Closed

timholy mentioned this pull request Aug 8, 2013

max() could be faster/inlining ternary operations #3030

Closed

vtjnash mentioned this pull request Aug 12, 2013

code_lowered error message #4023

Closed

stevengj mentioned this pull request Aug 13, 2013

RFC: use pairwise summation for sum, cumsum, and cumprod #4039

Merged

simonster mentioned this pull request Aug 14, 2013

Clean up basic functions like mean and std JuliaData/DataFrames.jl#325

Closed

timholy mentioned this pull request Oct 9, 2013

Faster linear indexing for SubArrays, dims 1-5 #4427

Merged

simonster mentioned this pull request Oct 13, 2013

Can't broadcast to DataArrays JuliaData/DataFrames.jl#377

Closed

staticfloat mentioned this pull request Nov 19, 2013

Roadmap for 0.3 #4853

Closed

21 tasks

ghost assigned JeffBezanson Nov 25, 2013

kmsquire mentioned this pull request Dec 3, 2013

Inline constant arrays #5024

Closed

vtjnash mentioned this pull request Mar 13, 2014

Iterator performance #6137

Closed

JeffBezanson reviewed Mar 26, 2014
View reviewed changes

vtjnash and others added 11 commits April 2, 2014 14:55

allow inlining of more functions

9039fdd

multi-line inlining is now supported (currently triggered by various simple heuristics)

fix vargs expansion logic during inlining pass when also used a local…

780e8e9

… variable name

abort when trying to inline a function which assigns to a global vari…

d452498

…able in a different module, since `Mod.X=V` isn't allowed

fix an issue where a global in the inlined ast could be shadowed by a…

144ddac

… local in the enclosing ast

resolve an issue where inlining was allowed to change the order of ev…

c4c8cb5

…aluation of its arguments if the argument was pure

improve list of effect_free expressions to include immutable types

b2d0271

add the rest of the _pure_builtin functions (from Intrinsics) to the …

377904b

…list

fix order-of-execution after inlining a multiline function into a fun…

9cc7de9

…ction argument list

avoid inlining in any part of an internal method definition while com…

8ef82f6

…piling a function

fix inlining for & special-syntax in ccall

baa1acc

fix deprecated a[i:] syntax and a merge error where fdwatcher_reinit …

4d88d8e

…should be fdwatcher_init.

JeffBezanson added 2 commits April 2, 2014 15:48

more compact detection of pure builtins

2cb2c4a

make inlining heuristic more conservative

ca47867

JeffBezanson added a commit that referenced this pull request Apr 2, 2014

Merge pull request #3796 from JuliaLang/jn/callmore

a2acf88

more inlining

JeffBezanson merged commit a2acf88 into master Apr 2, 2014

simonster mentioned this pull request Apr 3, 2014

codegen regression for simple loops #6382

Closed

vtjnash deleted the jn/callmore branch April 3, 2014 00:47

timholy mentioned this pull request Apr 6, 2014

WIP: Add Cartesian product iteration. Fixes #1917 #6437

Closed

jiahao mentioned this pull request Apr 7, 2014

Recent compiler performance regressions for test/linalg1 #6460

Closed

timholy mentioned this pull request Aug 1, 2014

extensible bounds checking removal #7799

Closed

timholy mentioned this pull request Sep 10, 2014

Add inline macro via :meta expressions #8297

Merged

timholy mentioned this pull request Oct 24, 2014

Extend :meta to allow saving extra information. #8779

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

more inlining #3796

more inlining #3796

vtjnash commented Jul 23, 2013

timholy commented Jul 23, 2013

lindahua commented Jul 23, 2013

quinnj commented Jul 23, 2013

StefanKarpinski commented Jul 23, 2013

staticfloat commented Jul 23, 2013

JeffBezanson commented Jul 23, 2013

vtjnash commented Jul 23, 2013

JeffBezanson commented Jul 23, 2013

vtjnash commented Jul 23, 2013

vtjnash commented Aug 15, 2013

JeffBezanson commented Aug 15, 2013

StefanKarpinski commented Aug 15, 2013

Keno commented Oct 2, 2013

vtjnash commented Nov 19, 2013

JeffBezanson commented Nov 19, 2013

vtjnash commented Nov 20, 2013

vtjnash commented Nov 28, 2013

ViralBShah commented Nov 28, 2013

vtjnash commented Nov 28, 2013

timholy commented Mar 13, 2014

lindahua commented Mar 13, 2014

JeffBezanson Mar 26, 2014

vtjnash Mar 26, 2014

JeffBezanson commented Apr 2, 2014

timholy commented Apr 2, 2014

jiahao commented Apr 2, 2014

tknopp commented Apr 3, 2014

more inlining #3796

more inlining #3796

Conversation

vtjnash commented Jul 23, 2013

timholy commented Jul 23, 2013

lindahua commented Jul 23, 2013

quinnj commented Jul 23, 2013

StefanKarpinski commented Jul 23, 2013

staticfloat commented Jul 23, 2013

JeffBezanson commented Jul 23, 2013

vtjnash commented Jul 23, 2013

JeffBezanson commented Jul 23, 2013

vtjnash commented Jul 23, 2013

vtjnash commented Aug 15, 2013

JeffBezanson commented Aug 15, 2013

StefanKarpinski commented Aug 15, 2013

Keno commented Oct 2, 2013

vtjnash commented Nov 19, 2013

JeffBezanson commented Nov 19, 2013

vtjnash commented Nov 20, 2013

vtjnash commented Nov 28, 2013

ViralBShah commented Nov 28, 2013

vtjnash commented Nov 28, 2013

timholy commented Mar 13, 2014

lindahua commented Mar 13, 2014

JeffBezanson Mar 26, 2014

Choose a reason for hiding this comment

vtjnash Mar 26, 2014

Choose a reason for hiding this comment

JeffBezanson commented Apr 2, 2014

timholy commented Apr 2, 2014

jiahao commented Apr 2, 2014

tknopp commented Apr 3, 2014