Optimize insertion sort #40807

ghost · 2017-03-24T23:35:22Z

This change slightly changes the main iteration loop so that LLVM can optimize it more efficiently.

Benchmark:

name                                   before ns/iter   after ns/iter    diff ns/iter   diff %
slice::sort_unstable_small_ascending   39 (2051 MB/s)   38 (2105 MB/s)             -1   -2.56%
slice::sort_unstable_small_big_random  579 (2210 MB/s)  575 (2226 MB/s)            -4   -0.69%
slice::sort_unstable_small_descending  80 (1000 MB/s)   70 (1142 MB/s)            -10  -12.50%
slice::sort_unstable_small_random      396 (202 MB/s)   386                       -10   -2.53%

The benchmark is not a fluke. I can see that performance on small_descending is consistently better after this change. I'm not 100% sure why this makes things faster, but my guess would be that v.len()+1 to the compiler looks like it could in theory overflow.

This change slightly changes the main iteration loop so that LLVM can optimize it more efficiently. Benchmark: name before ns/iter after ns/iter diff ns/iter diff % slice::sort_unstable_small_ascending 39 (2051 MB/s) 38 (2105 MB/s) -1 -2.56% slice::sort_unstable_small_big_random 579 (2210 MB/s) 575 (2226 MB/s) -4 -0.69% slice::sort_unstable_small_descending 80 (1000 MB/s) 70 (1142 MB/s) -10 -12.50% slice::sort_unstable_small_random 396 (202 MB/s) 386 -10 -2.53%

rust-highfive · 2017-03-24T23:35:26Z

r? @aturon

(rust_highfive has picked a reviewer for you, use r? to override)

ghost · 2017-03-24T23:38:54Z

r? @alexcrichton

alexcrichton · 2017-03-25T02:03:52Z

@bors: r+

bors · 2017-03-25T02:03:52Z

📌 Commit 2c816f7 has been approved by alexcrichton

…=alexcrichton Optimize insertion sort This change slightly changes the main iteration loop so that LLVM can optimize it more efficiently. Benchmark: ``` name before ns/iter after ns/iter diff ns/iter diff % slice::sort_unstable_small_ascending 39 (2051 MB/s) 38 (2105 MB/s) -1 -2.56% slice::sort_unstable_small_big_random 579 (2210 MB/s) 575 (2226 MB/s) -4 -0.69% slice::sort_unstable_small_descending 80 (1000 MB/s) 70 (1142 MB/s) -10 -12.50% slice::sort_unstable_small_random 396 (202 MB/s) 386 -10 -2.53% ``` The benchmark is not a fluke. I can see that performance on `small_descending` is consistently better after this change. I'm not 100% sure why this makes things faster, but my guess would be that `v.len()+1` to the compiler looks like it could in theory overflow.

Rollup of 11 pull requests - Successful merges: #40347, #40501, #40516, #40524, #40540, #40642, #40683, #40764, #40778, #40807, #40809 - Failed merges: #40771

nagisa · 2017-03-25T11:56:03Z

Since this is internal to libstd/core, could you check whether the inclusive range syntax makes things better as well?

i.e. 2 ... v.len()

ghost · 2017-03-25T14:02:18Z

@nagisa Inclusive range syntax makes performance slightly worse, actually...

With the old insertion sort and with inclusive range syntax there's a bound check at the beginning of insertion sort. This PR removes the bound check.

If you want to play with this, here's a playpen link.

bors · 2017-03-25T14:34:14Z

⌛ Testing commit 2c816f7 with merge 04e47d7...

arielb1 · 2017-03-25T14:39:09Z

try to fix the build on emscripten #40821
@bors retry

…=alexcrichton Optimize insertion sort This change slightly changes the main iteration loop so that LLVM can optimize it more efficiently. Benchmark: ``` name before ns/iter after ns/iter diff ns/iter diff % slice::sort_unstable_small_ascending 39 (2051 MB/s) 38 (2105 MB/s) -1 -2.56% slice::sort_unstable_small_big_random 579 (2210 MB/s) 575 (2226 MB/s) -4 -0.69% slice::sort_unstable_small_descending 80 (1000 MB/s) 70 (1142 MB/s) -10 -12.50% slice::sort_unstable_small_random 396 (202 MB/s) 386 -10 -2.53% ``` The benchmark is not a fluke. I can see that performance on `small_descending` is consistently better after this change. I'm not 100% sure why this makes things faster, but my guess would be that `v.len()+1` to the compiler looks like it could in theory overflow.

Rollup of 7 pull requests - Successful merges: #40642, #40734, #40740, #40771, #40807, #40820, #40821 - Failed merges:

rust-highfive assigned aturon Mar 24, 2017

rust-highfive assigned alexcrichton and unassigned aturon Mar 24, 2017

alexcrichton mentioned this pull request Mar 25, 2017

Rollup of 11 pull requests #40810

Closed

bors added a commit that referenced this pull request Mar 25, 2017

Auto merge of #40810 - alexcrichton:rollup, r=alexcrichton

40ae49a

Rollup of 11 pull requests - Successful merges: #40347, #40501, #40516, #40524, #40540, #40642, #40683, #40764, #40778, #40807, #40809 - Failed merges: #40771

bors added a commit that referenced this pull request Mar 25, 2017

Auto merge of #40810 - alexcrichton:rollup, r=alexcrichton

ed9108d

Rollup of 11 pull requests - Successful merges: #40347, #40501, #40516, #40524, #40540, #40642, #40683, #40764, #40778, #40807, #40809 - Failed merges: #40771

frewsxcv mentioned this pull request Mar 25, 2017

Rollup of 6 pull requests #40825

Closed

frewsxcv mentioned this pull request Mar 25, 2017

Rollup of 7 pull requests #40826

Merged

bors added a commit that referenced this pull request Mar 26, 2017

Auto merge of #40826 - frewsxcv:rollup, r=frewsxcv

7846dbe

Rollup of 7 pull requests - Successful merges: #40642, #40734, #40740, #40771, #40807, #40820, #40821 - Failed merges:

bors merged commit 2c816f7 into rust-lang:master Mar 26, 2017

ghost deleted the optimize-insertion-sort branch March 26, 2017 16:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize insertion sort #40807

Optimize insertion sort #40807

ghost commented Mar 24, 2017 •

edited by ghost

Loading

rust-highfive commented Mar 24, 2017

ghost commented Mar 24, 2017

alexcrichton commented Mar 25, 2017

bors commented Mar 25, 2017

nagisa commented Mar 25, 2017

ghost commented Mar 25, 2017

bors commented Mar 25, 2017

arielb1 commented Mar 25, 2017

Optimize insertion sort #40807

Optimize insertion sort #40807

Conversation

ghost commented Mar 24, 2017 • edited by ghost Loading

rust-highfive commented Mar 24, 2017

ghost commented Mar 24, 2017

alexcrichton commented Mar 25, 2017

bors commented Mar 25, 2017

nagisa commented Mar 25, 2017

ghost commented Mar 25, 2017

bors commented Mar 25, 2017

arielb1 commented Mar 25, 2017

ghost commented Mar 24, 2017 •

edited by ghost

Loading