static array of zeroes can take minutes to lint check #55795

kazcw · 2018-11-08T20:34:45Z

This program takes an inordinate amount of time and memory to compile (it's worse if the array is actually used, but this is a minimal test case):

const SIZE: usize = 1 << 30;
static SLICE: [u8; SIZE] = [0u8; SIZE];
fn main() {}

I was hoping this would be a viable way to get a big chunk of .bss so I don't have to depend on an mmap crate to get a lot of static zeroes, but the effect on compile time makes that impractical.

If I run rustc -Z time-passes, the big offender is:
time: 47.666; rss: 2486MB lint checking

Although the time (and memory) is reportedly spent checking lints, setting --cap-lints allow doesn't make any difference. I'm guessing the "lint checking" pass includes some things that need to be checked even if lints are suppressed? If not, it seems like a separate issue is that a lot of work could be saved with cap-lints set (e.g. when compiling dependencies).

Here are the top results from perf report, for rustc 1.32.0-nightly (25a42b2ce 2018-11-07):

  31.62%  rustc     librustc_mir-714845413a99e6ff.so              [.] <rustc_mir::interpret::memory::Memory<'a, 'mir, 'tcx, M>>::copy_repeatedly
  22.94%  rustc     librustc_mir-714845413a99e6ff.so              [.] <core::iter::Map<I, F> as core::iter::iterator::Iterator>::fold
   7.32%  rustc     librustc-0eb8c117db37850c.so                  [.] rustc::mir::interpret::UndefMask::grow
   7.31%  rustc     librustc-0eb8c117db37850c.so                  [.] rustc::mir::interpret::UndefMask::set_range
   6.94%  rustc     libc-2.27.so                                  [.] __memmove_sse2_unaligned_erms
   5.91%  rustc     librustc_mir-714845413a99e6ff.so              [.] <rustc_mir::interpret::memory::Memory<'a, 'mir, 'tcx, M>>::check_bytes

So it looks like miri is actually creating the array and folding over it. I know it's not going to find any problems, because I have ECC memory 😆.

There are already some bugs relating to slow compilation of large arrays, with the most relevant I could find being #37155, #49330. I think this is separate from those cases because:

the input in this case is a [0; _] array, whereas the others initialize arrays from sequences of elements
those cases seem to refer to superlinear runtime; this issue appears roughly linear in time and space and only becomes noticeable for much larger arrays
the bottleneck in this case occurs during lint checking, which I didn't see in any other array performance bugs

The text was updated successfully, but these errors were encountered:

oli-obk · 2019-05-13T12:23:10Z

Have you seen any improvement after #58556 ?

jonas-schievink · 2019-05-13T12:28:59Z

rustc 1.34.0-nightly (146aa60f3 2019-02-18)   0:21.37elapsed
rustc 1.36.0-nightly (08bfe1612 2019-05-02)   0:07.22elapsed

oli-obk · 2019-05-14T15:56:11Z

I've wondered before whether we should special-case constants whose bits are all defined and zero. I'll open a discussion in the const-eval zulip channel.

dbdr · 2019-11-14T07:45:04Z

I also ran into this problem. @oli-obk, has that discussion happened?

oli-obk · 2019-11-14T11:02:37Z

Kind of. We will have special treatment for constants that consist solely of undef bytes (see #62655). This scheme can likely be extended to support all bytes being zero withour requiring time or space overhead.

jyn514 · 2021-01-21T04:22:21Z

If not, it seems like a separate issue is that a lot of work could be saved with cap-lints set (e.g. when compiling dependencies).

I opened #74704 for this.

jyn514 · 2021-01-21T04:23:59Z

Kind of. We will have special treatment for constants that consist solely of undef bytes (see #62655). This scheme can likely be extended to support all bytes being zero withour requiring time or space overhead.

#62655 was closed, are there still plans to implement this?

oli-obk · 2021-01-21T09:21:44Z

are there still plans to implement this?

Not right now... there are other more important construction sites in const eval.

Optimize large array creation in const-eval This changes repeated memcpy's to a memset for the case that we're propagating a single byte into a region of memory. It also optimizes the element-by-element copies to have a tighter loop; I'm pretty sure the old code was actually doing a multiply within each loop iteration. For an 8GB array (`static SLICE: [u8; SIZE] = [0u8; 1 << 33];`) this takes us from ~23 seconds to ~6 seconds locally, which is spent roughly 50/50 in (a) memset to zero and (b) memcpy of the original place into a new place, when popping stack frame. The latter seems hard to avoid but is a big memcpy (since we're copying the type rather than initializing a region, so it's pretty fast), and the first is as good as it's going to get without special casing constant-valued arrays. Closes rust-lang#55795. (That issue's references to lint checking don't appear true anymore, but I think this closes that case as something that is slow due to *time* pretty fully. An 8GB array taking only 6 seconds feels reasonable enough to not merit further tracking).

Optimize large array creation in const-eval This changes repeated memcpy's to a memset for the case that we're propagating a single byte into a region of memory. It also optimizes the element-by-element copies to have a tighter loop; I'm pretty sure the old code was actually doing a multiply within each loop iteration. For an 8GB array (`static SLICE: [u8; SIZE] = [0u8; 1 << 33];`) this takes us from ~23 seconds to ~6 seconds locally, which is spent roughly 50/50 in (a) memset to zero and (b) memcpy of the original place into a new place, when popping stack frame. The latter seems hard to avoid but is a big memcpy (since we're copying the type rather than initializing a region, so it's pretty fast), and the first is as good as it's going to get without special casing constant-valued arrays. Closes rust-lang/rust#55795. (That issue's references to lint checking don't appear true anymore, but I think this closes that case as something that is slow due to *time* pretty fully. An 8GB array taking only 6 seconds feels reasonable enough to not merit further tracking).

estebank added I-slow Issue: Problems and improvements with respect to performance of generated code. A-MIR Area: Mid-level IR (MIR) - https://blog.rust-lang.org/2016/04/19/MIR.html labels Nov 8, 2018

ishitatsuyuki added I-compiletime Issue: Problems and improvements with respect to compile times. and removed I-slow Issue: Problems and improvements with respect to performance of generated code. labels Nov 9, 2018

dotdash added the A-const-eval Area: Constant evaluation, covers all const contexts (static, const fn, ...) label Jan 6, 2019

pnkfelix added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Apr 12, 2019

oli-obk mentioned this issue Jan 20, 2021

rustc hangs when generating very large arrays #81188

Closed

oli-obk mentioned this issue Mar 28, 2022

Don't allocate trailing uninit bits in the InitMap of CTFE Allocations #94936

Closed

Mark-Simulacrum mentioned this issue Jan 17, 2024

Optimize large array creation in const-eval #120069

Merged

bors closed this as completed in 16fadb3 Jan 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

static array of zeroes can take minutes to lint check #55795

static array of zeroes can take minutes to lint check #55795

kazcw commented Nov 8, 2018 •

edited

Loading

oli-obk commented May 13, 2019

jonas-schievink commented May 13, 2019

oli-obk commented May 14, 2019

dbdr commented Nov 14, 2019

oli-obk commented Nov 14, 2019 •

edited

Loading

jyn514 commented Jan 21, 2021

jyn514 commented Jan 21, 2021

oli-obk commented Jan 21, 2021

static array of zeroes can take minutes to lint check #55795

static array of zeroes can take minutes to lint check #55795

Comments

kazcw commented Nov 8, 2018 • edited Loading

oli-obk commented May 13, 2019

jonas-schievink commented May 13, 2019

oli-obk commented May 14, 2019

dbdr commented Nov 14, 2019

oli-obk commented Nov 14, 2019 • edited Loading

jyn514 commented Jan 21, 2021

jyn514 commented Jan 21, 2021

oli-obk commented Jan 21, 2021

kazcw commented Nov 8, 2018 •

edited

Loading

oli-obk commented Nov 14, 2019 •

edited

Loading