Add an env var to artificially limit the stack size #941

konstin · 2024-01-16T16:24:23Z

By default, windows has a stack size limit of 1MB which we run against in debug without any explicit culprit. A new environment variable PUFFIN_STACK_SIZE allows setting an artificially smaller stack size.

BurntSushi

Can we create an issue for looking into this? The thing I'm worried about here is that we have 1MB+ stacks lying around, and that probably isn't a great thing. With that said, if this fixes things on Windows I'm fine with it as a temporary work-around I think.

BurntSushi · 2024-01-16T17:25:22Z

crates/puffin-cli/src/main.rs

+            tokio::runtime::Builder::new_multi_thread()
+                .enable_all()
+                .build()
+                .expect("Failed building the Runtime")


The tokio docs don't make it clear under what conditions this routine fails, but it returns an io::Error on failure. I do wonder under what conditions this can fail, and perhaps it might be better to not panic? On the other hand, if unwrap() is what #[tokio::main] was already doing, then I guess it's probably fine.

#[tokio::main] async fn main() { println!("Hello, world!"); }

expands to

#![feature(prelude_import)] #[prelude_import] use std::prelude::rust_2021::*; #[macro_use] extern crate std; fn main() { let body = async { { ::std::io::_print(format_args!("Hello, world!\n")); }; }; #[allow(clippy::expect_used, clippy::diverging_sub_expression)] { return tokio::runtime::Builder::new_multi_thread() .enable_all() .build() .expect("Failed building the Runtime") .block_on(body); } }

so i think that's what you're expected to do.

Features can change behaviour, e.g. the pyo3 feature setting linker options and #941 setting the stack size. Instead of running tests with all features, we list the features required for tests. The old command ``` cargo nextest run --all --all-features --status-level skip --failure-output immediate-final --no-fail-fast -j 12 ``` and the new command ``` cargo nextest run --all --features all-tests --status-level skip --failure-output immediate-final --no-fail-fast -j 12 ``` run the same tests. This includes a fix for the pep508_rs `evaluate_extras_and_python_version` doc test, which is only run by `cargo test` but not by `cargo nextest`. This is required for #941.

charliermarsh · 2024-01-18T13:39:55Z

@konstin - Do you only see these failures in debug, or release too?

konstin · 2024-01-18T13:42:48Z

Only in debug mode

BurntSushi

Nice I like it.

BurntSushi

Actually, given #963, I now realize I'm less a fan of this approach. It'd be really nice to make --all-features continue to work. Otherwise, it breaks things like this. Requiring a non-standard way to enable all features just so we can test with a smaller stack size seems non-ideal to me.

What do you think about configuring this with an environment variable instead?

BurntSushi · 2024-01-18T16:21:44Z

crates/puffin-cli/src/main.rs

+        .enable_all()
+        .build()
+        .expect("Failed building the Runtime")
+        .block_on(inner())


Do you need to also set the stack size here? i.e., https://docs.rs/tokio/latest/tokio/runtime/struct.Builder.html#method.thread_stack_size

BurntSushi

Love it!

…e futures (#947) Windows has a default stack size of 1MB, which makes puffin often fail with stack overflows. The PR reduces stack size by three changes: * Boxing `File` in `Dist`, reducing the size from 496 to 240. * Boxing the largest futures. * Boxing `CachePolicy` ## Method Debugging happened on linux using #941 to limit the stack size to 1MB. Used ran the command below. ``` RUSTFLAGS=-Zprint-type-sizes cargo +nightly build -p puffin-cli -j 1 > type-sizes.txt && top-type-sizes -w -s -h 10 < type-sizes.txt > sizes.txt ``` The main drawback is top-type-sizes not saying what the `__awaitee` is, so it requires manually looking up with a future with matching size. When the `brotli` features on `reqwest` is active, a lot of brotli types show up. Toggling this feature however seems to have no effect. I assume they are false positives since the `brotli` crate has elaborate control about allocation. The sizes are therefore shown with the feature off. ## Results The largest future goes from 12208B to 6416B, the largest type (`PrioritizedDistribution`, see also #948) from 17448B to 9264B. Full diff: https://gist.github.com/konstin/62635c0d12110a616a1b2bfcde21304f For the second commit, i iteratively boxed the largest file until the tests passed, then with an 800KB stack limit looked through the backtrace of a failing test and added some more boxing. Quick benchmarking showed no difference: ```console $ hyperfine --warmup 2 "target/profiling/main-dev resolve meine_stadt_transparent" "target/profiling/puffin-dev resolve meine_stadt_transparent" Benchmark 1: target/profiling/main-dev resolve meine_stadt_transparent Time (mean ± σ): 49.2 ms ± 3.0 ms [User: 39.8 ms, System: 24.0 ms] Range (min … max): 46.6 ms … 63.0 ms 55 runs Warning: Statistical outliers were detected. Consider re-running this benchmark on a quiet system without any interferences from other programs. It might help to use the '--warmup' or '--prepare' options. Benchmark 2: target/profiling/puffin-dev resolve meine_stadt_transparent Time (mean ± σ): 47.4 ms ± 3.2 ms [User: 41.3 ms, System: 20.6 ms] Range (min … max): 44.6 ms … 60.5 ms 62 runs Warning: Statistical outliers were detected. Consider re-running this benchmark on a quiet system without any interferences from other programs. It might help to use the '--warmup' or '--prepare' options. Summary target/profiling/puffin-dev resolve meine_stadt_transparent ran 1.04 ± 0.09 times faster than target/profiling/main-dev resolve meine_stadt_transparent ```

…and large futures" (#1003) Reverts #947

…e futures (#1004) This is #947 again but this time merging into main instead of downstack, sorry for the noise. --- Windows has a default stack size of 1MB, which makes puffin often fail with stack overflows. The PR reduces stack size by three changes: * Boxing `File` in `Dist`, reducing the size from 496 to 240. * Boxing the largest futures. * Boxing `CachePolicy` ## Method Debugging happened on linux using #941 to limit the stack size to 1MB. Used ran the command below. ``` RUSTFLAGS=-Zprint-type-sizes cargo +nightly build -p puffin-cli -j 1 > type-sizes.txt && top-type-sizes -w -s -h 10 < type-sizes.txt > sizes.txt ``` The main drawback is top-type-sizes not saying what the `__awaitee` is, so it requires manually looking up with a future with matching size. When the `brotli` features on `reqwest` is active, a lot of brotli types show up. Toggling this feature however seems to have no effect. I assume they are false positives since the `brotli` crate has elaborate control about allocation. The sizes are therefore shown with the feature off. ## Results The largest future goes from 12208B to 6416B, the largest type (`PrioritizedDistribution`, see also #948) from 17448B to 9264B. Full diff: https://gist.github.com/konstin/62635c0d12110a616a1b2bfcde21304f For the second commit, i iteratively boxed the largest file until the tests passed, then with an 800KB stack limit looked through the backtrace of a failing test and added some more boxing. Quick benchmarking showed no difference: ```console $ hyperfine --warmup 2 "target/profiling/main-dev resolve meine_stadt_transparent" "target/profiling/puffin-dev resolve meine_stadt_transparent" Benchmark 1: target/profiling/main-dev resolve meine_stadt_transparent Time (mean ± σ): 49.2 ms ± 3.0 ms [User: 39.8 ms, System: 24.0 ms] Range (min … max): 46.6 ms … 63.0 ms 55 runs Warning: Statistical outliers were detected. Consider re-running this benchmark on a quiet system without any interferences from other programs. It might help to use the '--warmup' or '--prepare' options. Benchmark 2: target/profiling/puffin-dev resolve meine_stadt_transparent Time (mean ± σ): 47.4 ms ± 3.2 ms [User: 41.3 ms, System: 20.6 ms] Range (min … max): 44.6 ms … 60.5 ms 62 runs Warning: Statistical outliers were detected. Consider re-running this benchmark on a quiet system without any interferences from other programs. It might help to use the '--warmup' or '--prepare' options. Summary target/profiling/puffin-dev resolve meine_stadt_transparent ran 1.04 ± 0.09 times faster than target/profiling/main-dev resolve meine_stadt_transparent ```

BurntSushi approved these changes Jan 16, 2024

View reviewed changes

konstin marked this pull request as draft January 17, 2024 12:49

konstin mentioned this pull request Jan 17, 2024

Reduce stack usage by boxing File in Dist, CachePolicy and large futures #947

Merged

konstin force-pushed the konsti/set-explicit-thread-size branch from 5587b79 to 56ac1a8 Compare January 18, 2024 10:19

konstin changed the title ~~Set explicit stack size to avoid stack overflows on windows~~ Add a feature to simulate a smaller stack size Jan 18, 2024

konstin force-pushed the konsti/set-explicit-thread-size branch from 56ac1a8 to c3242bb Compare January 18, 2024 11:12

konstin marked this pull request as ready for review January 18, 2024 11:13

konstin requested a review from BurntSushi January 18, 2024 11:13

konstin mentioned this pull request Jan 18, 2024

Fix pep508_rs doc test #963

Merged

konstin force-pushed the konsti/set-explicit-thread-size branch from c3242bb to 7b80943 Compare January 18, 2024 11:45

konstin changed the base branch from main to konsti/test-features January 18, 2024 11:46

BurntSushi approved these changes Jan 18, 2024

View reviewed changes

BurntSushi reviewed Jan 18, 2024

View reviewed changes

konstin force-pushed the konsti/test-features branch from d9efe22 to 41ae1d5 Compare January 18, 2024 14:22

Base automatically changed from konsti/test-features to main January 18, 2024 14:24

konstin force-pushed the konsti/set-explicit-thread-size branch from 7b80943 to 5a63635 Compare January 18, 2024 14:28

konstin changed the title ~~Add a feature to simulate a smaller stack size~~ Add an env var to artificially limit the stack size Jan 18, 2024

BurntSushi reviewed Jan 18, 2024

View reviewed changes

konstin force-pushed the konsti/set-explicit-thread-size branch 5 times, most recently from 52878f7 to 1bb4eb7 Compare January 18, 2024 16:39

BurntSushi approved these changes Jan 18, 2024

View reviewed changes

Add an env var to artificially limit the stack size

1dd47b9

konstin force-pushed the konsti/set-explicit-thread-size branch from 1bb4eb7 to 1dd47b9 Compare January 19, 2024 09:28

konstin enabled auto-merge (squash) January 19, 2024 09:28

konstin disabled auto-merge January 19, 2024 09:29

Revert "Reduce stack usage by boxing File in Dist, CachePolicy …

692ab5b

…and large futures" (#1003) Reverts #947

konstin enabled auto-merge (squash) January 19, 2024 09:30

konstin merged commit 66e6519 into main Jan 19, 2024
3 checks passed

konstin deleted the konsti/set-explicit-thread-size branch January 19, 2024 09:34

konstin mentioned this pull request Jan 19, 2024

Reduce stack usage by boxing File in Dist, CachePolicy and large futures #1004

Merged

konstin mentioned this pull request Jan 24, 2024

Add some instructions about build dependencies #1075

Merged

zanieb mentioned this pull request Apr 3, 2024

Python 32 bit on windows overflowed on installing greenlet. #2803

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add an env var to artificially limit the stack size #941

Add an env var to artificially limit the stack size #941

konstin commented Jan 16, 2024 •

edited

Loading

BurntSushi left a comment

BurntSushi Jan 16, 2024

konstin Jan 18, 2024

charliermarsh commented Jan 18, 2024

konstin commented Jan 18, 2024

BurntSushi left a comment

BurntSushi left a comment

BurntSushi Jan 18, 2024

konstin Jan 18, 2024

BurntSushi left a comment

Add an env var to artificially limit the stack size #941

Add an env var to artificially limit the stack size #941

Conversation

konstin commented Jan 16, 2024 • edited Loading

BurntSushi left a comment

Choose a reason for hiding this comment

BurntSushi Jan 16, 2024

Choose a reason for hiding this comment

konstin Jan 18, 2024

Choose a reason for hiding this comment

charliermarsh commented Jan 18, 2024

konstin commented Jan 18, 2024

BurntSushi left a comment

Choose a reason for hiding this comment

BurntSushi left a comment

Choose a reason for hiding this comment

BurntSushi Jan 18, 2024

Choose a reason for hiding this comment

konstin Jan 18, 2024

Choose a reason for hiding this comment

BurntSushi left a comment

Choose a reason for hiding this comment

konstin commented Jan 16, 2024 •

edited

Loading