Continuous Integration Performance and Robustness Brainstorming #311

bollwyvl · 2020-08-11T00:47:07Z

Elevator Pitch

Let's gather some ideas for how we might improve the value we're getting from CI.

Motivation

CI is not really that broken, but it's pretty slow. There might be some other tools and services we can look into which wouldn't require altering the code that much, so can be done in parallel, and incrementally, while feature work is on-going.

Design Ideas

Post/emote ideas below.

bollwyvl · 2020-08-11T00:48:29Z

Migrate to GitHub Actions

It seems to be where things are going, and 2x simultaneous jobs than Azure Pipelines.

bollwyvl · 2020-08-11T00:49:41Z

Investigate mamba

implemented in #312

Using mamba instead of upgrading conda will potentially save us some time when solving environments and downloading files. It can sometimes return different solutions (but so can different versions of conda).

bollwyvl · 2020-08-11T00:51:16Z

Investigate conda-lock

Instead of even solving the environments in CI, we could generate them quickly, offline, and check them into the ci subfolder. Stacks with mamba (which it will use preferentially).

bollwyvl · 2020-08-11T00:58:47Z

Revisit caching

Being able to cache:

conda lock solutions
the built lab
tectonic cache
conda package cache
yarn package cache

...in roughly that order, would knock some minutes off all of the runs, but particularly on windows which seems to have really bad IO. Github Actions and Azure both have more advanced (but of course, incompatible) caching actions. They all work best if we have things checked in that get hashed. The entropy on the built lab is pretty challenging to describe.

bollwyvl · 2020-08-11T01:18:53Z

Investigate dodo

dodo has been very useful for tying together some large builds, and normalizing ci vs local development. #183 has a good amount of work in it, but can be improved substantially. A challenge: pathlib is pretty important, so supporting 3.5 might be a challenge (pathlib2 isn't quite the same).

krassowski · 2020-08-31T12:37:56Z

Some test cases fail on timeouts (especially on Windows):

Windows38.04 Interface.Statusbar                                              
==============================================================================
Statusbar Popup Opens                                                 | FAIL |
Setup failed:
Element 'css:.jp-mod-accept.jp-mod-warn' did not appear in 5 seconds.

or:

Windows38.04 Interface.Statusbar                                              
==============================================================================
Statusbar Popup Opens                                                 | FAIL |
Element 'css:div.lsp-statusbar-item' did not get text 'Fully initialized' in 1 minute.

or:

Windows38.01 Editor                                                           
==============================================================================
JS                                                                    | FAIL |
Setup failed:
Element 'css:.jp-mod-accept.jp-mod-warn' did not appear in 5 seconds.
------------------------------------------------------------------------------
JSON                                                                  | PASS |
------------------------------------------------------------------------------
SQL                                                                   | PASS |
------------------------------------------------------------------------------
YAML                                                                  | PASS |
------------------------------------------------------------------------------
Markdown                                                              | PASS |
------------------------------------------------------------------------------
Python                                                                | PASS |
------------------------------------------------------------------------------
SCSS                                                                  | PASS |
------------------------------------------------------------------------------
CSS                                                                   | PASS |
------------------------------------------------------------------------------
JSX                                                                   | FAIL |
Setup failed:
Element 'css:.jp-mod-accept.jp-mod-warn' did not appear in 5 seconds.
------------------------------------------------------------------------------

yet, they usually work out in the final attempt. The 5 seconds timeout could probably be extended (10s?).

bollwyvl · 2020-09-08T15:22:10Z

Don't run jest tests on windows

The jest tests take 3 minutes on windows, and don't really provide additional information vs the linux/osx runs (other than node not working great on windows, which is known) since it's all make-believe DOM.

I move we just drop them from all the windows runs on azure.

krassowski · 2020-09-08T20:46:07Z

Playing around with gtihub actions.. is there a reason why we use conda rather than miniconda?

krassowski · 2020-09-08T20:46:58Z

Or, do we? Honestly cannot see when it is getting installed...

bollwyvl · 2020-09-08T23:12:49Z

Miniconda is the _distribution_, while conda is the _tool_. Both azure and GitHub actions have _an_ version preinstalled. I'm happy to take a look at this as well, having recently been using actions more.

krassowski · 2020-09-08T23:43:58Z

Obviously I meant anaconda ;) Trying to use https://github.com/conda-incubator/setup-miniconda (seems to support mamba in a way?) at the moment after previously trying setup-conda and having a brief look at setup-mamba (which does not seem to support Win)

bollwyvl · 2020-09-11T18:50:37Z

Cache built lab static

Rebuilding JupyterLab with all the extensions on Windows is important, at least until 3.0. However, it does take a rather long amount of time, in the 3-4 minute neighborhood.

I have a hunch after we run all of our labextension install and labextension link, the contents of lab/{staging,extensions} and our built tarballs will be sufficiently reproducible that it should be safe to cache the built lab/static folder, and reuse that instead of building it on a cache hit. This would especially be useful for iteration on robot tests. Would need to do some significant investigation, however, which may not be worth it.

bollwyvl · 2020-09-11T19:19:17Z

Run robot tests in parallel

pabot can run tests in parallel. I think our tests are sufficiently self-contained (minus things like the jedi/tectonic caches) that we could run them in parallel, so it might be a one-line change (e.g. import pabot as robot)

The browser tests are the reason we're doing half this stuff, but they also contribute over half of the duration, by themselves. A lot of that is waiting around for the browser to load/do stuff, which might not actually be that taxing on the poor little processor (especially windows). However, given an anecdotal win/py37 runtime of around 27 minutes, even getting a 25% reduction in duration would be beneficial. Additionally, if this allowed local tests to run even faster, it would reduce the pain of developing/maintaining/refactoring robot tests, as well.

Update: this is now available on conda-forge

bollwyvl mentioned this issue Aug 11, 2020

Support JupyterLab 2.2 #301

Merged

4 tasks

bollwyvl mentioned this issue Aug 11, 2020

try mamba, remove docs job in azure #312

Merged

4 tasks

bollwyvl mentioned this issue Sep 8, 2020

[wip] Add completion configurer #328

Open

14 tasks

krassowski mentioned this issue Sep 8, 2020

First attempt to setup GitHub actions #345

Merged

4 tasks

bollwyvl mentioned this issue Sep 10, 2020

[wip] [squash] Add conda locks for CI #348

Closed

8 tasks

bollwyvl added the question Further information is requested label Feb 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Continuous Integration Performance and Robustness Brainstorming #311

Continuous Integration Performance and Robustness Brainstorming #311

bollwyvl commented Aug 11, 2020

bollwyvl commented Aug 11, 2020

bollwyvl commented Aug 11, 2020 •

edited

Loading

bollwyvl commented Aug 11, 2020

bollwyvl commented Aug 11, 2020

bollwyvl commented Aug 11, 2020

krassowski commented Aug 31, 2020 •

edited

Loading

bollwyvl commented Sep 8, 2020 •

edited

Loading

krassowski commented Sep 8, 2020

krassowski commented Sep 8, 2020 •

edited

Loading

bollwyvl commented Sep 8, 2020 via email

krassowski commented Sep 8, 2020

bollwyvl commented Sep 11, 2020

bollwyvl commented Sep 11, 2020 •

edited

Loading

Continuous Integration Performance and Robustness Brainstorming #311

Continuous Integration Performance and Robustness Brainstorming #311

Comments

bollwyvl commented Aug 11, 2020

Elevator Pitch

Motivation

Design Ideas

bollwyvl commented Aug 11, 2020

Migrate to GitHub Actions

bollwyvl commented Aug 11, 2020 • edited Loading

Investigate mamba

bollwyvl commented Aug 11, 2020

Investigate conda-lock

bollwyvl commented Aug 11, 2020

Revisit caching

bollwyvl commented Aug 11, 2020

Investigate dodo

krassowski commented Aug 31, 2020 • edited Loading

bollwyvl commented Sep 8, 2020 • edited Loading

Don't run jest tests on windows

krassowski commented Sep 8, 2020

krassowski commented Sep 8, 2020 • edited Loading

bollwyvl commented Sep 8, 2020 via email

krassowski commented Sep 8, 2020

bollwyvl commented Sep 11, 2020

Cache built lab static

bollwyvl commented Sep 11, 2020 • edited Loading

Run robot tests in parallel

bollwyvl commented Aug 11, 2020 •

edited

Loading

krassowski commented Aug 31, 2020 •

edited

Loading

bollwyvl commented Sep 8, 2020 •

edited

Loading

krassowski commented Sep 8, 2020 •

edited

Loading

bollwyvl commented Sep 11, 2020 •

edited

Loading