Fix random failures on functional tests #14331

rchiodo · 2020-10-08T21:48:08Z

For https://github.com/microsoft/vscode-python/issues/14290

Root cause was the port used by the kernels were overlapping.

Fixed this by using a file to keep track of ports in use (poor man's named mutex). File is cleaned up on exit and not disposed as dispose would be called in between each test.

Also added a python file to parse test log files. It has the following parameters:

--testoutput - parses the log looking for test lines (they all have ansi coloring) and prints them
--split - splits the testoutput file into multiple files based on process ids logged so you can debug test failures that are in parallel

IanMatthewHuff · 2020-10-08T22:12:43Z

.vscode/settings.json

@@ -45,7 +45,7 @@
        "source.fixAll.eslint": true,
        "source.fixAll.tslint": true
    },
-    "python.languageServer": "Microsoft",
+    "python.languageServer": "Pylance",


Do you want to change this in the PR?

Oh probably not. I'll put it back

codecov-io · 2020-10-08T22:24:07Z

Codecov Report

Merging #14331 into main will increase coverage by 0.13%.
The diff coverage is 63.69%.

@@            Coverage Diff             @@
##             main   #14331      +/-   ##
==========================================
+ Coverage   59.75%   59.88%   +0.13%     
==========================================
  Files         697      709      +12     
  Lines       38649    39339     +690     
  Branches     5577     5700     +123     
==========================================
+ Hits        23094    23559     +465     
- Misses      14364    14539     +175     
- Partials     1191     1241      +50

Impacted Files	Coverage Δ
src/client/activation/common/activatorBase.ts	`14.41% <0.00%> (+1.19%)`	⬆️
...rc/client/activation/jedi/multiplexingActivator.ts	`17.74% <0.00%> (ø)`
src/client/activation/node/languageServerProxy.ts	`26.58% <0.00%> (ø)`
src/client/activation/refCountedLanguageServer.ts	`41.30% <0.00%> (ø)`
src/client/activation/types.ts	`100.00% <ø> (ø)`
src/client/common/process/baseDaemon.ts	`21.76% <0.00%> (-0.53%)`	⬇️
src/client/common/process/pythonDaemon.ts	`14.28% <0.00%> (ø)`
src/client/common/utils/localize.ts	`96.24% <ø> (ø)`
src/client/datascience/baseJupyterSession.ts	`57.33% <ø> (+0.52%)`	⬆️
.../datascience/interactive-common/interactiveBase.ts	`5.78% <0.00%> (+<0.01%)`	⬆️
... and 83 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c248197...688fb58. Read the comment docs.

IanMatthewHuff · 2020-10-08T22:28:37Z

src/client/datascience/kernel-launcher/kernelLauncher.ts

@@ -49,18 +96,28 @@ export class KernelLauncher implements IKernelLauncher {
        return kernelProcess;
    }

-    private async getKernelConnection(): Promise<IKernelConnection> {
+    private async getPorts(): Promise<number[]> {


This is really minor, but maybe not the exact same name as the portfinder function that we are using? Or change the other one to portfinderGetPorts.

DonJayamanne · 2020-10-08T22:56:22Z

src/client/common/process/baseDaemon.ts

@@ -177,14 +177,20 @@ export abstract class BasePythonDaemon {
        return Object.keys(options).every((item) => daemonSupportedSpawnOptions.indexOf(item as any) >= 0);
    }
    protected sendRequestWithoutArgs<R, E, RO>(type: RequestType0<R, E, RO>): Thenable<R> {
-        return Promise.race([this.connection.sendRequest(type), this.connectionClosedDeferred.promise]);
+        if (this.proc && this.proc.exitCode === null) {


Suggested change

if (this.proc && this.proc.exitCode === null) {

if (this.proc && typeof this.proc.exitCode !== 'number') {

Else if we have undefined, then above condition doesn't work. I feel that's better than hardcoding null.
Optional change requets

The api says it returns a number or null. Never undefined.

Yes, just that usage of null is frowned upon & comparing against null isn't a good practice either.
I guess its old code that they can't remove..

DonJayamanne · 2020-10-08T22:57:55Z

src/client/datascience/kernel-launcher/kernelLauncher.ts

+    public static async cleanupStartPort() {
+        try {
+            // Destroy the file
+            const port = await KernelLauncher.startPortPromise;


Not sure I like this.
I thought we decided we'd only use such code for the tests.

Yup, all good, I can see it is

DonJayamanne · 2020-10-08T23:00:50Z

pythonFiles/vscode_datascience_helpers/tests/logParser.py

+from pathlib import Path
+import re
+
+parser = argparse.ArgumentParser(description="Parse a test log into its parts")


Is this used?

logParser? Or the args?

This file is used (I mentioned it in the PR description) to split a test output file into synchronized parts so you can debug it.

sonarcloud · 2020-10-08T23:10:21Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities (and 0 Security Hotspots to review)
0 Code Smells

No Coverage information
0.0% Duplication

* Splitting test log * Fix problem with kernels ports being reused * Make kernel launcher port round robin only for testing * Make formatters change only apply during testing * Add news entry * Apply black formatting * Code review feedback and skip flakey remote password test * Another flakey test * More CR feedback * Missed a spot

* Fix two problems with escaping (#14228) * Remove unneeded cell keys when exporting (#14241) * Remove transient output when exporting from the interactive window * Add news entry * Test was failing with true jupyter (#14261) * Potential fix for ipywidget flakiness (#14281) * Try running tests with space in root path (#14113) * Add test with a space (only works on flake) * Push to insiders.yml only * Remove test that doesn't really do anything * REmove unused bits * Change path to have unicode too * Get test to run * Set root path differently * Valid dir * A different way * Another way * Try creating the directory first * Another try * Only one env * Pass parameters correctly * Try without unicode * Set working directory directly on xvfb actions * Working-directory not workingDirectory * Cached ts files output * Remove test with space branch for insiders * Update vscode-python-pr-validation.yaml (#14285) REmove missing branch? Might make it work again * Get rid of AZDO yamls. Not used anymore * Dont run on push (#14307) * Fix random failures on functional tests (#14331) * Splitting test log * Fix problem with kernels ports being reused * Make kernel launcher port round robin only for testing * Make formatters change only apply during testing * Add news entry * Apply black formatting * Code review feedback and skip flakey remote password test * Another flakey test * More CR feedback * Missed a spot * More of the functional tests are failing (#14346) * Splitting test log * Fix problem with kernels ports being reused * Make kernel launcher port round robin only for testing * Make formatters change only apply during testing * Add news entry * Apply black formatting * Code review feedback and skip flakey remote password test * Another flakey test * More CR feedback * Missed a spot * Some more log parser changes and try to get interrupt to be less flakey * Fix interrupt killing kernel and add more logging for export * More logging * See if updating fixes the problem * Dont delete temp files * Upload webview output to figure out trust failure * Add name to step * Try another way to upload * Upload doesn't seem to work * Try a different way to upload * Try without webview logging as this makes the test pass * Try fixing test another way. Any logging is making the test pass * Compile error * Add more logging to figure out why raw kernel did not start (#14374) * Some more logging * Some more logging * Move PR changes into pr.yml * Fix multiprocessing problems with setting __file__ (#14376) * Fix multiprocessing problems with setting __file__ * Update news entry * Problem with wait for idle not propagating outwards * Fix unnecessary ask for python extension install * Don't error on warning for kernel install

rchiodo added 4 commits October 7, 2020 15:50

Splitting test log

2aac150

Fix problem with kernels ports being reused

bcd65e9

Make kernel launcher port round robin only for testing

a908fe1

Make formatters change only apply during testing

629e881

rchiodo requested review from DavidKutu, DonJayamanne, greazer, IanMatthewHuff and joyceerhl October 8, 2020 21:48

Add news entry

e64ff0b

rchiodo self-assigned this Oct 8, 2020

Apply black formatting

30aaacd

IanMatthewHuff reviewed Oct 8, 2020

View reviewed changes

IanMatthewHuff approved these changes Oct 8, 2020

View reviewed changes

Code review feedback and skip flakey remote password test

cd0fe62

DonJayamanne reviewed Oct 8, 2020

View reviewed changes

DonJayamanne approved these changes Oct 8, 2020

View reviewed changes

rchiodo added 3 commits October 8, 2020 16:03

Another flakey test

d4e19f7

More CR feedback

56076da

Missed a spot

688fb58

rchiodo merged commit 1a6aa56 into main Oct 8, 2020

rchiodo deleted the rchiodo/unmounted_test_failure branch October 8, 2020 23:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix random failures on functional tests #14331

Fix random failures on functional tests #14331

rchiodo commented Oct 8, 2020

IanMatthewHuff Oct 8, 2020

rchiodo Oct 8, 2020

codecov-io commented Oct 8, 2020 •

edited

Loading

IanMatthewHuff Oct 8, 2020

rchiodo Oct 8, 2020

DonJayamanne Oct 8, 2020

rchiodo Oct 8, 2020

DonJayamanne Oct 8, 2020

DonJayamanne Oct 8, 2020

DonJayamanne Oct 8, 2020

DonJayamanne Oct 8, 2020

rchiodo Oct 8, 2020

rchiodo Oct 8, 2020

sonarcloud bot commented Oct 8, 2020

	if (this.proc && this.proc.exitCode === null) {
	if (this.proc && typeof this.proc.exitCode !== 'number') {

Fix random failures on functional tests #14331

Fix random failures on functional tests #14331

Conversation

rchiodo commented Oct 8, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codecov-io commented Oct 8, 2020 • edited Loading

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sonarcloud bot commented Oct 8, 2020

codecov-io commented Oct 8, 2020 •

edited

Loading