Remove test flakiness #181

Abestanis · 2024-10-27T18:22:54Z

This is an attempt to reduce the test flakiness. I noticed that setUp calls are not executed in the same FakeAsync zone as the tests, which means that they are more likely to leak into other tests, and flushing microtasks in the tests doesn't actually flush the ones scheduled in setUp. Maybe this is enough to fix the flakiness in the tests. It definitely helped to reveal two instances where we forgot to dispose a timer or animation controller.
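To illustrate the zone issue, here is a minimal sketch that uses only `dart:async`. `TinyFakeZone` and `demo` are hypothetical names made up for this example; the class is a heavily simplified stand-in for a fake-async zone, not the actual flutter_test implementation. It only shows why a flush inside one zone cannot reach microtasks scheduled outside of it.

```dart
import 'dart:async';

/// Hypothetical, heavily simplified stand-in for a fake-async zone:
/// microtasks scheduled inside [run] are queued instead of executed.
class TinyFakeZone {
  final List<void Function()> _pending = [];

  T run<T>(T Function() body) {
    return runZoned(
      body,
      zoneSpecification: ZoneSpecification(
        scheduleMicrotask: (self, parent, zone, task) => _pending.add(task),
      ),
    );
  }

  /// Runs every microtask that was captured by this zone, and only those.
  void flushMicrotasks() {
    while (_pending.isNotEmpty) {
      _pending.removeAt(0)();
    }
  }
}

/// Returns the log as seen right after flushing inside the fake zone.
List<String> demo() {
  final log = <String>[];

  // Scheduled outside the fake zone, like work started in setUp.
  scheduleMicrotask(() => log.add('from setUp'));

  final fake = TinyFakeZone();
  fake.run(() {
    // Scheduled inside the fake zone, like work started in the test body.
    scheduleMicrotask(() => log.add('from test'));
    fake.flushMicrotasks();
  });

  // Copy the log so later additions do not change the returned snapshot.
  return List.of(log);
}

void main() {
  print(demo());
}
```

The snapshot returned by `demo()` should contain only the entry from the test body. The microtask scheduled outside the zone is still pending at that point and runs later, which is the kind of leak described above.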

I also removed all runAsync calls. As a general rule of thumb, we don't want the tests to do any real async work: network requests, file accesses or, even worse, waiting for a timeout should all be mocked. So we should never use runAsync.

Fixes #162.

test/golden/goldens/artist_content_route.artist_content_route_songs.dark.png

test/golden/goldens/player_route.player_route.dark.artColor.png

Abestanis · 2024-10-27T22:30:35Z

Given that 100 cycles of the tests passed (see CI run for 25134ad), I'd say we can be pretty confident that this fixes the flakiness. 😁

test/routes/home_route_test.dart

nt4f04uNd · 2024-11-03T13:20:08Z

test/golden/goldens/artist_content_route.artist_content_route_songs.light.png

Interestingly, a side effect I see on many screenshots is that the shadow from the drawer seems to have gone.

It's OK, just something I noticed while reviewing the golden changes.

nt4f04uNd · 2024-11-03T13:31:33Z

test/test.dart

+    AsyncCallback callback, {
+    AsyncCallback? goldenCaptureCallback,
+    VoidCallback? initialization,
+    VoidCallback? postInitialization,


Should we align runAppTestWithoutUi with this one and pass this callback as well?

nt4f04uNd · 2024-11-03T13:32:46Z

test/test.dart

+  ///  4. Runs the test from the [callback].
+  ///  5. Optionally, runs [goldenCaptureCallback].
+  ///  6. Stops and disposes the player.
+  ///  7. Un-pumps the screen and flushes all micro-tasks and stream events.


I would also cross-reference runAppTestWithoutUi here as "See also", discussing their differences and when to use one over the other.

Same for the comment in runAppTestWithoutUi

nt4f04uNd · 2024-11-03T13:35:06Z

test/test.dart

-      // Wait for ui animations.
-      await pumpAndSettle();
+        // Un-pump, in case we have any real animations running,
+        // so the pumpAndSettle on the next line doesn't hang on.


This is no longer the "next line".

nt4f04uNd · 2024-11-03T13:37:07Z

test/test.dart

+        await pumpWidget(const SizedBox());
+        // Wait for any asynchronous events and stream callbacks to finish.
+        await pump(const Duration(seconds: 1));
+        // Wait for ui animations.


I imagine this is referring to "system UI" (but I don't remember for sure).

Let's put that into the comment, though.

Otherwise, pumping out the UI above would already stop the animations and we wouldn't need pumpAndSettle.

nt4f04uNd · 2024-11-03T13:50:04Z

test/test.dart

+            },
+            () => test(tester),
+            goldenCaptureCallback: () => tester.screenMatchesGolden(
+              Invoker.current!.liveTest.test.name.split(' | theme')[0].replaceAll(' ', '.'),


The filename is already being built here: https://github.com/nt4f04uNd/sweyer/blob/master/test/test.dart#L259

I don't completely understand the purpose of this code, but it seems like using the description parameter would already enrich it with the required parameters, maybe except for the group name.

This code just doesn't seem very scalable.

nt4f04uNd · 2024-11-03T13:53:15Z

test/logic/player/favorites_test.dart

@@ -10,46 +10,46 @@ void main() {
  final favoriteSong2 = songWith(id: 4, title: 'Song 4', isFavoriteInMediaStore: true);
  final favoriteSong3 = songWith(id: 5, title: 'Song 5', isFavoriteInMediaStore: true);

-  setUp(() async {
-    await setUpAppTest(() {


Are we sure this is a good API change, given that we can no longer use setUp between tests?

I'd argue it's not: it reduces the reusability of setup logic.

In some cases it might also make test utilities non-composable with each other. This is a real case I had in tests recently, when I made a not very composable API decision, which put constraints on how tests could be written, so I had to rewrite it at some point afterwards.

From the description of the PR I see that you want them to live inside the same zone as the other test logic.

Is this possible to achieve in some other way?

Abestanis added 4 commits October 27, 2024 19:04

Ensure the app init is run in the same FakeAsync zone as the test

3283a6b

Dispose global controllers on app exit

5a24747

Dispose the playTimer in the MockAudioPlayer on test exit

db8760a

Get rid of some more runAsync calls

a46d382

Abestanis requested a review from nt4f04uNd October 27, 2024 18:22

Abestanis marked this pull request as draft October 27, 2024 18:25

Abestanis added 3 commits October 27, 2024 21:10

Fix some golden tests

7c78ec1

Fix back button test

e2ebcfa

Stop playback before taking the golden screenshot in player_route

fa7b741

Abestanis force-pushed the feature/test_cleanup branch from c67e075 to fa7b741 Compare October 27, 2024 20:31

🤖 Update Golden test artifacts 🤖

334cac1

Abestanis force-pushed the feature/test_cleanup branch from aa6c5f1 to 334cac1 Compare October 27, 2024 20:39

Fix second back button test

a529227

Abestanis force-pushed the feature/test_cleanup branch from 2d8d846 to 25134ad Compare October 27, 2024 21:00

Get rid of the remaining runAsync in the tests

f5936a1

Abestanis force-pushed the feature/test_cleanup branch from 25134ad to f5936a1 Compare October 27, 2024 23:47

Abestanis added the tests label Oct 27, 2024

Abestanis marked this pull request as ready for review October 27, 2024 23:48

Abestanis changed the title ~~Reduce test flakiness~~ Remove test flakiness Oct 27, 2024

nt4f04uNd reviewed Nov 3, 2024