
feat: support cross-process interception via setupRemoteServer #1617

Draft
wants to merge 237 commits into main
Conversation

@kettanaito kettanaito commented May 12, 2023

This is an experimental feature. It's unlikely to ship before 2.0.

Intention

Introduce an API that allows one process to modify the traffic of another process. The most apparent application for this is testing server-side behaviors of a JavaScript application:

// app.js
forwardNetworkToRemote()

export const loader = async () => {
  const res = await fetch('https://example.com/resource')
}
// app.test.js
it('fetches the user server-side', async () => {
  let network = listenToRemoteNetwork(targetProcess)
  modifyNetwork(network)
  // ...render
  // ...assert
})

This is an example API. For the exact proposed API, keep reading.

This API is designed exclusively for use cases where the request-issuing process and the request-resolving process (i.e. the one where you run MSW) are two different processes.

Proposed API

With consideration to the existing MSW user experience, I suggest we add a setupRemoteServer() API that implements the SetupApi interface and mirrors the API of setupServer. The main user-facing distinction is that setupRemoteServer affects a remote process, as the name indicates.

import { http } from 'msw'
import { setupRemoteServer } from 'msw/node'

const remote = setupRemoteServer(...initialHandlers)

// Notice: async!
beforeAll(async () => await remote.listen())
afterEach(() => remote.resetHandlers())
afterAll(async () => await remote.close())

The .listen() and .close() methods of the remote server become async since they now establish and terminate an internal server instance respectively.

Similar to the setupServer integration, it is recommended to call setupRemoteServer once as a part of your global testing setup. Closing the WebSocket server after each test suite would have performance implications, since every subsequent test suite would have to wait while remote.listen() spawns that server again.

You can then operate with the remote server as you would with a regular setupServer, keeping in mind that it doesn't affect the current process (your test) but instead, any remote process that runs setupServer (your app).

it('handles user errors', () => {
  // Appending and removing request handlers is sync
  // because they are stored in the current (test) process.
  remote.use(
    http.get('/user', () => {
      return new Response(null, { status: 500 })
    })
  )

  // ...interact and assert your app.
})

By fully extending the SetupApi, the setupRemoteServer API provides the user with full network-managing capabilities. This includes defining initial and runtime request handlers, as well as observing the outgoing traffic of a remote process using the Life-cycle API (remote.events.on(event, listener)). I think this is a nice familiarity that also provides the user with more power when it comes to controlling the network.

Implementation

I've considered multiple ways of implementing this feature. Listing them below.

(Chosen) WebSocket server

The setupRemoteServer API can establish an internal WebSocket server that can route the outgoing traffic from any server-side MSW instance anywhere and deliver it to the remote server to potentially resolve.

Technically, the WebSocket server acts as a resolution point (i.e. your handlers) while the remote MSW process acts as a request supplier (similar to how the Service Worker acts in the browser).

Very roughly, this implies that the regular setupServer instances now have a fixed request handler that tries to check if any outgoing request is potentially handled by an existing remote WebSocket server:

// setupServer.js
await handleRequest(
  request,
  requestId,
  [
    // A very basic idea of how a "remote" request handler works:
    // forward the serialized request over the WebSocket connection
    // and await a (potential) serialized response coming back.
    http.all('*', async ({ request }) => {
      wsServer.emit('request', serializeRequest(request))
      const serializedResponse = await new Promise((resolve) => {
        wsServer.once('response', resolve)
      })
      return deserializeResponse(serializedResponse)
    }),
    ...this.currentHandlers,
  ]
)

Unlike request handlers (i.e. functions), Request and Response instances can be safely serialized and transferred over any message channel, such as a WebSocket transport.
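To make the serialization idea concrete, here is a minimal sketch assuming Node.js 18+ globals (Request, Response). The shapes and the serializeRequest/deserializeResponse names are illustrative, not MSW's actual internals:

```javascript
// Serialize a Fetch API Request into a plain, transferable object.
// GET/HEAD requests have no body, so the body stays undefined.
async function serializeRequest(request) {
  return {
    method: request.method,
    url: request.url,
    headers: Array.from(request.headers.entries()),
    body:
      request.method === 'GET' || request.method === 'HEAD'
        ? undefined
        : await request.arrayBuffer(),
  }
}

// Reconstruct a Fetch API Response from its serialized form
// on the other end of the message channel.
function deserializeResponse(serialized) {
  return new Response(serialized.body, {
    status: serialized.status,
    statusText: serialized.statusText,
    headers: serialized.headers,
  })
}
```

Both functions round-trip through structured-clone-friendly values (strings, arrays, ArrayBuffer), which is what makes them safe to send over a WebSocket or any other message channel.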

If no WebSocket server was found or establishing a connection with it fails within a sensible timeout period (~500ms), the setupServer instance of the app continues to operate as normal.
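The lookup-with-fallback behavior can be sketched as a timeout race. This is a sketch under assumptions: connectToRemote is a hypothetical function standing in for whatever establishes the WebSocket connection, and the 500ms figure mirrors the "sensible timeout" above:

```javascript
const REMOTE_LOOKUP_TIMEOUT_MS = 500

// Race a promise against a timeout, cleaning up the timer either way.
function withTimeout(promise, ms) {
  let timer
  const timeout = new Promise((_, reject) => {
    timer = setTimeout(
      () => reject(new Error('Remote server lookup timed out')),
      ms
    )
  })
  return Promise.race([promise, timeout]).finally(() => clearTimeout(timer))
}

// Try to reach the remote WebSocket server; on failure or timeout,
// return null so setupServer continues to operate as normal.
async function lookupRemoteServer(connectToRemote) {
  try {
    return await withTimeout(connectToRemote(), REMOTE_LOOKUP_TIMEOUT_MS)
  } catch {
    return null
  }
}
```

A null result here is the signal to skip the remote handler entirely and resolve requests against the local handlers only.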

Alternatively, we can skip the WebSocket server lookup altogether and make it opt-in via some remote: true option on the app's side.

IPC

The test process and the app process can utilize IPC (interprocess communication) to implement a messaging protocol. Using that protocol, the app can signal back any outgoing requests and the test can try resolving them against the request handlers you defined immediately in the test.

This approach is similar to the WebSocket approach above with the exception that it relies on IPC instead of a standalone running server. With that, it also gains its biggest disadvantage: the app process must be a child process of the test process. This is not easy to guarantee. Depending on the framework's internal implementation, the user may not achieve this parent/child relationship, and the IPC implementation will not work.

Given such a demanding requirement, I've decided not to use this implementation.

Limitations

  • setupRemoteServer() affects the network resolution for the entire app. This means you cannot have multiple tests overriding request handlers for the same app at the same time. I think this is more than reasonable, since you are running one app instance that can only behave in a single way at any single point in time. Still, I expect users to be confused when they parallelize their E2E tests and suddenly see some network behaviors leaking across test cases.

Concerns

  • Can we rely on a fixed local port to always be available?
  • Is it safe to introduce a WebSocket server that will be, effectively, routing HTTP messages over the local network (during tests only)?
    • Yes. If someone can intercept that WebSocket communication, they are already on your machine and can do far worse things than that.
  • Is it clear that setupRemoteServer only affects the server-side network behavior of any running application process with the server-side MSW integration? To affect the client-side network behavior from a test you have to 1) have setupWorker integration in the app; 2) set a global window.worker instance; 3) use window.worker.use() to add runtime request handlers. This stays as it is right now, no changes here.

The API is TBD and is subject to change.

Roadmap

  • Ensure the sync server connection is awaited before the first request handler runs.
  • Introduce serialize/deserialize utilities for requests and responses (used both in the worker and in the WS sync layer now).
  • Fix listeners' memory leaks on hot updates (clean up listeners).
  • Make the WS events map type-safe
  • Rely on the internal request header when bypassing Socket IO connection requests in the rest.all() handler.
  • Handle socket timeout and errors when awaiting the response in setupServer.
  • Support ReadableStream from the remote request handler (may consider transferring ReadableStream over the WS messages instead of ArrayBuffer, if that's allowed).
    • This may not be needed, in the end, but if we can pull off ReadableStream transfer over WebSockets that would be great.
  • Support all Life-cycle events.
  • Support setting a custom WebSocket server port number through environment variables.
  • Make the remotePort and port an implementation detail of setupRemoteServer and setupServer({ remote: true }). The developer shouldn't have to care about those.
  • Do not spread the list of user-defined request handlers to prepend the fixed remote server handler (spreading of large lists may have performance implications).
    • Not an issue until proven otherwise; have no wish to optimize prematurely.
  • Solve the test/app catch-22 by attaching self-replicating one-time handlers only for the first-time requests (those fetched when the testing framework pings your app).
  • Fix: failing use() test (may have something to do with the handlers management refactoring as a part of the server.boundary()).
  • Support differentiating between requests done in different Playwright workers (see this).
  • Add more tests, specifically for different response body types.
  • Consider adding setupWorker support (see feat: support cross-process interception via setupRemoteServer #1617 (comment)).
  • Consider dropping socket.io in favor of ws if we don't need anything socketio-specific
  • Silence WebSocket connection errors if the remote mode is enabled but the WebSocket server is not running or fails to connect. Print a warning and continue in regular mode.
  • Don't send life-cycle events for the WebSocket connection HTTP request to the WebSocket server (forwardLifeCycleEvents()).

Blockers

@kettanaito

@SebastianSedzik, yes, the intention is to iterate on the current design to allow multiple remote connections to the same setupServer instance. There are still problems with differentiating between those connections, as there is currently no way to know which test triggered a request in order to respond to it appropriately.

@arjenbloemsma,

Will this feature also allow to mock calls made by Next.js api routes?

This feature will allow you to control the network in one Node.js (e.g. the server-side of your Next.js) from another (e.g. your test).

So imagine I have a client component that calls a nextjs api route on the server, which in turn will make a call to a third party app. I would like to intercept the call to the 3rd party app with MSW and mock it.

For in-application usage, this already works. Check out this Next.js + MSW example for the full setup to enable this.

@kettanaito

I have some thoughts regarding the application spawn order and identifying individual tests/workers. Will share once I have a prototype confirming my ideas. Looks promising.

@spuxx1701

This is really great. Thank you for your hard work! 🎉

@kettanaito

The tests are failing because I'm on pnpm 9 and #2211 isn't merged yet.


kettanaito commented Dec 4, 2024

Blockers

@kettanaito

Update

Alright, there's a lot to unpack. In short, I've got the MVP of CPRI completely working. This includes the test/app paradox solved, the association between the app's runtime and a particular test/worker closure, and the ability to control the server-side network from the test.

Documenting those decisions for posterity.

Test/app paradox

Solved by two things:

  1. You have to build your app before the entire test run. Use the global setup hooks to achieve that.
  2. You have to start individual instances of your built app in each individual test case.

Not only does this solve the test/app paradox by guaranteeing a fixed order of things (first the test, then remote, then your app spawns), it also allows the app to be spawned with a custom contextId, which is important for the next thing.

Test/app binding

Even when spawned within individual test cases, your app still communicates with a single instance of remote (to save bandwidth on spawning a ton of WebSocket servers, those are redundant). This makes remote a shared state across different test scenarios, which isn't nice (but it is nice to have it as a shared fallback!).

This is solved by using the remote.boundary() API. Similar to the existing server.boundary(), the remote boundary utilizes AsyncLocalStorage in Node.js to bind request handler overrides to the test's closure, preventing the shared state issue and allowing for concurrent test runs. In the remote context, the boundary has additional behavior because request resolution doesn't happen within the test's closure like it does for regular Node.js tests. Instead, it happens in the request listener of the WebSocket server, which is "visually" in await remote.listen().

Test/app binding occurs because remote.contextId grabs the unique ID of the particular boundary and stores it in an internal Map<ContextId, () => Context> of the remote. During request resolution (i.e. the request event handling), the API can look up any async context by its ID and get the list of relevant handlers. Only those handlers will be used (the happy-path handlers from setupRemoteServer still apply, as with server.boundary()).

What's next?

A ton of todo's to cover! The public API isn't the most ergonomic thing, and I'm considering simplifying it quite a bit while exposing the customization options to the end developer. There are also some security considerations I want to address, mostly concerning safe defaults of the WebSocket server CORS.

This is also a good time to sponsor this effort. Thank you.


kettanaito commented Dec 12, 2024

Bug: Empty buffer sent to WebSocket connection

[setupRemoteServer] ws client ERROR! Error: got binary data when not reconstructing a packet
    at Decoder.add (/msw/node_modules/.pnpm/socket.io-parser@4.2.4/node_modules/socket.io-parser/build/cjs/index.js:161:23)
    at Client.ondata (/msw/node_modules/.pnpm/socket.io@4.7.5/node_modules/socket.io/dist/client.js:182:26)
    at Socket.emit (node:events:518:28)
    at Socket.onPacket (/msw/node_modules/.pnpm/engine.io@6.5.4/node_modules/engine.io/build/socket.js:120:22)
    at WebSocket.emit (node:events:518:28)
    at WebSocket.onPacket (/msw/node_modules/.pnpm/engine.io@6.5.4/node_modules/engine.io/build/transport.js:94:14)
    at WebSocket.onData (/msw/node_modules/.pnpm/engine.io@6.5.4/node_modules/engine.io/build/transport.js:103:14)
    at WebSocket.<anonymous> (/msw/node_modules/.pnpm/engine.io@6.5.4/node_modules/engine.io/build/transports/websocket.js:20:19)
    at WebSocket.emit (node:events:518:28)
    at Receiver.emit (node:events:518:28)
    at Receiver.dataMessage (/msw/node_modules/.pnpm/ws@8.11.0/node_modules/ws/lib/receiver.js:514:14)
    at Receiver.getData (/msw/node_modules/.pnpm/ws@8.11.0/node_modules/ws/lib/receiver.js:446:17)
    at Receiver.startLoop (/msw/node_modules/.pnpm/ws@8.11.0/node_modules/ws/lib/receiver.js:148:22)
    at Receiver._write (/msw/node_modules/.pnpm/ws@8.11.0/node_modules/ws/lib/receiver.js:83:10)
    at writeOrBuffer (node:internal/streams/writable:564:12)
    at _write (node:internal/streams/writable:493:10)
    at Receiver.Writable.write (node:internal/streams/writable:502:10)
    at Socket.socketOnData (/msw/node_modules/.pnpm/ws@8.11.0/node_modules/ws/lib/websocket.js:1272:35)
    at Socket.emit (node:events:518:28)
    at addChunk (node:internal/streams/readable:559:12)
    at readableAddChunkPushByteMode (node:internal/streams/readable:510:3)
    at Socket.Readable.push (node:internal/streams/readable:390:5)
    at TCP.onStreamRead (node:internal/stream_base_commons:190:23)
    at TCP.callbackTrampoline (node:internal/async_hooks:130:17)

No idea why this happens. Something down the line sends an empty buffer to the server, SocketIO tries to emit the message with that buffer, and fails on the top-most frame.

Root cause

this.forwardLifeCycleEvents()

Forwarding the response:bypass event for the WebSocket handshake to the remote causes this error:

{
  type: 'response:bypass',
  payload: {
    response: {
      __serializedType: 'response',
      status: 101,
      statusText: 'Switching Protocols',
      headers: [Array],
      body: [ArrayBuffer]
    },
    request: {
      __serializedType: 'request',
      method: 'GET',
      url: 'http://localhost:56957/socket.io/?EIO=4&transport=websocket',
      headers: [Array],
      body: undefined
    },
    requestId: '16ddfd8d917a6'
  }
}

Note that the body of the response is an empty ArrayBuffer. Sending an empty buffer errors the SocketIO parsing phase.
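One possible mitigation is to skip forwarding life-cycle events whose serialized response body is an empty ArrayBuffer (such as the 101 Switching Protocols handshake above). This is a hypothetical guard for illustration, not the fix that landed:

```javascript
// Decide whether a serialized life-cycle event is safe to forward
// to the remote server. Empty ArrayBuffer bodies trip up the
// SocketIO parser, so they are filtered out.
function shouldForwardEvent(event) {
  if (event.type !== 'response:bypass') {
    return true
  }
  const body = event.payload.response.body
  return !(body instanceof ArrayBuffer && body.byteLength === 0)
}
```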


kettanaito commented Dec 12, 2024

Bug: Deep objects cannot be emitted with SocketIO

Not sure if it's MSW interfering in some way, but the request event emitted with a deep object as an argument never arrives at the server. Shallow objects work fine.

Would be nice to add an integration test to Interceptors to confirm or debunk this.

✅ Solved

Moved away from WebSockets to HTTP for the internal server.

@kettanaito

Test failures

 FAIL  test/node/msw-api/setup-remote-server/response.body.test.ts [ test/node/msw-api/setup-remote-server/response.body.test.ts ]
 Test Files  2 failed | 84 passed (86)
Error: listen EADDRINUSE: address already in use ::1:56957

This is caused by migrating to HTTP for the internal server. Since Vitest likely runs some of these tests in parallel, each parallel run calls remote.listen(), which attempts to create multiple internal HTTP servers on the same port and, naturally, fails.

This is a good precursor to figuring out whether the internal server port should be fixed or random. I lean toward making it random and passing it along with remoteContext through remote.boundary() (but that would make the boundary required; alternatively, we can use a fallback port).
