Improved connection / stream results when wai applications throw exceptions. #740

iand675 · 2019-04-19T07:19:02Z

Bumped the version number

After submitting your PR:

Update the Changelog.md file with a link to your PR
Check that CI passes (or, if it fails, that it fails for reasons unrelated to your change, like CI timeouts)

Overview

Our services operate behind Google load balancer instances, which assume that opened connections stay open for up to 10 minutes. In the event of uncaught exceptions being handled by Warp, the current behavior for HTTP/1 is to close the connection after sending the error response. The HTTP/2 code ignores the user-provided exception response handler from settings and closes the stream with an HTTP/2 stream INTERNAL_ERROR signal.

On HTTP/1 connections, this meant that subsequent requests would attempt to reuse the closed connection and fail (since the connection was gone). For GET requests, that wasn't a huge deal, since the load balancer would replay them (yay idempotency 🎉). For non-idempotent requests, we'd just lose the data. On HTTP/2, the issue was just the inconvenience of not getting proper 500s back.

Arguably, Warp's current behavior is acceptable according to the HTTP specs, and it is probably the easiest way to recover from any odd error states, but this PR attempts to make connection reuse a bit more robust so we can keep our architecture the same.
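For readers unfamiliar with the "exception response handler from settings" mentioned above, here is a minimal sketch of how one is usually installed with `setOnExceptionResponse`. The handler name `onError`, the always-failing `app`, and port 3000 over plain HTTP are assumptions for illustration (the curl sessions in this PR go through TLS, which is not shown here).

```haskell
{-# LANGUAGE OverloadedStrings #-}

import Control.Exception (ErrorCall (..), SomeException, throwIO)
import Network.HTTP.Types (status500)
import Network.Wai (Application, Response, responseLBS)
import Network.Wai.Handler.Warp
  ( Settings, defaultSettings, getPort, setOnExceptionResponse, setPort )

-- Hypothetical handler: turn any uncaught exception into a plain-text 500.
onError :: SomeException -> Response
onError _ =
  responseLBS
    status500
    [("Content-Type", "text/plain; charset=utf-8")]
    "Something went wrong"

-- Hypothetical application that always throws before producing a response.
app :: Application
app _request _respond = throwIO (ErrorCall "boom")

-- Settings carrying the handler; the port number is an assumption.
settings :: Settings
settings = setOnExceptionResponse onError (setPort 3000 defaultSettings)
```

Serving it would then be `runSettings settings app`, with `runSettings` imported from the same Warp module.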

HTTP/1 Implementation

We need to distinguish between several failure modes:

| Issue | Connection Recoverable? |
| --- | --- |
| Errors thrown by an `Application` prior to returning a response. | ✓ (we have a response handler from `Settings`, so we can generate a well-formed response) |
| `Socket`s breaking while sending a response | ✗ (can't send any more data 🤷‍♂️) |
| `IOException`s from files referenced in `ResponseFile` (for example: what happens if a file gets modified or deleted while it's being streamed?) | ✗ (Content-Length headers expect the response to be a certain size, so if we don't kill the connection or stream, we'd have to feed bad data as padding) |
| Application errors while streaming the body via `ResponseBuilder` or `ResponseStream`. | ✗ (allowing the response to finish would indicate the request completed as intended, possibly feeding the client malformed data) |
| Clients sending malformed or overly long requests | ✗ (malformed requests are generally irrecoverable, and overly long requests would require us to consume the full body, so the impasse requires a connection close) |

The general heuristic here is that once we start to flush the response, any exception thrown during the flushing process means the connection is considered broken. To accomplish this, we wrap exceptions thrown while responding with `ExceptionInsideResponseBody`. If an uncaught exception has this wrapper, we close the connection; otherwise, we use the provided exception handler from `Settings` and keep the connection alive, since we're not wedged in an irrecoverable state.
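The heuristic can be sketched roughly as follows. This is not the code from the PR: the newtype is a local stand-in for the wrapper named above, and `ConnectionFate`, `whileFlushing`, `classify`, and `serveOne` are hypothetical names chosen for illustration.

```haskell
import Control.Exception
  ( Exception, SomeException, catch, fromException, throwIO, try )

-- Local stand-in for the wrapper named in the PR (assumed shape).
newtype ExceptionInsideResponseBody = ExceptionInsideResponseBody SomeException
  deriving Show

instance Exception ExceptionInsideResponseBody

-- Hypothetical result type: what to do with the connection afterwards.
data ConnectionFate = KeepAlive | CloseConnection
  deriving (Eq, Show)

-- Run the flushing phase, re-throwing anything it raises inside the wrapper.
-- Note: this simple version also wraps asynchronous exceptions.
whileFlushing :: IO a -> IO a
whileFlushing action = action `catch` wrap
  where
    wrap :: SomeException -> IO a
    wrap = throwIO . ExceptionInsideResponseBody

-- A wrapped exception means bytes may already be on the wire, so the
-- connection is closed; anything else can still get a well-formed error
-- response and the connection can be kept.
classify :: SomeException -> ConnectionFate
classify e = case fromException e of
  Just (ExceptionInsideResponseBody _) -> CloseConnection
  Nothing -> KeepAlive

-- Handle one request: run the application, then flush its response.
serveOne :: IO () -> IO () -> IO ConnectionFate
serveOne runApp flushResponse = do
  result <- try (runApp >> whileFlushing flushResponse)
  pure $ case result of
    Right () -> KeepAlive
    Left e -> classify e
```

In this sketch, `serveOne (throwIO (userError "app failed")) (pure ())` yields `KeepAlive`, while a failure in the second argument yields `CloseConnection`.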

HTTP/2 Implementation

HTTP/2 doesn't reuse streams, so if a stream breaks, new requests on the same connection are fine. However, we use the same `ExceptionInsideResponseBody` wrapper to distinguish exceptions thrown during the flushing process from those thrown before it.

Before:

curl https://app.lvh.me:3000/error/wai https://app.lvh.me:3000/error/wai -v          
...
* Using HTTP2, server supports multi-use
* Connection state changed (HTTP/2 confirmed)
* Copying HTTP/2 data in stream buffer to connection buffer after upgrade: len=0
* Using Stream ID: 1 (easy handle 0x558f2a82f900)
> GET /error/wai HTTP/2
> Host: app.lvh.me:3000
> User-Agent: curl/7.58.0
> Accept: */*
> 
* Connection state changed (MAX_CONCURRENT_STREAMS updated)!
* HTTP/2 stream 1 was not closed cleanly: INTERNAL_ERROR (err 2)
* Connection #0 to host app.lvh.me left intact
curl: (92) HTTP/2 stream 1 was not closed cleanly: INTERNAL_ERROR (err 2)
* Found bundle for host app.lvh.me: 0x558f2a82f6a0 [can multiplex]
* Re-using existing connection! (#0) with host app.lvh.me
* Connected to app.lvh.me (127.0.0.1) port 3000 (#0)
* Using Stream ID: 3 (easy handle 0x558f2a82f900)
> GET /error/wai HTTP/2
> Host: app.lvh.me:3000
> User-Agent: curl/7.58.0
> Accept: */*
> 
* HTTP/2 stream 3 was not closed cleanly: INTERNAL_ERROR (err 2)
* Connection #0 to host app.lvh.me left intact
curl: (92) HTTP/2 stream 3 was not closed cleanly: INTERNAL_ERROR (err 2)

After:

curl https://app.lvh.me:3000/error/wai https://app.lvh.me:3000/error/wai -v
...
* Using HTTP2, server supports multi-use
* Connection state changed (HTTP/2 confirmed)
* Copying HTTP/2 data in stream buffer to connection buffer after upgrade: len=0
* Using Stream ID: 1 (easy handle 0x5591c789c900)
> GET /error/wai HTTP/2
> Host: app.lvh.me:3000
> User-Agent: curl/7.58.0
> Accept: */*
> 
* Connection state changed (MAX_CONCURRENT_STREAMS updated)!
< HTTP/2 500 
< date: Fri, 19 Apr 2019 06:30:31 GMT
< server: Warp/3.2.26
< content-type: text/plain; charset=utf-8
< 
* Connection #0 to host app.lvh.me left intact
Something went wrong* Found bundle for host app.lvh.me: 0x5591c789c6a0 [can multiplex]
* Re-using existing connection! (#0) with host app.lvh.me
* Connected to app.lvh.me (127.0.0.1) port 3000 (#0)
* Using Stream ID: 3 (easy handle 0x5591c789c900)
> GET /error/wai HTTP/2
> Host: app.lvh.me:3000
> User-Agent: curl/7.58.0
> Accept: */*
> 
< HTTP/2 500 
< date: Fri, 19 Apr 2019 06:30:31 GMT
< server: Warp/3.2.26
< content-type: text/plain; charset=utf-8
< 
* Connection #0 to host app.lvh.me left intact
Something went wrong%

iand675 · 2019-04-19T09:27:59Z

I don't think the Windows test failure is due to this change? If it is, I would appreciate some help fixing it, as I don't have a Windows dev environment.

kazu-yamamoto · 2019-04-24T03:50:57Z

CI test runs again.

kazu-yamamoto

This looks terrific!

kazu-yamamoto · 2019-04-25T00:45:30Z

Rebased and merged. Thank you for your contribution!

iand675 added 4 commits April 18, 2019 21:34

Preserve connections in some exceptional situations for HTTP 1.x

5788390

Fix typos

dbaf7fc

Properly use exception response callback in HTTP2 stream responses.

1a63e79

Clean up code a bit after writing clarified some things

8fd84b2

iand675 mentioned this pull request Apr 19, 2019

Safari won't load (also: HTTP/2 stream 1 was not closed cleanly: PROTOCOL_ERROR) #703

Closed

kazu-yamamoto self-requested a review April 24, 2019 03:46

kazu-yamamoto approved these changes Apr 25, 2019

kazu-yamamoto added a commit to kazu-yamamoto/wai that referenced this pull request Apr 25, 2019

Merge PR yesodweb#740

2b07b8f

kazu-yamamoto mentioned this pull request Apr 25, 2019

Releasing warp 3.2.27 #742

Closed

kazu-yamamoto closed this Apr 25, 2019

kazu-yamamoto mentioned this pull request Dec 22, 2023

delete unused ExceptionInsideResponseBody exception #962

Open
