carbon-language · geoffromer · Jun 18, 2020 · Jun 18, 2020 · Jun 18, 2020 · Jun 23, 2020
diff --git a/docs/project/principles/error_handling.md b/docs/project/principles/error_handling.md
@@ -0,0 +1,238 @@
+# Principles: Error handling
+
+<!--
+Part of the Carbon Language, under the Apache License v2.0 with LLVM
+Exceptions. See /LICENSE for license information.
+SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
+-->
+
+<!-- toc -->
+
+- [Principles](#principles)
+  - [Programming errors are not recoverable](#programming-errors-are-not-recoverable)
+    - [Examples](#examples)
+  - [Memory exhaustion is not a recoverable error](#memory-exhaustion-is-not-a-recoverable-error)
+    - [Examples](#examples-1)
+    - [Caveats](#caveats)
+  - [Recoverable errors are explicit in function declarations](#recoverable-errors-are-explicit-in-function-declarations)
+  - [Recoverable errors are explicit at the callsite](#recoverable-errors-are-explicit-at-the-callsite)
+  - [Error propagation must be straightforward](#error-propagation-must-be-straightforward)
+  - [No universal error categories](#no-universal-error-categories)
+- [Other resources](#other-resources)
+
+<!-- tocstop -->
+
+## Principles
+
+### Programming errors are not recoverable
+
+The Carbon language and standard library will not use recoverable
+error-reporting mechanisms to report programming errors. Furthermore, Carbon's
+design will not prioritize use cases involving recovery from programming errors.
-error-reporting mechanisms to report programming errors. Furthermore, Carbon's
-design will not prioritize use cases involving recovery from programming errors.
+error-reporting mechanisms to report programming errors, i.e. errors caused by
+incorrect user code. Furthermore, Carbon's design will not prioritize use cases
+involving recovery from programming errors.
-error-reporting mechanisms to report programming errors. Furthermore, Carbon's
-design will not prioritize use cases involving recovery from programming errors.
+error-reporting mechanisms to report programming errors, i.e. errors caused by
+incorrect user code. Furthermore, Carbon's design will not prioritize use cases
+involving recovery from programming errors.
+
+Recovering from an error generally consists of discarding or reverting any state
+that might be invalidated by the original cause of the error, and then
+transferring control to a point that doesn't depend on the discarded state. For
+example, a function that reads data from a file and validates a checksum might
+avoid modifying any nonlocal state until validation is successful, and return
+early if validation fails. This recovery strategy relies on the fact that the
+likely causes of the failure are known and bounded (probably a malformed input
+file or an I/O error), which allows us to put a bound on the state that might
+have been invalidated.
+
+A _programming error_ is an error caused by incorrect user code, such as failing
+to satisfy the preconditions of an operation. When such an error is detected,
+it's not possible for the program to know, or even plausibly guess, what the
+original cause is. For example, dereferencing a dangling pointer is
+unambiguously a programming error, but it can have many possible causes. The
+author of the code might have forgotten to check some condition before
+dereferencing, or the caller might have passed a dangling pointer into the
+function, or some other code might have released the memory too early, or any
+number of other possibilities. Without more information, it's impossible to
+know, so the only way to somewhat reliably recover from a programming error is
+to discard the entire address space and terminate the program.
+
+Thus, we expect that supporting recovery from programming errors would provide
+little or no benefit. Furthermore, it would be harmful to several of Carbon's
+primary goals:
+
+- [Performance-critical software](/docs/project/goals.md#performance-critical-software):
+  It would impose a pervasive performance overhead, because recoverable error
+  handling is never free, and a programming error can occur anywhere.
+- [Code that is easy to read, understand, and write](/docs/project/goals.md#code-that-is-easy-to-read-understand-and-write):
+  Because potential programming errors are pervasive, they would have to
+  propagate invisibly, which makes code harder to understand (see
+  [below](#recoverable-errors-are-explicit-at-the-callsite)).
+- [Software and language evolution](/docs/project/goals.md#both-software-and-language-evolution):
+  It would inhibit evolution of Carbon libraries, and the Carbon language, by
+  preventing them from changing how they respond to incorrect code.
+- [Practical safety guarantees and testing mechanisms](/docs/project/goals.md#practical-safety-guarantees-and-testing-mechanisms):
+  Similarly, it would prevent Carbon users from choosing different
+  performance/safety tradeoffs for handling programming errors: if an
+  out-of-bounds array access is required to throw an exception, users can't
+  disable bounds checks, regardless of their risk tolerance, because code might
+  rely on those exceptions being thrown.
+
+#### Examples
+
+If Carbon supports contract checking or other forms of assertions, it will not
+permit callers to detect and handle assertion failures, even as an optional
+build mode. Assertion failures will only be presented in ways that don't alter
+the program state, such as logging, terminating the program, or trapping into a
+debugger.
+
-debugger.
+debugger.
+
+Dereferencing a dangling or null pointer will not be reported as a
+recoverable error. Doing so would impose significant performance
+overhead. It also wouldn't be useful; the original bug that resulted
+in a bad pointer could have been anywhere, so the only reliable way
+to recover from this situation is to  discard the entire address space
+and terminate the program.
-debugger.
+debugger.
+
+Dereferencing a dangling or null pointer will not be reported as a
+recoverable error. Doing so would impose significant performance
+overhead. It also wouldn't be useful; the original bug that resulted
+in a bad pointer could have been anywhere, so the only reliable way
+to recover from this situation is to  discard the entire address space
+and terminate the program.
+### Memory exhaustion is not a recoverable error
+
+The Carbon standard library's common-case APIs will not go out of their way to
+support treating memory exhaustion as a recoverable error.
+
+Memory exhaustion is not a programming error, and it is feasible to write code
+that can successfully recover from it. However, the available evidence indicates
+that very little C++ code actually does so correctly (for example, see section
+4.3 of
+[this paper](http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2019/p0709r4.pdf)),
+which suggests that very little C++ code actually needs to do so, and we see no
+reason to expect Carbon's users to differ in this respect.
+
+Supporting recovery from memory exhaustion would impose many of the same harms
+as supporting recovery from programming errors, and for the same basic reason:
+memory allocation is pervasive, and so a mechanism for recovering from it would
+have to be similarly pervasive. Furthermore, experience with C++ has shown that
+attempting to support memory exhaustion can seriously deform the design of an
+API.
+
+#### Examples
+
+The `pop` operation on a Carbon queue will return the value removed from the
+queue. This is in contrast to C++'s `std::queue::pop()`, which does not return
+the value popped from the queue, because
+[that would not be exception-safe](https://isocpp.org/blog/2016/06/quick-q-why-doesnt-stdqueuepop-return-value)
+due to the possibility of an out-of-memory error while copying that value.
+Instead, the user must first examine the front of the queue, and then pop it as
+a separate operation. Not only is this awkward for users, it means that
+concurrent queues cannot match the API of non-concurrent queues, because
+separate `front()` and `pop()` calls would create a race condition.
+
+#### Caveats
+
+Carbon will probably provide a low-level way to allocate heap memory that makes
+allocation failure recoverable, because doing so appears to have few drawbacks.
+However, users may need to build their own libraries on top of it, rather that
+relying on the Carbon standard library, if they want to take advantage of it.
+There probably will not be a way to recover from _stack_ exhaustion, because
+there is no known way of doing that without major drawbacks, and users who can't
+tolerate crashing due to stack overflow can normally prevent it using static
+analysis.
+
+### Recoverable errors are explicit in function declarations
+
+Carbon functions that can emit recoverable errors will always be explicitly
+marked in all function declarations, either as part of the return type or as a
+separate property of the function.
+
+The possibility of emitting recoverable errors is nearly as fundamental to a
+function's API as its return type, and so Carbon APIs will be substantially
+clearer to read, and safer to use, if we require consistent, compiler-checked
+documentation of that property. Furthermore, as noted above, the mechanisms for
+emitting a recoverable error always impose some performance overhead, so the
+compiler must be able to distinguish the functions that need that overhead from
+the ones that do not.
+
+The default should be that functions do not emit errors, because that's the
+simpler and more efficient behavior, and we also expect it to be the common
+case.
+
+### Recoverable errors are explicit at the callsite
+
+Operations that can emit recoverable errors will always be explicitly marked at
+the point of use.
+
+If errors can propagate silently, as with exceptions in most languages,
+functions that they propagate through will have control flow paths that are not
+visible to the reader. It is extremely difficult to reason about procedural code
+when you aren't aware of all control flow paths, so this approach makes code
+harder to understand, maintain, and debug, especially in large cases where
+readers may not be familiar with the code above and below them in the call
+stack.
+
+Conversely, if errors can be silently ignored, as with error return codes in
+many languages, it creates a major risk of accidentally resuming normal
+execution without actually recovering from the error (that is, without
+discarding invalidated state). This, too, would make it extremely difficult to
+reason correctly about Carbon code.
+
+Either possibility would also allow code to evolve in unsafe ways. Changing a
+function to allow it to emit errors is semantically a breaking change: client
+code must now contend with a previously-impossible failure case. Requiring
+errors to be marked at the callsite ensures that this breakage manifests at
+build time.
+
+### Error propagation must be straightforward
+
+Carbon will provide a means to propagate recoverable errors from any function
+call to the caller of the enclosing function, with minimal textual overhead.
+
+In our experience, it is very common for C++ code to propagate errors across
+multiple layers of the call stack. C++ exceptions support this natively, and
+programmers in environments without exceptions usually develop a lightweight way
+to propagate errors explicitly, typically by using a macro containing a
+conditional `return`. In some cases they even resort to using nonstandard
+language extensions in order to be able to use this operation within
+expressions, rather than only at the statement level.
+
+Given the ubiquity of this use case, Carbon must provide support for it that can
+be used with minimal changes the structure of the code, and without making the
+non-error-case logic less clear.
+
+### No universal error categories
+
+Carbon will not establish an error hierarchy or other reusable error vocabulary,
+and will not prioritize use cases that involve classifying and reacting to the
+properties of a propagated error.
+
+Some languages attempt to impose a hierarchy or some other global classification
+scheme for errors, in order to allow code to respond differently to different
+kinds of errors, even after the errors have propagated some distance from the
+function that originally raised them. However, this practice tends to be quite
+brittle, because it almost inevitably requires relying on implementation
+details: if a function's contract gives different meanings to different errors
+it emits, it generally can't satisfy that contract by blindly propagating errors
+from the functions it calls. Conversely, if it doesn't have such a contract, its
+callers normally can't differentiate among the errors it emits without depending
+on its implementation details.
+
+It may make sense to distinguish certain categories of errors, if any layer of
+the stack can in principle respond to those errors, and the appropriate response
+requires only local knowledge. For example, any layer of the stack can respond
+to an out-of-memory error by releasing any unused caches. Similarly, any layer
+of the stack can respond to thread cancellation by ceasing any new computational
+work and propagating the signal _even if_ it could otherwise continue despite a
+failure at that point.
+
+However, such cases are caught between the horns of a dilemma: any error that's
+universal enough to be meaningful across arbitrary levels of the call stack is
+likely to be too pervasive for explicitly-marked propagation to be tolerable.
+Both of the above examples have that problem; we've already ruled out
+propagating out-of-memory errors because of their pervasiveness, and
+cancellation is likely to pose similar challenges, although cancellation can be
+ignored, which may simplify the problem somewhat.
+
+It is certainly possible to structure a codebase so that you can reliably
+propagate errors across multiple layers of the stack so long as you control
+those layers, and Carbon will support those use cases. However, it will do so as
+a byproduct of general-purpose programming facilities such as pattern matching;
+Carbon will not provide a separate sugar syntax for pattern-matching error
+metadata, especially if that syntax can encompass multiple potentially-failing
+operations. For example, if Carbon supports `try`/`catch` statements, they will
+always have a single `catch` block, which will be invoked for any error that
+escapes the `try` block.
-operations. For example, if Carbon supports `try`/`catch` statements, they will
-always have a single `catch` block, which will be invoked for any error that
-escapes the `try` block.
+operations. For example, if Carbon supports `try`/`catch` statements, the
+`catch` statements should not invent a new mechanism for dispatching on the
+kind of the exception.
-operations. For example, if Carbon supports `try`/`catch` statements, they will
-always have a single `catch` block, which will be invoked for any error that
-escapes the `try` block.
+operations. For example, if Carbon supports `try`/`catch` statements, the
+`catch` statements should not invent a new mechanism for dispatching on the
+kind of the exception.
+
+## Other resources
+
+Several other groups of language designers have arrived at similar principles.
+For example, see Swift's
+[error handling rationale](https://github.com/apple/swift/blob/master/docs/ErrorHandlingRationale.rst),
+[Joe Duffy's account](http://joeduffyblog.com/2016/02/07/the-error-model) of
+Midori's error model, and Herb Sutter's
+[pending proposal](http://wg21.link/P0709) for a new approach to exceptions in
+C++.
diff --git a/proposals/p0084.md b/proposals/p0084.md
@@ -0,0 +1,29 @@
+# Principles: Error handling
+
+<!--
+Part of the Carbon Language project, under the Apache License v2.0 with LLVM
+Exceptions. See /LICENSE for license information.
+SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
+-->
+
+[Pull request](https://github.com/carbon-language/carbon-lang/pull/84)
+
+## Table of contents
+
+<!-- toc -->
+
+- [Problem](#problem)
+- [Proposal](#proposal)
+
+<!-- tocstop -->
+
+## Problem
+
+Error-handling is a pervasive aspect of language and library design, and Carbon
+will need a consistent approach to it.
+
+## Proposal
+
+Introduce a set of
+[principles for error handling](docs/project/principles/error_handling.md). See
+that document for details.