From 8b45c9c4a97c0492b14b8a3692a66eff99ba58e7 Mon Sep 17 00:00:00 2001 From: Stefan Karpinski Date: Thu, 10 Jan 2019 11:53:14 -0500 Subject: [PATCH] string docs: clarify handling of overlong and too-high sequences --- doc/src/manual/strings.md | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/doc/src/manual/strings.md b/doc/src/manual/strings.md index d0f3a38016b80..60eea9cfa772b 100644 --- a/doc/src/manual/strings.md +++ b/doc/src/manual/strings.md @@ -384,8 +384,9 @@ a sequence of code units from left to right characters are formed by the longest * `10xxxxxx`; * `11111xxx`. -In particular this implies that overlong and too high code unit sequences are accepted. -This rule is best explained by an example: +In particular this means that overlong and too-high code unit sequences and prefixes thereof are treated +as a single invalid character rather than multiple invalid characters. +This rule may be best explained with an example: ```julia-repl julia> s = "\xc0\xa0\xe2\x88\xe2|"