Skip to content

Commit

Permalink
string docs: clarify handling of overlong and too-high sequences
Browse files Browse the repository at this point in the history
  • Loading branch information
StefanKarpinski committed Jan 10, 2019
1 parent 3b6773d commit 8b45c9c
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions doc/src/manual/strings.md
Original file line number Diff line number Diff line change
Expand Up @@ -384,8 +384,9 @@ a sequence of code units from left to right characters are formed by the longest
* `10xxxxxx`;
* `11111xxx`.

In particular this implies that overlong and too high code unit sequences are accepted.
This rule is best explained by an example:
In particular this means that overlong and too-high code unit sequences and prefixes thereof are treated
as a single invalid character rather than multiple invalid characters.
This rule may be best explained with an example:

```julia-repl
julia> s = "\xc0\xa0\xe2\x88\xe2|"
Expand Down

0 comments on commit 8b45c9c

Please sign in to comment.