US 3-030 5.2p1 [lex.phases] new-lines after phase 1 #475

wg21bot · 2022-10-30T10:58:24Z

Translation phases 2 and 3 assume that lines are terminated by “new-line characters”. However, the current specification of phase 1 does not guarantee that to be true. In particular, for a UTF-8 file the verbatim sequence of source file characters forms the input for phase 2, even on systems where the line terminator is a carriage return. The non-UTF-8 specification is also defective in that it speaks of “introducing” new-line characters, even for encodings like Latin-1 where new-lines might already be present and no “introduction” is needed or appropriate.

Proposed change:

If an input file is determined to be a UTF-8 file, then it shall be a well-formed UTF-8 code unit sequence and it is decoded to produce a sequence of UCS scalar values that constitutes the sequence of elements of the translation character set, representing each line-termination character or character sequence as a new-line character.
For any other kind of input file supported by the implementation, characters are mapped, in an implementation-defined manner, to a sequence of translation character set elements (5.3) (~~introducing new-line characters for~~ representing end-of-line indicators as new-line characters).
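
To illustrate what "representing each line-termination character or character sequence as a new-line character" could mean in practice, here is a minimal sketch. It is not the wording CWG adopted and not any compiler's implementation. The function name `normalize_new_lines` is hypothetical. The sketch assumes the input has already been decoded to UCS scalar values, and that the line terminators are CR LF, lone CR, and lone LF. A real implementation may recognize a different set.

```cpp
#include <cstddef>
#include <string>
#include <string_view>

// Hypothetical helper (not from the issue or the standard): rewrites each
// line terminator in an already-decoded sequence as a single U'\n'.
// Assumption: terminators are CR LF, lone CR, or lone LF.
std::u32string normalize_new_lines(std::u32string_view decoded) {
    std::u32string out;
    out.reserve(decoded.size());
    for (std::size_t i = 0; i < decoded.size(); ++i) {
        const char32_t c = decoded[i];
        if (c == U'\r') {
            // Treat CR LF as one terminator by skipping the following LF.
            if (i + 1 < decoded.size() && decoded[i + 1] == U'\n') {
                ++i;
            }
            out.push_back(U'\n');
        } else {
            // Lone LF and all other characters pass through unchanged.
            out.push_back(c);
        }
    }
    return out;
}
```

With this mapping, a source file that uses carriage returns as line terminators would reach phases 2 and 3 with the new-line characters those phases assume.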

jensmaurer · 2022-11-07T07:40:52Z

CWG2639

jensmaurer · 2022-11-08T23:26:15Z

CWG 2022-11-08: Accept with Modifications. See CWG2639 for details.

wg21bot changed the title ~~US 5.2p1 [lex.phases]~~ US 5.2p1 [lex.phases] new-lines after phase 1 Oct 30, 2022

wg21bot added the CWG Core label Oct 31, 2022

jensmaurer changed the title ~~US 5.2p1 [lex.phases] new-lines after phase 1~~ US 3-030 5.2p1 [lex.phases] new-lines after phase 1 Nov 3, 2022

jensmaurer transferred this issue from another repository Nov 3, 2022

jensmaurer added this to the CD C++23 milestone Nov 3, 2022

jensmaurer added the accepted label Nov 8, 2022

jensmaurer mentioned this issue Nov 19, 2022

[Motions 2022 11 cwg 4] Issues 2615, 2639, 2640, 2652, and 2653 from P2710R0 cplusplus/draft#5988

Merged

tkoeppe closed this as completed in cplusplus/draft#5988 Nov 23, 2022
