[9.x] Replace raw invisible characters in regex expressions with counterpart Unicode regex notations #45680

restuff · 2023-01-17T10:14:07Z

It is good practice to avoid, or at least minimize, unescaped non-ASCII characters in source code, especially invisible ones.

#41949 extended the regex in the TrimStrings middleware to filter out "Zero Width No-Break Space" (\x{FEFF}) characters; however, the character was added to the source code as a raw, unescaped invisible character.

#44906 added one more invisible character, "Zero Width Space" (\x{200B}), to the same regex.

However, these characters are not visible in the GitHub diff view, nor in some text editors such as Notepad++ (even with the Show All Characters option turned on).

This PR improves the readability of the source code by replacing those invisible characters, in both places where they are used, with their equivalent Unicode regex notations, without affecting functionality.
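As a rough illustration of why the replacement is behavior-preserving, the sketch below compares an escaped pattern with the same pattern built from the raw characters. The helper name `trimInvisible` and the exact pattern are assumptions made for this example, modeled on the description above; they are not copied from the framework source.

```php
<?php

// Hypothetical helper (not the framework's actual method): trims whitespace
// plus U+FEFF and U+200B from both ends of a string.
function trimInvisible(string $value): string
{
    // \x{FEFF} and \x{200B} are PCRE code point escapes; they need the `u` modifier.
    // The single-quoted string passes the escapes through to PCRE untouched.
    return preg_replace('~^[\s\x{FEFF}\x{200B}]+|[\s\x{FEFF}\x{200B}]+$~u', '', $value) ?? trim($value);
}

// The same pattern with the raw characters embedded in it. They are built via
// PHP's "\u{...}" string escape so that this example itself stays readable.
$raw = "\u{FEFF}\u{200B}";
$rawPattern = '~^[\s'.$raw.']+|[\s'.$raw.']+$~u';

// Leading U+FEFF and space, an inner U+200B, trailing space and U+200B.
$input = "\u{FEFF} hello\u{200B}world \u{200B}";

// Both patterns should produce the same result for this input.
var_dump(trimInvisible($input) === preg_replace($rawPattern, '', $input));

// Only the leading and trailing characters are stripped; the inner U+200B stays.
var_dump(trimInvisible($input) === "hello\u{200B}world");
```

The escaped form has the added benefit that a reviewer can see exactly which code points the character class contains.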

\Illuminate\Foundation\Http\Middleware\TrimStrings::transform():

\Illuminate\Support\Str::squish():

There is no need to modify the corresponding tests, which use "real-life" unescaped raw invisible characters, because the functionality has not changed; however, some spelling corrections were made there.

taylorotwell · 2023-01-18T15:07:35Z

Thanks

Commit 5cfa997: Replace raw invisible characters in regex expressions with counterpart Unicode regex notations

restuff changed the title ~~Replace raw invisible characters in regex expressions with counterpar…~~ [9.x] Replace raw invisible characters in regex expressions with counterpar… Jan 17, 2023

restuff changed the title ~~[9.x] Replace raw invisible characters in regex expressions with counterpar…~~ [9.x] Replace raw invisible characters in regex expressions with counterpart Unicode regex notations Jan 17, 2023

taylorotwell merged commit e4f05eb into laravel:9.x Jan 18, 2023
