-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use v
regexp flag instead of u
#178
Comments
The “set operations” would be useful. I‘m pretty sure the second never will be, though, because ... — the illusion that non-ASCII is matchable is limited to literal input: it converts non-ASCII input to percent encoded UTF-8, but regexp pattern components aren’t likewise “translated”, so expressing things like |
This CL follows the recent spec update in [1]. After this CL, the regular expression works with the unicodeSet mode, which allows the API to interpret set notations, multi-codepoint properties etc. Actually, this CL ends up only adding the support for set notations. At the moment the API doesn't accept unicode character class escape (\p{}, \P{}) [2], thus we wouldn't add the multi-codepoint match functionarity. There are some incompatibility between "u" and "v", some patterns are privously valid but now errors. However, from UMA the impact looks very limited, it's only around 0.3% of the total constructor calls. So this CL changes the default flag to "v". Also add a kill switch. [1] whatwg/urlpattern#178 [2] https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Regular_expressions/Unicode_character_class_escape Change-Id: I14cc3420d57cca44c0c25867d05802a8a666cd8c Bug: 1482263
This CL follows the recent spec update in [1]. After this CL, the regular expression works with the unicodeSet mode, which allows the API to interpret set notations, multi-codepoint properties etc. Actually, this CL ends up only adding the support for set notations. At the moment the API doesn't accept unicode character class escape (\p{}, \P{}) [2], thus we wouldn't add the multi-codepoint match functionarity. There are some incompatibility between "u" and "v", some patterns are privously valid but now errors. However, from UMA the impact looks very limited, it's only around 0.3% of the total constructor calls. So this CL changes the default flag to "v". Also add a kill switch. [1] whatwg/urlpattern#178 [2] https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Regular_expressions/Unicode_character_class_escape Change-Id: I14cc3420d57cca44c0c25867d05802a8a666cd8c Bug: 1482263 Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/4861342 Reviewed-by: Jeremy Roman <jbroman@chromium.org> Commit-Queue: Shunya Shishido <sisidovski@chromium.org> Cr-Commit-Position: refs/heads/main@{#1221823}
This CL follows the recent spec update in [1]. After this CL, the regular expression works with the unicodeSet mode, which allows the API to interpret set notations, multi-codepoint properties etc. Actually, this CL ends up only adding the support for set notations. At the moment the API doesn't accept unicode character class escape (\p{}, \P{}) [2], thus we wouldn't add the multi-codepoint match functionarity. There are some incompatibility between "u" and "v", some patterns are privously valid but now errors. However, from UMA the impact looks very limited, it's only around 0.3% of the total constructor calls. So this CL changes the default flag to "v". Also add a kill switch. [1] whatwg/urlpattern#178 [2] https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Regular_expressions/Unicode_character_class_escape Change-Id: I14cc3420d57cca44c0c25867d05802a8a666cd8c Bug: 1482263 Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/4861342 Reviewed-by: Jeremy Roman <jbroman@chromium.org> Commit-Queue: Shunya Shishido <sisidovski@chromium.org> Cr-Commit-Position: refs/heads/main@{#1221823}
…attern, a=testonly Automatic update from web-platform-tests Use kUnicodeSets (v regexp flag) in URLPattern This CL follows the recent spec update in [1]. After this CL, the regular expression works with the unicodeSet mode, which allows the API to interpret set notations, multi-codepoint properties etc. Actually, this CL ends up only adding the support for set notations. At the moment the API doesn't accept unicode character class escape (\p{}, \P{}) [2], thus we wouldn't add the multi-codepoint match functionarity. There are some incompatibility between "u" and "v", some patterns are privously valid but now errors. However, from UMA the impact looks very limited, it's only around 0.3% of the total constructor calls. So this CL changes the default flag to "v". Also add a kill switch. [1] whatwg/urlpattern#178 [2] https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Regular_expressions/Unicode_character_class_escape Change-Id: I14cc3420d57cca44c0c25867d05802a8a666cd8c Bug: 1482263 Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/4861342 Reviewed-by: Jeremy Roman <jbroman@chromium.org> Commit-Queue: Shunya Shishido <sisidovski@chromium.org> Cr-Commit-Position: refs/heads/main@{#1221823} -- wpt-commits: 3ce3e9794fcd97ff24506f5c5325f91fc00ef79c wpt-pr: 43014
…attern, a=testonly Automatic update from web-platform-tests Use kUnicodeSets (v regexp flag) in URLPattern This CL follows the recent spec update in [1]. After this CL, the regular expression works with the unicodeSet mode, which allows the API to interpret set notations, multi-codepoint properties etc. Actually, this CL ends up only adding the support for set notations. At the moment the API doesn't accept unicode character class escape (\p{}, \P{}) [2], thus we wouldn't add the multi-codepoint match functionarity. There are some incompatibility between "u" and "v", some patterns are privously valid but now errors. However, from UMA the impact looks very limited, it's only around 0.3% of the total constructor calls. So this CL changes the default flag to "v". Also add a kill switch. [1] whatwg/urlpattern#178 [2] https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Regular_expressions/Unicode_character_class_escape Change-Id: I14cc3420d57cca44c0c25867d05802a8a666cd8c Bug: 1482263 Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/4861342 Reviewed-by: Jeremy Roman <jbroman@chromium.org> Commit-Queue: Shunya Shishido <sisidovski@chromium.org> Cr-Commit-Position: refs/heads/main@{#1221823} -- wpt-commits: 3ce3e9794fcd97ff24506f5c5325f91fc00ef79c wpt-pr: 43014
…attern, a=testonly Automatic update from web-platform-tests Use kUnicodeSets (v regexp flag) in URLPattern This CL follows the recent spec update in [1]. After this CL, the regular expression works with the unicodeSet mode, which allows the API to interpret set notations, multi-codepoint properties etc. Actually, this CL ends up only adding the support for set notations. At the moment the API doesn't accept unicode character class escape (\p{}, \P{}) [2], thus we wouldn't add the multi-codepoint match functionarity. There are some incompatibility between "u" and "v", some patterns are privously valid but now errors. However, from UMA the impact looks very limited, it's only around 0.3% of the total constructor calls. So this CL changes the default flag to "v". Also add a kill switch. [1] whatwg/urlpattern#178 [2] https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Regular_expressions/Unicode_character_class_escape Change-Id: I14cc3420d57cca44c0c25867d05802a8a666cd8c Bug: 1482263 Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/4861342 Reviewed-by: Jeremy Roman <jbroman@chromium.org> Commit-Queue: Shunya Shishido <sisidovski@chromium.org> Cr-Commit-Position: refs/heads/main@{#1221823} -- wpt-commits: 3ce3e9794fcd97ff24506f5c5325f91fc00ef79c wpt-pr: 43014
…attern, a=testonly Automatic update from web-platform-tests Use kUnicodeSets (v regexp flag) in URLPattern This CL follows the recent spec update in [1]. After this CL, the regular expression works with the unicodeSet mode, which allows the API to interpret set notations, multi-codepoint properties etc. Actually, this CL ends up only adding the support for set notations. At the moment the API doesn't accept unicode character class escape (\p{}, \P{}) [2], thus we wouldn't add the multi-codepoint match functionarity. There are some incompatibility between "u" and "v", some patterns are privously valid but now errors. However, from UMA the impact looks very limited, it's only around 0.3% of the total constructor calls. So this CL changes the default flag to "v". Also add a kill switch. [1] whatwg/urlpattern#178 [2] https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Regular_expressions/Unicode_character_class_escape Change-Id: I14cc3420d57cca44c0c25867d05802a8a666cd8c Bug: 1482263 Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/4861342 Reviewed-by: Jeremy Roman <jbroman@chromium.org> Commit-Queue: Shunya Shishido <sisidovski@chromium.org> Cr-Commit-Position: refs/heads/main@{#1221823} -- wpt-commits: 3ce3e9794fcd97ff24506f5c5325f91fc00ef79c wpt-pr: 43014
The
v
flags add more support to regular expression features:[[a-z]--[dhk]]
to express "the seta-z
, excluding the setdhk
(https://github.com/tc39/proposal-regexp-v-flag)\p
, such as\p{RGI_Emoji_ZWJ_Sequence}
(https://github.com/tc39/proposal-regexp-unicode-sequence-properties)The HTML
<input>
'spattern
attribute has also been recently updated to usev
instead ofu
(https://html.spec.whatwg.org/#compiled-pattern-regular-expression).The text was updated successfully, but these errors were encountered: