Actions
Bug #21503
open\p{Word} does not match on \p{Join_Control} while docs say it does
Description
in the docs it is mentioned that \p{Word}
matches the equivalent of: [\p{M}\p{Nd}\p{Pc}\p{Alpha}\p{Join_Control}]
as it's also defined in the unicode spec
the issue is that it does not seem to be the case
irb(main):018> REGEX = /\p{Word}/u
=> /\p{Word}/
irb(main):019> "\u200D".gsub(REGEX, "-")
=> ""
irb(main):020> REGEX2 = /\p{Join_Control}/u
=> /\p{Join_Control}/
irb(main):021> "\u200D".gsub(REGEX2, "-")
=> "-"
There's 2 solutions here, either we change the docs or the code.
Updated by procmarco (Marco Concetto Rudilosso) about 22 hours ago
What I mean is that the current implementation of \p{Word}
does not seem to match \p{Join_Control}
even though it should and it also says so in the docs
Updated by mame (Yusuke Endoh) about 14 hours ago
- Related to Bug #19417: Regexp \p{Word} and [[:word:]] do not match Unicode Other_Number character added
Updated by mame (Yusuke Endoh) about 14 hours ago
There is already a PR for that: https://github.com/ruby/ruby/pull/7711
Can you take a look? @duerst (Martin Dürst) @naruse (Yui NARUSE)
Actions
Like0
Like0Like0Like0