[Python-Dev] Regular expressions: splitting on zero-width patterns
MRAB
python at mrabarnett.plus.com
Tue Nov 28 17:23:04 EST 2017
On 2017-11-28 20:04, Serhiy Storchaka wrote:
> The two largest problems in the re module are splitting on zero-width
> patterns and complete and correct support of the Unicode standard. These
> problems are solved in regex. regex has many other features, but they
> are less important.
>
> I want to tell the problem of splitting on zero-width patterns. It
> already was discussed on Python-Dev 13 years ago [3] and maybe later.
> See also issues: [4], [5], [6], [7], [8].
>
[snip]
After some thought, I've decided that if this happens in the re module
in Python 3.7, then, for the sake of compatibility (and because the edge
cases are debatable anyway), I'll have the regex module do the same when
used on Python 3.7.
More information about the Python-Dev
mailing list