[Python-Dev] Regular expressions: splitting on zero-width patterns
MRAB
python at mrabarnett.plus.com
Tue Nov 28 18:39:09 EST 2017
On 2017-11-28 22:27, Guido van Rossum wrote:
> On Tue, Nov 28, 2017 at 2:23 PM, MRAB <python at mrabarnett.plus.com
> <mailto:python at mrabarnett.plus.com>> wrote:
>
> On 2017-11-28 20:04, Serhiy Storchaka wrote:
>
> The two largest problems in the re module are splitting on
> zero-width
> patterns and complete and correct support of the Unicode
> standard. These
> problems are solved in regex. regex has many other features,
> but they
> are less important.
>
> I want to tell the problem of splitting on zero-width patterns. It
> already was discussed on Python-Dev 13 years ago [3] and maybe
> later.
> See also issues: [4], [5], [6], [7], [8].
>
> [snip]
> After some thought, I've decided that if this happens in the re
> module in Python 3.7, then, for the sake of compatibility (and
> because the edge cases are debatable anyway), I'll have the regex
> module do the same when used on Python 3.7.
>
>
> Maybe it should also be selectable with a version flag?
>
Well, when anyone who uses re updates to Python 3.7, they'll be faced
with the change anyway.
More information about the Python-Dev
mailing list