28 Nov
2017
28 Nov
'17
5:23 p.m.
On 2017-11-28 20:04, Serhiy Storchaka wrote:
The two largest problems in the re module are splitting on zero-width patterns and complete and correct support of the Unicode standard. These problems are solved in regex. regex has many other features, but they are less important.
I want to tell the problem of splitting on zero-width patterns. It already was discussed on Python-Dev 13 years ago [3] and maybe later. See also issues: [4], [5], [6], [7], [8].
[snip] After some thought, I've decided that if this happens in the re module in Python 3.7, then, for the sake of compatibility (and because the edge cases are debatable anyway), I'll have the regex module do the same when used on Python 3.7.