<div dir="ltr"><div class="gmail_default" style="font-family:verdana,sans-serif"><br></div><div class="gmail_extra"><br><div class="gmail_quote">On 28 October 2017 at 16:48, Steven D'Aprano <span dir="ltr"><<a href="mailto:steve@pearwood.info" target="_blank" class="gmail-cremed gmail-cremed gmail-cremed cremed">steve@pearwood.info</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><span class="gmail-">On Sun, Oct 29, 2017 at 12:31:01AM +0100, MRAB wrote:<br>
<br>
> Not that I'm planning on making any further additions, just bug fixes<br>
> and updates to follow the Unicode updates. I think I've crammed enough<br>
> into it already. There's only so much you can do with the regex syntax<br>
> with its handful of metacharacters and possible escape sequences...<br>
<br>
</span>What do you think of the Perl 6 regex syntax?<br>
<br>
<a href="https://en.wikipedia.org/wiki/Perl_6_rules#Changes_from_Perl_5" rel="noreferrer" target="_blank" class="gmail-cremed gmail-cremed gmail-cremed cremed">https://en.wikipedia.org/wiki/<wbr>Perl_6_rules#Changes_from_<wbr>Perl_5</a></blockquote><div><br></div><div><div class="gmail_default" style="font-family:verdana,sans-serif">​If you're going to change the notation, why not use notations similar to what linguists use for FSTs? These allow building FSTs (with operations such as adding/subtracting/composing/projecting FSTs) with millions of states — and there are some impressive optimisers for them also, so that encoding a dictionary with inflections is both more compact and faster than a hash of just the words without inflections. Some of this work is open source, but I haven't kept up with it.</div><div class="gmail_default" style="font-family:verdana,sans-serif"><br></div><div class="gmail_default" style="font-family:verdana,sans-serif">If you're interested, you can start here:</div><div class="gmail_default" style="font-family:verdana,sans-serif"><a href="http://web.stanford.edu/~laurik/">http://web.stanford.edu/~laurik/</a>​</div><div class="gmail_default"><font face="verdana, sans-serif"><a href="http://web.stanford.edu/~laurik/publications/TR-2010-01.pdf">http://web.stanford.edu/~laurik/publications/TR-2010-01.pdf</a></font><br></div><div class="gmail_default"><font face="verdana, sans-serif"><a href="http://web.stanford.edu/group/cslipublications/cslipublications/site/1575864347.shtml">http://web.stanford.edu/group/cslipublications/cslipublications/site/1575864347.shtml</a><br></font></div><div class="gmail_default"><font face="verdana, sans-serif">etc.</font></div><div class="gmail_default"><font face="verdana, sans-serif"><br></font></div><div class="gmail_default"><font face="verdana, sans-serif">;)</font></div><div class="gmail_default"><font face="verdana, sans-serif"><br></font></div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><br>
<span class="gmail-HOEnZb"><font color="#888888"><br>
<br>
<br>
--<br>
Steve<br>
</font></span><div class="gmail-HOEnZb"><div class="gmail-h5">______________________________<wbr>_________________<br>
Python-Dev mailing list<br>
<a href="mailto:Python-Dev@python.org" class="gmail-cremed gmail-cremed gmail-cremed cremed">Python-Dev@python.org</a><br>
<a href="https://mail.python.org/mailman/listinfo/python-dev" rel="noreferrer" target="_blank" class="gmail-cremed gmail-cremed gmail-cremed cremed">https://mail.python.org/<wbr>mailman/listinfo/python-dev</a><br>
Unsubscribe: <a href="https://mail.python.org/mailman/options/python-dev/pludemann%40google.com" rel="noreferrer" target="_blank" class="gmail-cremed gmail-cremed gmail-cremed cremed">https://mail.python.org/<wbr>mailman/options/python-dev/<wbr>pludemann%40google.com</a><br>
</div></div></blockquote></div><br></div></div>