regex for href substitution

Jp Calderone exarkun at intarweb.us
Tue Feb 18 18:20:19 EST 2003


On Tue, Feb 18, 2003 at 10:44:44PM +0000, Robin Becker wrote:
> I'm sure this must have been done before, but has anyone got a regex for
> extracting/changing html href attributes. I've done this before with
> htmllib, but it's been suggested that we can do this with re.
> 
> The application involves doing a special purpose forwarding proxy, so
> perhaps someone has already done something similar.

  Wow.

  Regular expressions are the wrong solution for manipulating sgml. 
htmllib is the right solution.  I'd stick with what I had, if I were you.

  Jp

--
http://catandgirl.com/view.cgi?44
-- 
 up 10 days, 4:28, 4 users, load average: 0.08, 0.07, 0.03
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 196 bytes
Desc: not available
URL: <http://mail.python.org/pipermail/python-list/attachments/20030218/03d84068/attachment.sig>


More information about the Python-list mailing list