[Tutor] htmllib vs re question
tvbare at socket.net
Fri Mar 10 00:18:46 CET 2006
I want to parse some text from an HTML file that contains
blocks of pre-formatted text. All I'm after is what's between
the <pre> and </pre> tags.
My first thought was to use re for this, but looking through
the Library Reference, I see the htmllib module. Is htmllib
overkill for this job?
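To make the comparison concrete, here is a rough sketch of both approaches. It is only an illustration under some assumptions: htmllib exists only in Python 2 and was removed in Python 3, so the parser version below swaps in the standard library's html.parser instead. The names SAMPLE, PRE_RE, PreCollector, extract_pre_re and extract_pre_parser are made up for this example, and the sample HTML is invented.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample input; a real script would read the HTML file instead.
SAMPLE = """<html><body>
<p>intro</p>
<pre>first block
line two</pre>
<p>between</p>
<PRE class="x">second &amp; last</PRE>
</body></html>"""

# re approach: non-greedy match; DOTALL lets "." span newlines and
# IGNORECASE accepts <PRE> as well. Entities such as &amp; are NOT decoded.
PRE_RE = re.compile(r"<pre\b[^>]*>(.*?)</pre\s*>", re.DOTALL | re.IGNORECASE)


def extract_pre_re(html):
    """Return the raw text of every <pre> block using a regular expression."""
    return PRE_RE.findall(html)


class PreCollector(HTMLParser):
    """Collect the text inside each <pre>...</pre> element."""

    def __init__(self):
        # convert_charrefs=True decodes entities before handle_data is called.
        super().__init__(convert_charrefs=True)
        self.blocks = []
        self._depth = 0   # greater than 0 while inside a <pre>
        self._parts = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag names in lower case.
        if tag == "pre":
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == "pre" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                self.blocks.append("".join(self._parts))
                self._parts = []

    def handle_data(self, data):
        if self._depth:
            self._parts.append(data)


def extract_pre_parser(html):
    """Return the decoded text of every <pre> block using html.parser."""
    parser = PreCollector()
    parser.feed(html)
    parser.close()
    return parser.blocks
```

The regular expression is shorter but returns the markup exactly as written, so entities and any tags nested inside the block come back untouched. The parser version is longer, but it decodes entities and drops nested tags while keeping their text.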
The HTML file size varies, but I don't expect the size to exceed
150-200k. Speed is not a big concern.
What is the Pythonic way and why?
Any recommendations or comments?