htmldata 1.0.4 - Manipulate HTML documents via data structure
connellybarnes at yahoo.com
Sat Dec 11 07:14:22 CET 2004
htmldata 1.0.4 is available.
The htmldata module allows one to translate HTML
documents back and forth to list data structures.
This allows for programmatic reading and writing
of HTML documents, with much flexibility.
Functions are also available for extracting
and/or modifying all URLs present in the HTML
or stylesheets of a document.
Version 1.0.4 is a bugfix release offering:
* Python 2.0-2.4 support (thanks to Paul Clinch
for the Python 2.2 patch)
* Properly working XHTML parsing.
* Miscellaneous other fixes (see the changelog
I have found this library useful for writing
robots, for "wrapping" all of the URLs on
websites inside my own proxy CGI script, for
filtering HTML, and for doing flexible wget-like
It keeps things as simple as possible, so it
should be easy to learn.
- Connelly Barnes
Do you Yahoo!?
Send holiday email and support a worthy cause. Do good.
More information about the Python-list