html processing

John J. Lee jjl at pobox.com
Sat Sep 25 09:27:19 EDT 2004


Ismael Herrera <ismaelherreragasser at hotmail.com> writes:
[...]
> So the htmllib is a regex search with empty methods? I can do that by
> myself, no need to use a parser. I am a little confused by this.

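It's a little more complicated than that: sgmllib uses regular
expressions to find the markup, but it also keeps track of parser state
and dispatches each tag to handler methods that you override.  But feel
free to rewrite sgmllib.py if you so please.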
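
To give a rough idea of the handler-method style, here is a minimal
sketch.  It assumes a current Python, where html.parser.HTMLParser is the
standard-library parser of this kind (htmllib and sgmllib are not used
here), and the class name LinkCollector is made up for the example.  The
parser does the tokenizing, attribute parsing and entity decoding; you
override only the methods you care about.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Hypothetical example class: collects href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # tag and attribute names arrive lowercased; attrs is a list of
        # (name, value) pairs with quotes removed and entities decoded.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


html = """<p>See <a href="/docs?a=1&amp;b=2">docs</a> and <A HREF='x.html'>x</A>.</p>"""

collector = LinkCollector()
collector.feed(html)
collector.close()
print(collector.links)
```

Note that the mixed-case tag, the two quoting styles and the &amp; in the
first URL are all handled by the parser, not by the subclass.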


> Is there a Python HTML parser that creates objects from HTML sources, like
> xml.dom?

Yes, many.

Which one is best for you depends on what it is you're trying to do.
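
If all you need is a small tree of objects, one possible approach is to
build it yourself on top of an event-style parser.  The sketch below is
only an illustration under some assumptions: it uses
html.parser.HTMLParser from a current Python standard library, the names
Node, TreeBuilder and parse are invented for the example, the list of
void elements is partial, and it expects reasonably well-formed input.
It is not a substitute for a parser designed to cope with badly broken
markup.

```python
from html.parser import HTMLParser


class Node:
    """Hypothetical minimal element node, loosely in the spirit of a DOM element."""

    def __init__(self, tag, attrs=None, parent=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.parent = parent
        self.children = []  # child Node objects and text strings, in document order


class TreeBuilder(HTMLParser):
    """Hypothetical example: builds a Node tree from reasonably well-formed HTML."""

    # Elements that never take an end tag (partial list, for illustration only).
    VOID = {"br", "hr", "img", "input", "meta", "link"}

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.current)
        self.current.children.append(node)
        if tag not in self.VOID:
            self.current = node  # descend into the new element

    def handle_endtag(self, tag):
        # Walk up to the nearest open element with this name;
        # a stray end tag with no matching open element is ignored.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        if data.strip():  # skip whitespace-only text
            self.current.children.append(data)


def parse(html):
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


doc = parse("<html><body><h1 id='top'>Title</h1><p>One<br>two</p></body></html>")
body = doc.children[0].children[0]
print([child.tag for child in body.children])
print(body.children[0].attrs)
```

Here doc is the synthetic root, its first child is the html element, and
body.children holds the h1 and p nodes, which you can then walk like any
other Python objects.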


John


