issues with htmlparser.getpos
Steve Holden
steve at holdenweb.com
Wed Jul 4 08:34:32 EDT 2007
dysmas wrote:
> Hi,
>
>
> Im having an issue with HTMLParser, the getpos() funtion sometimes
> returns things like :
>
> (1, 1247)
> (1, 2114)
> (1, 2168)
> (1, 2228)
> (1, 2295)
> (1, 2382)
> (1, 2441)
> (1, 2963)
> (1, 3040)
>
> i guess this is because the HTMLParser has not correctly parsed the
> newline characters in the string fed to it... is there a workaround
> for this, without checking the string every time i feed it some data?
>
Have you verified that these results aren't correct? There is no
requirements for newlines in HTML, and some computer-generated pages
don't bother to insert them.
regards
Steve
--
Steve Holden +1 571 484 6266 +1 800 494 3119
Holden Web LLC/Ltd http://www.holdenweb.com
Skype: holdenweb http://del.icio.us/steve.holden
--------------- Asciimercial ------------------
Get on the web: Blog, lens and tag the Internet
Many services currently offer free registration
----------- Thank You for Reading -------------
More information about the Python-list
mailing list