HTMLParser.HTMLParseError: EOF in middle of construct

none "sergio\" at (none)
Tue Jun 19 16:27:00 CEST 2007


Gabriel Genellina wrote:
> En Mon, 18 Jun 2007 16:38:18 -0300, Sergio Monteiro Basto 
> <sergio at sergiomb.no-ip.org> escribió:
> 
>> Can someone explain me, what is wrong with this site ?
>>
>> python linkExtractor3.py http://www.noticiasdeaveiro.pt > test
>>
>> HTMLParser.HTMLParseError: EOF in middle of construct, at line 1173,
>> column 1
>>
>> at line 1173 of test file is perfectly normal .
> 
> That page is not valid HTML - http://validator.w3.org/ finds 726 errors 
> in it.

ok but my problem is not understand what is the specific problem at line 
1173

> HTMLParser expects valid HTML - try a different tool, like 
> BeautifulSoup, which is specially designed to handle malformed pages.
> 
> --Gabriel Genellina
> 



More information about the Python-list mailing list