HTMLParser.HTMLParseError: EOF in middle of construct
none
"sergio\" at (none)
Tue Jun 19 10:27:00 EDT 2007
Gabriel Genellina wrote:
> En Mon, 18 Jun 2007 16:38:18 -0300, Sergio Monteiro Basto
> <sergio at sergiomb.no-ip.org> escribió:
>
>> Can someone explain me, what is wrong with this site ?
>>
>> python linkExtractor3.py http://www.noticiasdeaveiro.pt > test
>>
>> HTMLParser.HTMLParseError: EOF in middle of construct, at line 1173,
>> column 1
>>
>> at line 1173 of test file is perfectly normal .
>
> That page is not valid HTML - http://validator.w3.org/ finds 726 errors
> in it.
ok but my problem is not understand what is the specific problem at line
1173
> HTMLParser expects valid HTML - try a different tool, like
> BeautifulSoup, which is specially designed to handle malformed pages.
>
> --Gabriel Genellina
>
More information about the Python-list
mailing list