How to read webpage

Marcus Wanner marcusw at cox.net
Sat Aug 1 11:43:15 EDT 2009


On 8/1/2009 11:31 AM, Jon Clements wrote:
> On 1 Aug, 14:52, MRAB <pyt... at mrabarnett.plus.com> wrote:
>> tarun wrote:
>>> Dear All,
>>> I want to read a webpage and copy the contents of it in word file. I
>>> tried to write following code:
>>> import urllib2
>>> urllib2.urlopen("http://www.rediff.com/")
>>> *Error:-*
>>>     urllib2.urlopen("http://www.icicibank.com/")
>>>   File "C:\Python25\lib\urllib2.py", line 121, in urlopen
>>>     return _opener.open(url, data)
>>>   File "C:\Python25\lib\urllib2.py", line 374, in open
>>>     response = self._open(req, data)
>>>   File "C:\Python25\lib\urllib2.py", line 392, in _open
>>>     '_open', req)
>>>   File "C:\Python25\lib\urllib2.py", line 353, in _call_chain
>>>     result = func(*args)
>>>   File "C:\Python25\lib\urllib2.py", line 1100, in http_open
>>>     return self.do_open(httplib.HTTPConnection, req)
>>>   File "C:\Python25\lib\urllib2.py", line 1075, in do_open
>>>     raise URLError(err)
>>> urllib2.URLError: <urlopen error (11001, 'getaddrinfo failed')>
>> I've just tried it. I didn't get an exception, so your problem must be
>> elsewhere.
> 
> I'm hoping this adds to MRAB's reply; it is intended however for the
> OP.
> 
> Jeeze -- been a while since I've had to deal with Sockets (directly
> anyway).
> If memory serves correctly, it's where the system can't name resolve
> the required address.
> So best guess is it's either a temporary glitch, or an issue with your
> routing.
> 
> Jon.
> Jon.

'getaddrinfo failed' means that the nameserver can't be found, or that 
it has no records of that address (I'm 90% sure of that).

Marcus



More information about the Python-list mailing list