How to read webpage
Marcus Wanner
marcusw at cox.net
Sat Aug 1 11:43:15 EDT 2009
On 8/1/2009 11:31 AM, Jon Clements wrote:
> On 1 Aug, 14:52, MRAB <pyt... at mrabarnett.plus.com> wrote:
>> tarun wrote:
>>> Dear All,
>>> I want to read a webpage and copy its contents into a Word file. I
>>> tried the following code:
>>> import urllib2
>>> urllib2.urlopen("http://www.rediff.com/")
>>> *Error:-*
>>> urllib2.urlopen("http://www.icicibank.com/")
>>> File "C:\Python25\lib\urllib2.py", line 121, in urlopen
>>> return _opener.open(url, data)
>>> File "C:\Python25\lib\urllib2.py", line 374, in open
>>> response = self._open(req, data)
>>> File "C:\Python25\lib\urllib2.py", line 392, in _open
>>> '_open', req)
>>> File "C:\Python25\lib\urllib2.py", line 353, in _call_chain
>>> result = func(*args)
>>> File "C:\Python25\lib\urllib2.py", line 1100, in http_open
>>> return self.do_open(httplib.HTTPConnection, req)
>>> File "C:\Python25\lib\urllib2.py", line 1075, in do_open
>>> raise URLError(err)
>>> urllib2.URLError: <urlopen error (11001, 'getaddrinfo failed')>
>> I've just tried it. I didn't get an exception, so your problem must be
>> elsewhere.
>
> I'm hoping this adds to MRAB's reply; it is intended however for the
> OP.
>
> Jeeze -- been a while since I've had to deal with Sockets (directly
> anyway).
> If memory serves correctly, it means the system can't resolve the
> hostname to an address.
> So best guess is it's either a temporary glitch, or an issue with your
> routing.
>
> Jon.
'getaddrinfo failed' means that the hostname could not be resolved:
either the nameserver can't be reached, or it has no record for that
hostname (I'm 90% sure of that).
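
A minimal sketch of how one might check this from Python, assuming
Python 2.6 or later (the import falls back to the Python 3 module names).
The helper names can_resolve and fetch are made up for illustration; they
are not part of urllib2.

```python
import socket

try:  # Python 2, as in the original post
    from urllib2 import urlopen, URLError
except ImportError:  # Python 3 moved these names
    from urllib.request import urlopen
    from urllib.error import URLError


def can_resolve(hostname):
    """Return True if the system resolver can look up hostname."""
    try:
        socket.getaddrinfo(hostname, 80)
        return True
    except socket.gaierror:
        # This is the failure that urllib2 reports as 'getaddrinfo failed'.
        return False


def fetch(url):
    """Return the page body, or None if the request fails with URLError."""
    try:
        return urlopen(url).read()
    except URLError as err:
        print("could not open %s: %s" % (url, err.reason))
        return None
```

If can_resolve("www.rediff.com") comes back False, the problem is most
likely in DNS or the network setup (resolver settings, proxy, firewall)
and not in the urllib2 call itself.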
Marcus