newbie problem: use socke lib to retrieve one web page:
Cameron Laird
claird at starbase.neosoft.com
Thu Sep 5 20:24:26 EDT 2002
In article <mailman.1031240588.2234.python-list at python.org>,
Erik Price <erikprice at mac.com> wrote:
>
>On Wednesday, September 4, 2002, at 11:53 PM, koko wrote:
>
>> I write this to retrieve one web page using socket lib.
.
.
.
>Works for me:
>
> >>> import socket
> >>> host = 'www.uic.edu'
> >>> port = 80
> >>> s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
> >>> s.connect((host, port))
> >>> header = """HEAD /home/events.shtml HTTP/1.0
>... From: hh at uic.edu
>... User-Agent: test/1.0
>...
>... """
> >>> s.send(header)
>72
> >>> data = s.recv(4096)
> >>> print data
>HTTP/1.1 200 OK
>Date: Thu, 05 Sep 2002 15:40:00 GMT
>Server: Apache/1.3.26 (Unix) PHP/4.1.2 mod_perl/1.27 mod_ssl/2.8.10
>OpenSSL/0.9.6
>Connection: close
>Content-Type: text/html
>
>
> >>> s.close()
>
>
>I used the HEAD method instead of GET for brevity. But GET works too.
.
.
.
To criticize code that's already giving satisfaction
is nearly beyond me. I'll point out, though, that
the original questioner might consider this alterna-
tive which has, I believe, evident advantages for
long-term maintenance:
from urllib import urlopen
URL = "http://www.uic.edu"
page = urlopen(URL).read()
print page
Note that urllib is one of the batteries the standard
Python distribution includes.
My summary: use higher-order facilities when applicable.
--
Cameron Laird <Cameron at Lairds.com>
Business: http://www.Phaseit.net
Personal: http://starbase.neosoft.com/~claird/home.html
More information about the Python-list
mailing list