Can't get the real contents from page in internet as the tag "no-cache"

Diez B. Roggisch deets at
Thu Mar 23 11:50:54 CET 2006

dongdong wrote:

> Using a web browser I can get the page's content normally, but when I use
> the result is
> <META http-equiv="Pragma"
> content="no-cache"></HEAD><body>?y?ú'ò?aò3??...</body></html>
> I think the reason is the no-cache. Could someone help me?

No, the reason is the <META HTTP-EQUIV=REFRESH

that redirects you to the real site. Extract that URL from the page and
request it. Or maybe you can use webunit, which acts more like a "real"
HTTP client by interpreting such content.
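
The "extract that URL and request it" step could look roughly like the sketch below. It uses only the standard library of a current Python 3, and it assumes the refresh tag has the usual content="0; URL=..." form. The names MetaRefreshParser, find_meta_refresh_url and fetch_following_meta_refresh are made up for this illustration, and the latin-1 decoding is only a stand-in that never fails; the real page may well use another encoding.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class MetaRefreshParser(HTMLParser):
    """Remembers the target of the first <meta http-equiv="refresh"> tag."""

    def __init__(self):
        super().__init__()
        self.refresh_url = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "meta" or self.refresh_url is not None:
            return
        attributes = {name: (value or "") for name, value in attrs}
        if attributes.get("http-equiv", "").lower() != "refresh":
            return
        # Assumed form of the content attribute: "0; URL=http://host/path"
        _, _, rest = attributes.get("content", "").partition(";")
        key, _, target = rest.strip().partition("=")
        if key.strip().lower() == "url" and target:
            self.refresh_url = target.strip().strip("'\"")


def find_meta_refresh_url(html, base_url):
    """Return the absolute refresh target found in html, or None."""
    parser = MetaRefreshParser()
    parser.feed(html)
    if parser.refresh_url is None:
        return None
    # The target may be relative, so resolve it against the page's own URL.
    return urljoin(base_url, parser.refresh_url)


def fetch_following_meta_refresh(url, max_hops=5):
    """Fetch url, following meta refresh redirects up to max_hops times."""
    for _ in range(max_hops):
        with urlopen(url) as response:
            body = response.read()
        # Assumption: latin-1 is used only because it can decode any bytes.
        target = find_meta_refresh_url(body.decode("latin-1"), url)
        if target is None:
            return body
        url = target
    raise RuntimeError("too many meta refresh hops")
```

Calling fetch_following_meta_refresh() with the address of the start page should then return the bytes of the page the browser ends up on, provided the redirect really is a plain meta refresh and not done by JavaScript.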

