how to get 20000 html pages content quickly from one server?

Zachery Bir zbir at
Wed Mar 15 18:53:27 CET 2006

On Mar 15, 2006, at 11:58 AM, JuHui wrote:

> in fact, I want to do a script to get news on others site.
> I must use script get the content and analyze the html code, where is
> the title, where is the body....
> so, I can't ask permission, use wget  and "Physically remove the
> harddrive and reinstall it locally"

The only one it looks like you *can't* do is physically remove the  
hard drive and reinstall it locally. Seems more like you *won't* do  
the other two.


More information about the Python-list mailing list