mr.spoon21 at gmail.com
Wed Nov 19 18:23:16 CET 2008
On Wed, Nov 19, 2008 at 3:41 PM, Philip Semanchuk <philip at semanchuk.com> wrote:
> I'm not sure why you'd need to host the Python code anywhere other than your
> home computer. If you wanted to pull thousands of pages from a site like
> that, you'd need to respect their robots.txt file. Don't forget to look for
> a crawl-delay specification. Even if they don't specify one, you shouldn't
> let your bot hammer their servers at full speed -- give it a delay, let it
> run in the background, it might take you three days versus an hour to
> collect the data you need but that's not too big of a deal in the service of
> good manners, is it?
Mmm, I didn't really mean just hosting the code, but running it. I mean
something like server-side code, so that my programs keep running and
updating (in my case) the RSS feed without my needing to be online to
run them.
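
On the robots.txt and crawl-delay advice quoted above, here is a minimal sketch of what that could look like using the standard library's urllib.robotparser (Python 3.6+ for crawl_delay). The helper names (load_rules, delay_for, polite_urls), the "mybot" user agent, the example.com URLs, and the 5-second fallback are all placeholders of mine, not anything from the thread. The actual page fetching is left out; the generator only decides which URLs may be fetched and how long to wait between them.

```python
import time
from urllib.robotparser import RobotFileParser

DEFAULT_DELAY = 5.0  # seconds; assumed fallback when robots.txt gives no Crawl-delay


def load_rules(robots_txt):
    """Parse the text of a robots.txt file into a RobotFileParser."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def delay_for(parser, user_agent, default=DEFAULT_DELAY):
    """Return the site's Crawl-delay for this agent, or our own default."""
    delay = parser.crawl_delay(user_agent)
    return float(delay) if delay is not None else default


def polite_urls(parser, user_agent, urls, sleep=time.sleep):
    """Yield only the URLs robots.txt allows, sleeping between them."""
    delay = delay_for(parser, user_agent)
    first = True
    for url in urls:
        if not parser.can_fetch(user_agent, url):
            continue  # disallowed by robots.txt, skip it
        if not first:
            sleep(delay)  # wait before every fetch except the first
        first = False
        yield url


# Example robots.txt content (made up for illustration).
EXAMPLE_ROBOTS = "User-agent: *\nCrawl-delay: 10\nDisallow: /private/\n"
```

In a real run you would download the site's robots.txt first (RobotFileParser also has set_url() and read() for that), then loop over polite_urls(...) and fetch each URL it yields.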