internet searching program
greg at cosc.canterbury.ac.nz
Sun Aug 10 03:12:53 CEST 2008
Michael Tobis wrote:
> I think you are talking about "screen scraping".
> Your program can get the html for the page, and search for an
> appropriate pattern.
However, it wouldn't be "really fast", because you
still have to fetch all the pages that might contain
data you're looking for.
Google searches are fast because they've already
fetched all the web pages in the world and indexed
You might get somewhere using a program that does
a site-specific google search to find potentially
relevant pages, then goes and looks at those pages
for further information.
Another possibility might be to crawl the site and
build your own index based on the information you're
More information about the Python-list