[Tutor] python module to search a website
Corey Richardson
kb1pkl at aim.com
Sun Feb 27 05:05:22 CET 2011
On 02/26/2011 10:11 PM, vineeth wrote:
> Hello all,
>
> I am looking forward for a python module to search a website and extract
> the url.
What website, what is it searching for, and what URL is it looking for?
>
> For example I found a module for Amazon with the name "amazonproduct",
> the api does the job of extracting the data based on the query it even
> parses the url data. I am looking some more similar query search python
> module for other websites like Amazon.
The only module I found for amazon-product was a python interface to
Amazon's advertising API. What data does it extract, what query, and
which URL does it parse? From what I found that module uses the API to
search the website, a service provided by Amazon and not something
Python is doing itself.
You may want to look into urlparse and urllib2, for parsing URLs and
opening websites respectively.
http://docs.python.org/library/urlparse.html
http://docs.python.org/library/urllib2.html
If that isn't what you're looking for, you'll need to be a bit more
descriptive.
If you are going to be parsing the HTML and then searching for specific
elements you might look into BeautifulSoup.
--
Corey Richardson
More information about the Tutor
mailing list