how can i use lxml with win32com?
elca
highcar at gmail.com
Sun Oct 25 03:06:42 EDT 2009
Hello,
yes there is some reason why i nave to insist internet explorere interface.
because of javascript im trying to insist use PAMIE.
i was tried some other solution urlopen or mechanize and so on.
but it hard to use javascript.
can you show me some sample for me ? :)
such like if i want to extract some text in CNN website with 'CNN Shop'
'Site map' in bottom of CNN website page by use PAMIE.
thanks for your help.
motoom wrote:
>
>
> On 25 Oct 2009, at 07:45 , elca wrote:
>
>> i want to make web scraper.
>> if possible i really want to make it work together with
>> beautifulsoup or
>> lxml with PAMIE.
>
> Scraping information from webpages falls apart in two tasks:
>
> 1. Getting the HTML data
> 2. Extracting information from the HTML data
>
> It looks like you want to use Internet Explorer for getting the HTML
> data; is there any reason you can't use a simpler approach like using
> urllib2.urlopen()?
>
> Once you have the HTML data, you could feed it into BeautifulSoup or
> lxml.
>
> Mixing up 1 and 2 into a single statement created some confusion for
> you, I think.
>
> Greetings,
> --
> http://mail.python.org/mailman/listinfo/python-list
>
>
--
View this message in context: http://www.nabble.com/how-can-i-use-lxml-with-win32com--tp26044339p26045673.html
Sent from the Python - python-list mailing list archive at Nabble.com.
More information about the Python-list
mailing list