how can i use lxml with win32com?

elca highcar at gmail.com
Sun Oct 25 03:06:42 EDT 2009


Hello,
Yes, there is a reason why I have to insist on the Internet Explorer interface:
the pages I need depend on JavaScript, so I am trying to use PAMIE.
I have tried other solutions such as urlopen and mechanize,
but they are hard to use with JavaScript.
Can you show me a sample? :)
For example, I would like to extract the 'CNN Shop' and 'Site map' text
at the bottom of the CNN website page using PAMIE.
Thanks for your help.



motoom wrote:
> 
> 
> On 25 Oct 2009, at 07:45 , elca wrote:
> 
>> I want to make a web scraper.
>> If possible, I would really like to make it work with BeautifulSoup or
>> lxml together with PAMIE.
> 
> Scraping information from webpages breaks down into two tasks:
> 
> 1. Getting the HTML data
> 2. Extracting information from the HTML data
> 
> It looks like you want to use Internet Explorer for getting the HTML  
> data; is there any reason you can't use a simpler approach like using  
> urllib2.urlopen()?
> 
> Once you have the HTML data, you could feed it into BeautifulSoup or  
> lxml.
> 
> Mixing up 1 and 2 into a single statement created some confusion for  
> you, I think.
> 
> Greetings,
> 
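
The two tasks quoted above can be kept apart in code. What follows is only a
minimal sketch, assuming Python 3 and, for step 1, the pywin32 package on
Windows. It drives Internet Explorer directly through win32com rather than
through PAMIE, so that JavaScript still runs before the HTML is read. The
names fetch_html_with_ie, LinkTextCollector and find_links are made up for
this illustration, and the sample HTML in the usage note is invented, not
taken from the CNN page. Step 2 uses the standard-library html.parser to stay
self-contained; the same HTML string could instead be passed to
lxml.html.fromstring() or BeautifulSoup.

```python
import time
from html.parser import HTMLParser


def fetch_html_with_ie(url):
    """Step 1: let Internet Explorer load the page (so its JavaScript runs)
    and return the rendered HTML. Windows only; assumes pywin32 is installed."""
    import win32com.client  # imported here so step 2 works without pywin32

    ie = win32com.client.Dispatch("InternetExplorer.Application")
    try:
        ie.Visible = False
        ie.Navigate(url)
        # Wait until the browser reports READYSTATE_COMPLETE (value 4).
        while ie.Busy or ie.ReadyState != 4:
            time.sleep(0.1)
        return ie.Document.documentElement.outerHTML
    finally:
        ie.Quit()


class LinkTextCollector(HTMLParser):
    """Step 2: collect (text, href) pairs for every <a> element."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._in_link = False
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._in_link = True
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._in_link:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._in_link:
            # Collapse runs of whitespace inside the link text.
            text = " ".join("".join(self._text).split())
            self.links.append((text, self._href))
            self._in_link = False


def find_links(html, wanted):
    """Return the (text, href) pairs whose link text is in `wanted`."""
    parser = LinkTextCollector()
    parser.feed(html)
    parser.close()
    return [(text, href) for text, href in parser.links if text in wanted]
```

On Windows the two steps would be combined roughly as
find_links(fetch_html_with_ie(url), {"CNN Shop", "Site map"}). Because
find_links only needs a string, it can be tried on any HTML without a browser.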
