[Tutor] how to use lxml and win32com?

elca highcar at gmail.com
Mon Oct 26 23:02:56 CET 2009




Alan Gauld wrote:
> 
> 
> "elca" <highcar at gmail.com> wrote
> 
>>>> i want to use IE.navigate function with beautifulsoup or lxml..
>>>> if anyone know about this  or sample.
>>> Why do you want to use navigate()? What are you trying to do?
>>> There is likely to be another way to do it from Python.
> 
>> so why i try to use PAMIE or IE
>> http://elca.pastebin.com/m52e7d8e0
>> i was attached current scraper script source.
> 
> OK TherR are several problems in there.
> First, are you sure you want to define the function getit() inside a
> while loop? (I'm pretty sure you don't) And are you sure you want
> the function to recurse infinitely - see the last line (I'm pretty sure
> you
> don't)
> Next, do you really want page_check()  to sleep for 13 seconds?
> (Once per letter in the url)
> --> 
> all your words is correct.  i don't need such like getit() and other
> function .
>> especially  i want to change 'thepage = urllib.urlopen(theurl).read()' to
>> PAMIE method.
> 
> And you still don't explain why you don;t want to use urlopen?
> What advantage does using PAMIE offer? I'd expect it to be slower
> and more memory hungry (since it uses IE under the covers).
> --> after make scraper i will add some other function that time i need to
> handle javascript,
> but as you already know urlopen method don't have such can handling
> javasript option,
> so why i want to use pamie, in addition i was tried other kind of method,
> such like Selenium,webdriver
> but not so much good for me, thanks for your help 
> HTH,
> 
> 
> -- 
> Alan Gauld
> Author of the Learn to Program web site
> http://www.alan-g.me.uk/ 
> 
> 
> _______________________________________________
> Tutor maillist  -  Tutor at python.org
> To unsubscribe or change subscription options:
> http://mail.python.org/mailman/listinfo/tutor
> 
> 

-- 
View this message in context: http://www.nabble.com/how-to-use-lxml-and-win32com--tp26045028p26068522.html
Sent from the Python - tutor mailing list archive at Nabble.com.



More information about the Tutor mailing list