[Tutor] how to use lxml and win32com?
elca
highcar at gmail.com
Mon Oct 26 23:02:56 CET 2009
Alan Gauld wrote:
>
>
> "elca" <highcar at gmail.com> wrote
>
>>>> i want to use IE.navigate function with beautifulsoup or lxml..
>>>> if anyone know about this or sample.
>>> Why do you want to use navigate()? What are you trying to do?
>>> There is likely to be another way to do it from Python.
>
>> so why i try to use PAMIE or IE
>> http://elca.pastebin.com/m52e7d8e0
>> i was attached current scraper script source.
>
> OK TherR are several problems in there.
> First, are you sure you want to define the function getit() inside a
> while loop? (I'm pretty sure you don't) And are you sure you want
> the function to recurse infinitely - see the last line (I'm pretty sure
> you
> don't)
> Next, do you really want page_check() to sleep for 13 seconds?
> (Once per letter in the url)
> -->
> all your words is correct. i don't need such like getit() and other
> function .
>> especially i want to change 'thepage = urllib.urlopen(theurl).read()' to
>> PAMIE method.
>
> And you still don't explain why you don;t want to use urlopen?
> What advantage does using PAMIE offer? I'd expect it to be slower
> and more memory hungry (since it uses IE under the covers).
> --> after make scraper i will add some other function that time i need to
> handle javascript,
> but as you already know urlopen method don't have such can handling
> javasript option,
> so why i want to use pamie, in addition i was tried other kind of method,
> such like Selenium,webdriver
> but not so much good for me, thanks for your help
> HTH,
>
>
> --
> Alan Gauld
> Author of the Learn to Program web site
> http://www.alan-g.me.uk/
>
>
> _______________________________________________
> Tutor maillist - Tutor at python.org
> To unsubscribe or change subscription options:
> http://mail.python.org/mailman/listinfo/tutor
>
>
--
View this message in context: http://www.nabble.com/how-to-use-lxml-and-win32com--tp26045028p26068522.html
Sent from the Python - tutor mailing list archive at Nabble.com.
More information about the Tutor
mailing list