[Tutor] How to read websites - Web Scraping or Parsing in python

Yashwin Kanchan yashwinkanchan at gmail.com
Wed Jun 13 14:24:52 CEST 2012


Hi Surya

Have you tried using IE automation (assuming you are using windows).

I used the library from http://www.mayukhbose.com/python/IEC/index.php

import IEC
ie = IEC.IEController()
ie.Navigate('http://knolzone.com/unlock-hidden-themes-in-windows-7-and-other-useful-tips-part-5-of-7/')
ie.GetDocumentText()

This gives the complete text of the webpage , you can then work on
cutting out the section that you actually require.

This isnt a complete solution to your problem , but it would bring you
a bit closer.

Regards
Yashwin Kanchan


More information about the Tutor mailing list