parse html rendered by js
alex at moreati.org.uk
Fri Feb 11 15:46:01 CET 2011
On Feb 11, 8:20 am, yanghq <yan... at neusoft.com> wrote:
> I wanna get attribute value like href,src... in html.
> for simple html page libxml2dom can help me parse it into dom, and
> get what I want;
> but for some pages rendered by js, like:
> '<frameset border="0" frameborder="no" rows="0,*,0" onLoad="start()"
> onUnload="end()" onResize="change()">'+
> '<frameset border="0" frameborder="no" cols="*,*,*,*,*,0">'+
> '<frame name="cfgFrame" noresize scrolling="no"
> src="../frame.html?rtfPossible=' + rtfPossibleString + '">'+
> how can I get the attribute value of 'src', thank you for any help.
your duplicate code.
- Use or write a Python module that uses a web browser to download/
execute the page. I'm not aware of any that exist.
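
For a page like the one quoted, where the markup is built from plain single-quoted string literals joined with +, one browser-free possibility is to rebuild the HTML in Python and parse that. What follows is only a rough sketch under that assumption. The names join_js_literals, AttrCollector and extract_attrs, and the "{js}" placeholder, are made up for illustration. It uses the Python 3 standard library (re, html.parser) rather than libxml2dom, and it does not execute any JavaScript, so values computed at run time (such as rtfPossibleString) stay unknown and show up as the placeholder.

```python
import re
from html.parser import HTMLParser

# One single-quoted JavaScript string literal, allowing backslash escapes.
# Assumption: the page builds its markup only from literals like this.
JS_STRING = re.compile(r"'((?:[^'\\]|\\.)*)'")


def join_js_literals(js_source, placeholder="{js}"):
    """Concatenate the string literals in an expression like 'a' + x + 'b'.

    Anything between two literals that is not just whitespace or '+'
    is treated as a JavaScript expression and replaced by `placeholder`.
    Backslash escapes inside the literals are left as they are.
    """
    parts = []
    pos = 0
    for match in JS_STRING.finditer(js_source):
        gap = js_source[pos:match.start()]
        if gap.strip(" \t\r\n+"):
            parts.append(placeholder)
        parts.append(match.group(1))
        pos = match.end()
    if js_source[pos:].strip(" \t\r\n+;"):
        parts.append(placeholder)
    return "".join(parts)


class AttrCollector(HTMLParser):
    """Collect (tag, attribute, value) for the attribute names of interest."""

    def __init__(self, wanted=("href", "src")):
        super().__init__()
        self.wanted = wanted
        self.found = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in self.wanted:
                self.found.append((tag, name, value))


def extract_attrs(js_source, wanted=("href", "src")):
    """Rebuild the HTML from the JS literals and list the wanted attributes."""
    collector = AttrCollector(wanted)
    collector.feed(join_js_literals(js_source))
    collector.close()
    return collector.found


# The fragment from the quoted message, with the mail line wraps undone.
sample = r"""
'<frameset border="0" frameborder="no" rows="0,*,0" onLoad="start()" onUnload="end()" onResize="change()">'+
'<frameset border="0" frameborder="no" cols="*,*,*,*,*,0">'+
'<frame name="cfgFrame" noresize scrolling="no" src="../frame.html?rtfPossible=' + rtfPossibleString + '">'
"""

print(extract_attrs(sample))
```

On the quoted fragment this reports the frame's src as "../frame.html?rtfPossible={js}", i.e. the static part of the URL with a marker where the JavaScript variable would go. Anything more dynamic than simple concatenation (loops, function calls, document.write chains) is beyond what this kind of text matching can handle.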
Neither option is very good, and that is one reason why such
testing a web application, screen scraping a web site)? There may be a