C Gillespie wrote:
> I have hopefully a very simple problem. I wish to parse an html page and
> extract everything between the <body> tags.
People are actually suggesting using DOM for this?! A simple approach is
much better:
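def get_body(html):
    # Locate the opening <body ...> tag and the matching closing tag.
    body_start = html.find('<body')
    body_end = html.find('</body>', body_start)
    if body_start == -1 or body_end == -1:
        return ''  # no complete <body>...</body> pair found
    # Include the closing tag itself in the returned slice.
    return html[body_start:body_end + len('</body>')]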
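
Note that get_body returns the <body> tags themselves along with their contents, and it only matches lowercase tags. If you want strictly what sits between the tags, as the question asks, a variant along these lines might do. This is only a sketch: the name get_body_contents and the regular expression are my own, and it assumes no '>' character appears inside the attributes of the opening tag.

```python
import re

# Hypothetical helper, not part of the original post.
# Matches <body ...> ... </body> in any letter case; DOTALL lets '.'
# span newlines, and the non-greedy group stops at the first </body>.
_BODY_RE = re.compile(r'<body\b[^>]*>(.*?)</body\s*>',
                      re.IGNORECASE | re.DOTALL)


def get_body_contents(html):
    """Return the text between the body tags, or '' if there is no pair."""
    match = _BODY_RE.search(html)
    return match.group(1) if match else ''


if __name__ == '__main__':
    page = '<html><BODY class="x">\n<p>hi</p>\n</Body></html>'
    print(repr(get_body_contents(page)))
```

It is still plain string matching rather than real parsing, so badly broken markup can fool it, but for the common case it stays about as short as the find-based version.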