beautiful soup get class info
gordon at panix.com
Thu Mar 6 23:28:06 CET 2014
In <ae5b837c-501d-498e-bd3a-3b2c709c42b0 at googlegroups.com> teddybubu at gmail.com writes:
> > soup.find_all(name="span", class="date")
> I have python 2.7.2 and it does not like class in the code you provided.
Oh right, 'class' is a reserved word. I imagine beautifulsoup has
a workaround for that.
> Now when I take out [ class="date"], this is returned:
> [<span class="date">March 5, 2014</span>, <span class="date">March 5, 2014</span>]
> This is the code I am using: "data = soup.find_all(name="span")
> print (data)"
> 1. it returns today's date instead of the actual date
> 2. returns it twice
Are there two occurrences of '<span class="date">March 5, 2014</span>'
in the HTML? If so, then beautifulsoup is doing its job correctly.
It might help if you posted the sample HTML data you're working with.
John Gordon Imagine what it must be like for a real medical doctor to
gordon at panix.com watch 'House', or a real serial killer to watch 'Dexter'.
More information about the Python-list