beautiful soup get class info
John Gordon
gordon at panix.com
Thu Mar 6 17:28:06 EST 2014
In <ae5b837c-501d-498e-bd3a-3b2c709c42b0 at googlegroups.com> teddybubu at gmail.com writes:
> > soup.find_all(name="span", class="date")
> I have python 2.7.2 and it does not like class in the code you provided.
Oh right, 'class' is a reserved word. I imagine beautifulsoup has
a workaround for that.
> Now when I take out [ class="date"], this is returned:
> [<span class="date">March 5, 2014</span>, <span class="date">March 5, 2014</span>]
>
> This is the code I am using: "data = soup.find_all(name="span")
> print (data)"
> 1. it returns today's date instead of the actual date
> 2. returns it twice
Are there two occurrences of '<span class="date">March 5, 2014</span>'
in the HTML? If so, then beautifulsoup is doing its job correctly.
It might help if you posted the sample HTML data you're working with.
--
John Gordon Imagine what it must be like for a real medical doctor to
gordon at panix.com watch 'House', or a real serial killer to watch 'Dexter'.
More information about the Python-list
mailing list