[Tutor] converting encoded symbols from rss feed?

Serdar Tumgoren zstumgoren at gmail.com
Wed Jun 17 16:19:44 CEST 2009


> Upon searching for &#150 in google, I came up with this:
> http://www.siber-sonic.com/mac/charsetstuff/Soniccharset.html

The character table definitely helps. Thanks.

Some additional googling suggests that I need to unescape HTML
entities. I'm planning to try the below approach from Frederik Lundh.
It relies on the "re" and "htmlentitydefs" modules.

http://effbot.org/zone/re-sub.htm#unescape-html

I'll report back with my results. Meantime, I welcome any other suggestions.

Thanks!


More information about the Tutor mailing list