exhaustive mapping from html entities to unicode ?

Irmen de Jong irmen at -NOSPAM-REMOVETHIS-xs4all.nl
Fri Mar 7 08:00:04 EST 2003


shagshag13 wrote:

> i'm looking for something which would contain :
> .... mapping['&euro;'] : u'\u20ac', mapping['&#8364;'] : u'\u20ac' ....

You certainly don't want to have every possible numeric character entity,
such as &#8364;, in your mapping. Because it is a *numeric* entity, you
can translate it directly: 8364 is 0x20ac in hex.
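
As a rough illustration of translating numeric entities directly, here is a
minimal sketch. It assumes a current Python 3, where chr() accepts any code
point (on the Python 2 of this thread, unichr() would play that role). The
helper name decode_numeric_refs and the regular expression are made up for
this example and are not taken from any library.

```python
import re

# Matches decimal (&#8364;) and hexadecimal (&#x20ac;) character references.
# Hypothetical pattern for this sketch; it requires the trailing semicolon.
_NUMERIC_REF = re.compile(r"&#(?:[xX]([0-9a-fA-F]+)|([0-9]+));")


def decode_numeric_refs(text):
    """Replace numeric character references with the characters they name."""
    def _replace(match):
        hex_digits, dec_digits = match.groups()
        # The number in the reference is the code point itself.
        codepoint = int(hex_digits, 16) if hex_digits else int(dec_digits)
        try:
            return chr(codepoint)
        except (ValueError, OverflowError):
            # Out-of-range number: leave the reference untouched.
            return match.group(0)
    return _NUMERIC_REF.sub(_replace, text)


print(hex(8364))  # 0x20ac
print(decode_numeric_refs("&#8364;") == "\u20ac")  # True
```

Named entities such as &euro; are not touched by this sketch; they still
need a lookup table of some kind.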

About the others, I don't know.

Irmen




