exhaustive mapping from html entities to unicode ?
Irmen de Jong
irmen at -NOSPAM-REMOVETHIS-xs4all.nl
Fri Mar 7 08:00:04 EST 2003
shagshag13 wrote:
> i'm looking for something which would contain :
> .... mapping['€'] : u'\u20ac', mapping['€'] : u'\u20ac' ....
You certainly don't want to have every possible numeric character entity,
such as € in your mapping. Because it is a *numeric* entity you
can translate it directly. 8364 is 0x20ac in hex.
About the others, I don't know.
Irmen
More information about the Python-list
mailing list