Encode HTML CDATA name token

Gregor Horvath gh at gregor-horvath.com
Mon Sep 20 04:45:46 EDT 2010


Hi,

ID and NAME tokens must begin with a letter ([A-Za-z]) and may be
followed by any number of letters, digits ([0-9]), hyphens ("-"),
underscores ("_"), colons (":"), and periods ("."). [1]
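
As a concrete reading of that rule, a candidate token could be checked
with a regular expression along these lines. This is only a sketch;
NAME_TOKEN_RE and is_name_token are names made up for illustration and
do not come from any library.

```python
import re

# ID and NAME tokens per the rule quoted above: one letter, then any number
# of letters, digits, hyphens, underscores, colons, and periods.
# \Z anchors at the very end, so a trailing newline is rejected too.
NAME_TOKEN_RE = re.compile(r"[A-Za-z][A-Za-z0-9_:.-]*\Z")


def is_name_token(text):
    """Return True if text is a valid ID/NAME token under that rule."""
    return NAME_TOKEN_RE.match(text) is not None
```

With this check, "TEST" passes, while "/TEST" and an empty string do not.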

Is there an encoder/decoder in Python that can convert arbitrary text
to and from this encoding in a readable manner?

base64 is not readable:

In [31]: base64.encodestring("/TEST")
Out[31]: 'L1RFU1Q=\n'

quopri does not encode "/", for example:

In [25]: quopri.encodestring("/test")
Out[25]: '/test'

Anything else?
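
To make the requirement concrete, a hand-rolled round trip might look
roughly like the sketch below. It is not an existing library API: the
names encode_name, decode_name, SAFE and PREFIX are hypothetical, and
the scheme itself is an assumption (Python 3 strings, "_" reserved as
the escape character, every other disallowed character written as
"_XX" per UTF-8 byte, and a fixed leading "x" so the result always
starts with a letter).

```python
import re

# Characters copied through unchanged. "_" is allowed in NAME tokens but is
# reserved here as the escape character, so it gets escaped as well.
SAFE = re.compile(r"[A-Za-z0-9:.-]")
# Fixed leading letter, so the token is valid even for input like "/TEST".
PREFIX = "x"


def encode_name(text):
    """Encode arbitrary text as a NAME token (hypothetical scheme)."""
    out = [PREFIX]
    for ch in text:
        if SAFE.match(ch):
            out.append(ch)
        else:
            # Escape each UTF-8 byte as "_" plus two uppercase hex digits.
            out.extend("_%02X" % b for b in bytearray(ch.encode("utf-8")))
    return "".join(out)


def decode_name(token):
    """Reverse encode_name; raises ValueError on a missing prefix."""
    if not token.startswith(PREFIX):
        raise ValueError("not produced by encode_name")
    body = token[len(PREFIX):]
    raw = bytearray()
    i = 0
    while i < len(body):
        if body[i] == "_":
            # Two hex digits follow the escape character.
            raw.append(int(body[i + 1:i + 3], 16))
            i += 3
        else:
            raw.append(ord(body[i]))
            i += 1
    return raw.decode("utf-8")
```

Under these assumptions encode_name("/TEST") gives 'x_2FTEST', and
decode_name('x_2FTEST') gives '/TEST' back. Always adding the prefix
costs one character of readability but keeps decoding unambiguous.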

--
Greg

[1] http://www.w3.org/TR/html4/types.html#type-cdata


