[RELEASED] Python 3.1 final
Hallvard B Furuseth
h.b.furuseth at usit.uio.no
Sun Jun 28 17:34:10 EDT 2009
Benjamin Peterson writes:
>Nobody <nobody <at> nowhere.com> writes:
>> On Sun, 28 Jun 2009 19:21:49 +0000, Benjamin Peterson wrote:
>> 1. Does Python offer any assistance in doing so, or do you have to
>> manually convert the surrogates which are generated for unrecognised bytes?
>
> fs_encoding = sys.getfilesystemencoding()
> bytes_argv = [arg.encode(fs_encoding, "surrogateescape") for arg in sys.argv]
>
>> 2. How do you do this for non-invertible encodings (e.g. ISO-2022)?
>
> What's a non-invertible encoding? I can't find a reference to the term.
Different ISO-2022 strings can map to the same Unicode string.
Thus you can convert back to _some_ ISO-2022 string, but it won't
necessarily match the original.
--
Hallvard
More information about the Python-list
mailing list