a simple unicode question

Chris Jones cjns1989 at gmail.com
Wed Oct 28 01:28:01 EDT 2009


On Tue, Oct 27, 2009 at 06:21:11AM EDT, Lie Ryan wrote:
> Chris Jones wrote:

[..]

>> Best part of Unicode is that there are multiple encodings, right? ;-)
>
> No, the best part about Unicode is there is no encoding!

> Unicode does not define any encoding; 

RFC 3629:

"ISO/IEC 10646 and Unicode define several encoding forms of their
common repertoire: UTF-8, UCS-2, UTF-16, UCS-4 and UTF-32."
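
To make the RFC's wording concrete, here is a minimal sketch of one code
point serialized under three of the encoding forms it names. It assumes
Python 3 syntax, and the choice of U+20AC (EURO SIGN) is arbitrary; the
names ch, code_point and encoded are only illustrative.

```python
# One abstract code point: U+20AC EURO SIGN (arbitrary choice for illustration).
ch = "\u20ac"
code_point = ord(ch)  # the integer Unicode assigns to this character

# The same code point serialized under three encoding forms named in the RFC.
# The "-be" codecs are used so that no byte-order mark is prepended.
encoded = {
    "utf-8": ch.encode("utf-8"),
    "utf-16-be": ch.encode("utf-16-be"),
    "utf-32-be": ch.encode("utf-32-be"),
}

for name, raw in encoded.items():
    print(name, raw.hex())

# Decoding each byte string with its own codec recovers the same code point.
assert all(raw.decode(name) == ch for name, raw in encoded.items())
```

The byte strings differ in length and content, yet each decodes back to
the same code point, which is what the two sides of this thread seem to
be describing from different ends.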

> what it defines is code-points for characters which is not related to
> how characters are encoded in files or network transmission.

In other words, Unicode is "not related to any encoding" ... and yet the
UTF-8, UTF-16, ... "encoding forms" are clearly "related" to Unicode.

How is that possible?

CJ


