UTF-8 question from Dive into Python 3

Raymond Hettinger python at rcn.com
Tue Jan 18 13:14:49 EST 2011


On Jan 17, 2:19 pm, carlo <syseng... at gmail.com> wrote:
> Hi,
> recently I had to study *seriously* Unicode and encodings for one
> project in Python but I left with a couple of doubts arised after
> reading the unicode chapter of Dive into Python 3 book by Mark
> Pilgrim.
>
> 1- Mark says:
> "Also (and you’ll have to trust me on this, because I’m not going to
> show you the math), due to the exact nature of the bit twiddling,
> there are no byte-ordering issues. A document encoded in UTF-8 uses
> the exact same stream of bytes on any computer."
  . . .
> 2- If that were true, can you point me to some documentation about the
> math that, as Mark says, demonstrates this?

I believe Mark was referring to the bit-twiddling described in
the Design section at http://en.wikipedia.org/wiki/UTF-8 .

Raymond



More information about the Python-list mailing list