[Python-Dev] PEP 383: Non-decodable Bytes in System Character Interfaces

Cameron Simpson cs at zip.com.au
Wed Apr 29 23:49:31 CEST 2009


On 29Apr2009 17:03, Terry Reedy <tjreedy at udel.edu> wrote:
> Thomas Breuel wrote:
>>     Sure. However, that requires you to provide meaningful, reproducible
>>     counter-examples, rather than a stenographic formulation that might
>>     hint some problem you apparently see (which I believe is just not
>>     there).
>>
>> Well, here's another one: PEP 383 would disallow UTF-8 encodings of 
>> half surrogates. 
>
> By my reading, the current Unicode 5.1 definition of 'UTF-8' disallows that.

5.0 also disallows it. No surprise I guess.
-- 
Cameron Simpson <cs at zip.com.au> DoD#743
http://www.cskk.ezoshosting.com/cs/

Out on the road, feeling the breeze, passing the cars.  - Bob Seger


More information about the Python-Dev mailing list