[Python-ideas] Adding 'bytes' as alias for 'latin_1' codec.
Terry Reedy
tjreedy at udel.edu
Thu Jun 2 08:30:28 CEST 2011
On 6/2/2011 1:37 AM, Nick Coghlan wrote:
> On Thu, Jun 2, 2011 at 3:58 AM, Ethan Furman<ethan at stoneleaf.us> wrote:
>> A byte stream with multiple encodings? Now *that* seems wrong!
>
> Unicode encodings are just one serialisation format specific to text
> data. bytes objects may contain *any* serialisation format (e.g. zip
> archives, Python pickles, Python marshal files, packed binary data,
> innumerable wire protocols both standard and proprietary).
One result of this thread is that I see much better the value of
separating the ancient human level concepts of character and text from
the (3) decades old computer concept of byte. Numbers, lists, and dicts
are other old human concepts. As Nick implies above, bytes (or bits
within them) are used to encode all data for computer processing.
The confusion of character with byte in the original design of Python
both privileged and burdened text processing.
--
Terry Jan Reedy
More information about the Python-ideas
mailing list