[Python-Dev] bytes / unicode

Glyph Lefkowitz glyph at twistedmatrix.com
Wed Jun 23 03:01:17 CEST 2010


On Jun 22, 2010, at 8:57 PM, Robert Collins wrote:

> bzr has a cache of decoded strings in it precisely because decode is
> slow. We accept slowness encoding to the users locale because thats
> typically much less data to examine than we've examined while
> generating the commit/diff/whatever. We also face memory pressure on a
> regular basis, and that has been, at least partly, due to UCS4 - our
> translation cache helps there because we have less duplicate UCS4
> strings.

Thanks for setting the record straight - apologies if I missed this earlier in the thread.  It does seem vaguely familiar.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-dev/attachments/20100622/889be45e/attachment-0001.html>


More information about the Python-Dev mailing list