In message <4c51d3b6$0$1638$742ec2ed at news.sonic.net>, John Nagle wrote:

> UTF-8 is a stream format for Unicode. It's slightly compressed ...

“Variable-length” is not the same as “compressed”.

Particularly if you’re mainly using non-Roman scripts...
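
To make the point about non-Roman scripts concrete, here is a small sketch that compares encoded sizes. The sample strings and the helper name `encoded_sizes` are my own, picked purely for illustration; UTF-16 is used only as a convenient fixed-width-ish baseline to compare against.

```python
# Illustrative samples (assumptions, not from the original post), written as
# escapes so the exact code points are unambiguous.
SAMPLES = {
    "Latin": "hello",
    "Greek": "\u03b3\u03b5\u03b9\u03ac",
    "Devanagari": "\u0928\u092e\u0938\u094d\u0924\u0947",
    "Chinese": "\u4f60\u597d",
}


def encoded_sizes(text):
    """Return (code points, UTF-8 byte count, UTF-16 byte count) for text."""
    # "utf-16-le" is used so that no byte-order mark is added to the count.
    return (len(text), len(text.encode("utf-8")), len(text.encode("utf-16-le")))


for name, text in SAMPLES.items():
    points, utf8_len, utf16_len = encoded_sizes(text)
    print(f"{name}: {points} code points, "
          f"{utf8_len} bytes as UTF-8, {utf16_len} bytes as UTF-16")
```

For the ASCII sample UTF-8 comes out at half the size of UTF-16, but for the Devanagari and Chinese samples it is larger (three bytes per code point against two). So the encoding is variable-length, and whether it saves any space depends entirely on the script.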