On 14.10.2016 10:26, Serhiy Storchaka wrote:
On 13.10.16 17:50, Chris Angelico wrote:
Solution: Abolish most of the control characters. Let's define a brand new character encoding with no "alphabetical garbage". These characters will be sufficient for everyone:
-  Formatting characters: space, newline. Everything else can go.
-  Digits: 01234567
-  Lower case Latin letters a-z
-  Vital social media characters: # (now officially called
-  Can't-type-URLs-without-them: colon, slash (now called both "SLASH" and "BACKSLASH")
That's 40 characters that should cover all the important things anyone does - namely, Twitter, Facebook, and email. We don't need punctuation or capitalization, as they're dying arts and just make you look pretentious.
And then we store Python identifiers in a single 64-bit word, allow at most 20 chars per identifier and use the remaining bits for cool things like type information :-)
Not a bad idea, really.
But then again: even micro:bits support Unicode these days, so apparently there isn't much need for such memory-footprint optimizations anymore.
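
For what it's worth, here is a rough back-of-the-envelope check of the 64-bit idea, as a sketch only. It assumes the 40-symbol alphabet counted above and treats an identifier as a number in base 40; the helper names are made up for illustration and are not part of any proposal or CPython API.

```python
ALPHABET_SIZE = 40  # the count stated in the proposal quoted above
WORD_BITS = 64      # the single machine word suggested for an identifier


def max_chars_fixed_width(alphabet_size, word_bits):
    """Characters that fit when each one gets its own fixed-size bit field."""
    bits_per_char = (alphabet_size - 1).bit_length()
    return word_bits // bits_per_char


def max_chars_packed(alphabet_size, word_bits):
    """Characters that fit when the identifier is stored as one base-N number."""
    count = 0
    while alphabet_size ** (count + 1) <= 2 ** word_bits:
        count += 1
    return count


def bits_needed(alphabet_size, length):
    """Bits required to store any identifier of the given length, densely packed."""
    return (alphabet_size ** length - 1).bit_length()


if __name__ == "__main__":
    print("fixed-width chars per word:", max_chars_fixed_width(ALPHABET_SIZE, WORD_BITS))
    print("packed chars per word:", max_chars_packed(ALPHABET_SIZE, WORD_BITS))
    print("bits for 20 chars:", bits_needed(ALPHABET_SIZE, 20))
```

Under those assumptions, a 64-bit word holds at most 12 characters with dense packing (which uses all 64 bits) or 10 characters with 6-bit fields (which leaves 4 bits spare), while 20 characters would need 107 bits.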