On Sun, May 3, 2009 at 1:27 PM, "Martin v. Löwis"
> If the error handler is supposed to be used for codecs other than utf-8, > perhaps it should renamed something more generic, e.g. "surrogate-escape"?
Perhaps. However, utf-8b doesn't really have to do anything with
utf-8 -
it's an algorithm based on 16-bit or 32-bit code points.
To me that lack of relationship with utf8 suggests that it should not be called utf8b
Perhaps. However, giving it that name was Markus Kuhn's choice - and while it may be confusing, it's (IMO) useful to be consistent with this background.
Regards, Martin
Ah, right. My original searches for utf8b didn't turn up much but searching on his name turns some up. Good choice of name then. http://mail.nl.linux.org/linux-utf8/2000-07/msg00040.html http://bsittler.livejournal.com/10381.html http://hyperreal.org/~est/utf-8b/ -gps