[Tutor] UTF-8 filenames encountered in os.walk
Kent Johnson
kent37 at tds.net
Thu Jul 5 03:05:43 CEST 2007
Terry Carroll wrote:
> I'm just saying that UTF-8 encodes ascii characters to themselves; but
> UTF-8 is not the same as ascii.
>
> I think we're ultimately saying the same thing; to merge both our ways of
> putting it, I think, is that ascii will map to UTF-8 identically; but
> UTF-8 may map back or it will raise UnicodeDecodeError.
>
> I just didn't want to leave the impression "Yeah, UTF-8 & ascii, they're
> the same thing."
I hope neither of us gave that impression! I think you are right; we
just have different ways of thinking about it. Any ASCII string is also
a valid UTF-8 string (and a valid Latin-1 string, and valid in many
other encodings), but the converse is not true.
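
Here is a rough sketch of the point in Python 3 syntax. The file names
and the helper decodes_as_ascii() are made up for illustration; they are
not from the original os.walk code.

```python
# Pure ASCII bytes decode to the same text under all three codecs.
ascii_bytes = b"hello.txt"
decoded = [ascii_bytes.decode(enc) for enc in ("ascii", "utf-8", "latin-1")]

# A name with a non-ASCII character, encoded as UTF-8, contains bytes
# above 0x7F, which the ascii codec refuses to decode.
utf8_bytes = "caf\u00e9.txt".encode("utf-8")


def decodes_as_ascii(data):
    """Return True if the given bytes can be decoded as ASCII."""
    try:
        data.decode("ascii")
        return True
    except UnicodeDecodeError:
        return False


print(decoded)
print(decodes_as_ascii(ascii_bytes))
print(decodes_as_ascii(utf8_bytes))
```

So going from ASCII to UTF-8 always works, while going from UTF-8 back to
ASCII works only when every byte happens to be in the ASCII range.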
Kent