The Unicode code points in the U+DC00-DFFF range (low surrogate area) can't be encoded in UTF-8. Quoting from RFC 3629:

The definition of UTF-8 prohibits encoding character numbers between U+D800 and U+DFFF, which are reserved for use with the UTF-16 encoding form (as surrogate pairs) and do not directly represent characters.

It looks like this test was doing something specific with regards to this. So, I am curious as well about this change.



On Sat, Nov 3, 2012 at 10:13 AM, Antoine Pitrou <solipsis@pitrou.net> wrote:
On Sat,  3 Nov 2012 13:37:48 +0100 (CET)
andrew.svetlov <python-checkins@python.org> wrote:
> http://hg.python.org/cpython/rev/95d1adf144ee
> changeset:   80187:95d1adf144ee
> user:        Andrew Svetlov <andrew.svetlov@gmail.com>
> date:        Sat Nov 03 14:37:37 2012 +0200
> summary:
>   Issue #16218: skip test if filesystem doesn't support required encoding
>
> files:
>   Lib/test/test_cmd_line_script.py |  7 ++++++-
>   1 files changed, 6 insertions(+), 1 deletions(-)
>
>
> diff --git a/Lib/test/test_cmd_line_script.py b/Lib/test/test_cmd_line_script.py
> --- a/Lib/test/test_cmd_line_script.py
> +++ b/Lib/test/test_cmd_line_script.py
> @@ -366,7 +366,12 @@
>      def test_non_utf8(self):
>          # Issue #16218
>          with temp_dir() as script_dir:
> -            script_basename = '\udcf1\udcea\udcf0\udce8\udcef\udcf2'
> +            script_basename = '\u0441\u043a\u0440\u0438\u043f\u0442'

Why exactly did you change the tested name here?

Regards

Antoine.


_______________________________________________
Python-Dev mailing list
Python-Dev@python.org
http://mail.python.org/mailman/listinfo/python-dev
Unsubscribe: http://mail.python.org/mailman/options/python-dev/alexandre%40peadrop.com