<div><span style="line-height:19.200000762939453px;font-size:13px;font-family:sans-serif">The Unicode code points in the </span><a href="http://www.unicode.org/charts/PDF/UDC00.pdf" target="_blank">U+DC00-DFFF</a><span style="line-height:19.200000762939453px;font-size:13px;font-family:sans-serif"> range</span><span style="line-height:19.200000762939453px;font-size:13px;font-family:sans-serif"> (low surrogate area) can't be encoded in UTF-8. Quoting from </span><a href="http://tools.ietf.org/html/rfc3629" target="_blank">RFC 3629</a><span style="line-height:19.200000762939453px;font-size:13px;font-family:sans-serif">:</span></div>
<div><font color="#000000" face="sans-serif"><span style="line-height:19.200000762939453px"><br></span></font></div><blockquote style="margin:0px 0px 0px 40px;border:none;padding:0px"><div><i><font color="#000000" face="sans-serif"><span style="line-height:19.200000762939453px">The definition of UTF-8 prohibits encoding character numbers between U+D800 and U+DFFF, which are reserved for use with the UTF-16 encoding form (as surrogate pairs) and do not directly represent characters.</span></font></i></div>
</blockquote><div><br></div><div>It looks like this test was doing something specific with regards to this. So, I am curious as well about this change.</div><div><br></div>
<div class="gmail_extra"><br><br><div class="gmail_quote">On Sat, Nov 3, 2012 at 10:13 AM, Antoine Pitrou <span dir="ltr"><<a href="mailto:solipsis@pitrou.net" target="_blank">solipsis@pitrou.net</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="im">On Sat, 3 Nov 2012 13:37:48 +0100 (CET)<br>
andrew.svetlov <<a href="mailto:python-checkins@python.org">python-checkins@python.org</a>> wrote:<br>
> <a href="http://hg.python.org/cpython/rev/95d1adf144ee" target="_blank">http://hg.python.org/cpython/rev/95d1adf144ee</a><br>
> changeset: 80187:95d1adf144ee<br>
> user: Andrew Svetlov <<a href="mailto:andrew.svetlov@gmail.com">andrew.svetlov@gmail.com</a>><br>
> date: Sat Nov 03 14:37:37 2012 +0200<br>
> summary:<br>
> Issue #16218: skip test if filesystem doesn't support required encoding<br>
><br>
> files:<br>
> Lib/test/test_cmd_line_script.py | 7 ++++++-<br>
> 1 files changed, 6 insertions(+), 1 deletions(-)<br>
><br>
><br>
> diff --git a/Lib/test/test_cmd_line_script.py b/Lib/test/test_cmd_line_script.py<br>
> --- a/Lib/test/test_cmd_line_script.py<br>
> +++ b/Lib/test/test_cmd_line_script.py<br>
> @@ -366,7 +366,12 @@<br>
> def test_non_utf8(self):<br>
> # Issue #16218<br>
> with temp_dir() as script_dir:<br>
> - script_basename = '\udcf1\udcea\udcf0\udce8\udcef\udcf2'<br>
> + script_basename = '\u0441\u043a\u0440\u0438\u043f\u0442'<br>
<br>
</div>Why exactly did you change the tested name here?<br>
<br>
Regards<br>
<br>
Antoine.<br>
<br>
<br>
_______________________________________________<br>
Python-Dev mailing list<br>
<a href="mailto:Python-Dev@python.org">Python-Dev@python.org</a><br>
<a href="http://mail.python.org/mailman/listinfo/python-dev" target="_blank">http://mail.python.org/mailman/listinfo/python-dev</a><br>
Unsubscribe: <a href="http://mail.python.org/mailman/options/python-dev/alexandre%40peadrop.com" target="_blank">http://mail.python.org/mailman/options/python-dev/alexandre%40peadrop.com</a><br>
</blockquote></div><br></div>