[Python-checkins] r79315 - in python/branches/py3k: Doc/howto/unicode.rst Doc/library/unicodedata.rst Doc/whatsnew/2.7.rst Modules/unicodedata.c
ezio.melotti
python-checkins at python.org
Tue Mar 23 00:16:42 CET 2010
Author: ezio.melotti
Date: Tue Mar 23 00:16:42 2010
New Revision: 79315
Log:
Merged revisions 79314 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r79314 | ezio.melotti | 2010-03-23 01:07:32 +0200 (Tue, 23 Mar 2010) | 1 line
Update the version number of the Unicode Database in a few more places.
........
Modified:
python/branches/py3k/ (props changed)
python/branches/py3k/Doc/howto/unicode.rst
python/branches/py3k/Doc/library/unicodedata.rst
python/branches/py3k/Doc/whatsnew/2.7.rst
python/branches/py3k/Modules/unicodedata.c
Modified: python/branches/py3k/Doc/howto/unicode.rst
==============================================================================
--- python/branches/py3k/Doc/howto/unicode.rst (original)
+++ python/branches/py3k/Doc/howto/unicode.rst Tue Mar 23 00:16:42 2010
@@ -403,7 +403,7 @@
from the above output, ``'Ll'`` means 'Letter, lowercase', ``'No'`` means
"Number, other", ``'Mn'`` is "Mark, nonspacing", and ``'So'`` is "Symbol,
other". See
-<http://unicode.org/Public/5.1.0/ucd/UCD.html#General_Category_Values> for a
+<http://www.unicode.org/reports/tr44/#General_Category_Values> for a
list of category codes.
References
Modified: python/branches/py3k/Doc/library/unicodedata.rst
==============================================================================
--- python/branches/py3k/Doc/library/unicodedata.rst (original)
+++ python/branches/py3k/Doc/library/unicodedata.rst Tue Mar 23 00:16:42 2010
@@ -15,12 +15,12 @@
This module provides access to the Unicode Character Database which defines
character properties for all Unicode characters. The data in this database is
-based on the :file:`UnicodeData.txt` file version 5.1.0 which is publicly
+based on the :file:`UnicodeData.txt` file version 5.2.0 which is publicly
available from ftp://ftp.unicode.org/.
The module uses the same names and symbols as defined by the UnicodeData File
-Format 5.1.0 (see http://www.unicode.org/Public/5.1.0/ucd/UCD.html). It defines
-the following functions:
+Format 5.2.0 (see http://www.unicode.org/reports/tr44/). It defines the
+following functions:
.. function:: lookup(name)
Modified: python/branches/py3k/Doc/whatsnew/2.7.rst
==============================================================================
--- python/branches/py3k/Doc/whatsnew/2.7.rst (original)
+++ python/branches/py3k/Doc/whatsnew/2.7.rst Tue Mar 23 00:16:42 2010
@@ -933,11 +933,13 @@
a timeout was provided and the operation timed out.
(Contributed by Tim Lesher; :issue:`1674032`.)
-* The Unicode database provided by the :mod:`unicodedata` module
- remains at version 5.1.0, but Python now uses it internally to
- determine which characters are numeric, whitespace, or represent
- line breaks. The database also now includes information from the
- :file:`Unihan.txt` data file. (Patch by Anders Chrigström
+* The Unicode database has been updated to the version 5.2.0.
+ (Updated by Florent Xicluna; :issue:`8024`.)
+
+* The Unicode database provided by the :mod:`unicodedata` is used
+ internally to determine which characters are numeric, whitespace,
+ or represent line breaks. The database also now includes information
+ from the :file:`Unihan.txt` data file. (Patch by Anders Chrigström
and Amaury Forgeot d'Arc; :issue:`1571184`.)
* The :class:`UserDict` class is now a new-style class. (Changed by
Modified: python/branches/py3k/Modules/unicodedata.c
==============================================================================
--- python/branches/py3k/Modules/unicodedata.c (original)
+++ python/branches/py3k/Modules/unicodedata.c Tue Mar 23 00:16:42 2010
@@ -1,8 +1,8 @@
/* ------------------------------------------------------------------------
- unicodedata -- Provides access to the Unicode 5.1 data base.
+ unicodedata -- Provides access to the Unicode 5.2 data base.
- Data was extracted from the Unicode 5.1 UnicodeData.txt file.
+ Data was extracted from the Unicode 5.2 UnicodeData.txt file.
Written by Marc-Andre Lemburg (mal at lemburg.com).
Modified for Python 2.0 by Fredrik Lundh (fredrik at pythonware.com)
@@ -1235,11 +1235,10 @@
"This module provides access to the Unicode Character Database which\n\
defines character properties for all Unicode characters. The data in\n\
this database is based on the UnicodeData.txt file version\n\
-5.1.0 which is publically available from ftp://ftp.unicode.org/.\n\
+5.2.0 which is publically available from ftp://ftp.unicode.org/.\n\
\n\
The module uses the same names and symbols as defined by the\n\
-UnicodeData File Format 5.1.0 (see\n\
-http://www.unicode.org/Public/5.1.0/ucd/UCD.html).");
+UnicodeData File Format 5.2.0 (see http://www.unicode.org/reports/tr44/).");
static struct PyModuleDef unicodedatamodule = {
More information about the Python-checkins
mailing list