[Python-3000] String comparison
Jim Jewett
jimjjewett at gmail.com
Thu Jun 14 19:56:20 CEST 2007
On 6/14/07, Stephen J. Turnbull <stephen at xemacs.org> wrote:
> > There are also plenty of things that a native speaker may view as a
> > single character, but which unicode treats as (at most) a Named
> > Sequence.
> Eg, the New Line Function (Unicode's name for "universal newline"),
> which can be any of the usual suspects (CR, LF, CRLF) depending on
> context.
I hadn't even thought of such abstract chracters; I was thinking of
(Normative Appendix) UAX 34: Unicode Named Character Sequences at
http://unicode.org/reports/tr34/
These are more like õ, or the NJ digraph, except that a
single-character equivalent has not been coded (and probably never
will be coded -- see
http://www.unicode.org/faq/ligature_digraph.html#3).
The current list of named sequences is available at
http://www.unicode.org/Public/UNIDATA/NamedSequences.txt
-jJ
More information about the Python-3000
mailing list