[Python-3000] String comparison

Jim Jewett jimjjewett at gmail.com
Thu Jun 14 19:56:20 CEST 2007


On 6/14/07, Stephen J. Turnbull <stephen at xemacs.org> wrote:
>  > There are also plenty of things that a native speaker may view as a
>  > single character, but which unicode treats as (at most) a Named
>  > Sequence.

> Eg, the New Line Function (Unicode's name for "universal newline"),
> which can be any of the usual suspects (CR, LF, CRLF) depending on
> context.

I hadn't even thought of such abstract chracters; I was thinking of
(Normative Appendix) UAX 34: Unicode Named Character Sequences at
http://unicode.org/reports/tr34/

These are more like õ, or the NJ digraph, except that a
single-character equivalent has not been coded (and probably never
will be coded -- see
http://www.unicode.org/faq/ligature_digraph.html#3).

The current list of named sequences is available at
http://www.unicode.org/Public/UNIDATA/NamedSequences.txt

-jJ


More information about the Python-3000 mailing list