[I18n-sig] Character Model for the World Wide Web
M.-A. Lemburg
mal@lemburg.com
Thu, 20 Dec 2001 20:15:51 +0100
Misha.Wolf@reuters.com wrote:
>
> I'm very pleased to be able to announce the publication of a new Working
> Draft of the Character Model for the World Wide Web:
> http://www.w3.org/TR/charmod/
>
> An extract from the document follows:
>
> Abstract
>
> This Architectural Specification provides authors of specifications,
> software developers, and content developers with a common reference for
> interoperable text manipulation on the World Wide Web. Topics addressed
> include encoding identification, early uniform normalization, string
> identity matching, string indexing, and URI conventions, building on the
> Universal Character Set, defined jointly by Unicode and ISO/IEC 10646.
> Some introductory material on characters and character encodings is also
> provided.
Looks like we'll need Unicode normalization support in Python
soon if we want to be at least partly compatible with this
proposed standard.
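
To make the point concrete, here is a minimal sketch of the kind of
string identity matching the draft describes. It assumes the
unicodedata.normalize() function that later Python releases ship in the
standard library; the helper name nfc_equal is made up for this
illustration, and choosing NFC as the form is an assumption, not
something taken from the announcement above.

```python
import unicodedata


def nfc_equal(a, b):
    """Return True if a and b are identical after NFC normalization.

    nfc_equal is a hypothetical helper for illustration only.
    """
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# Two encodings of the same visible text, "e" with an acute accent:
composed = "\u00e9"      # one code point: LATIN SMALL LETTER E WITH ACUTE
decomposed = "e\u0301"   # two code points: "e" + COMBINING ACUTE ACCENT

# A plain comparison treats them as different strings.
print(composed == decomposed)

# Normalizing both sides first makes the comparison succeed.
print(nfc_equal(composed, decomposed))
```

Without normalization, the two spellings compare unequal even though
they render identically, which is exactly the interoperability problem
that early uniform normalization is meant to avoid.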
--
Marc-Andre Lemburg
CEO eGenix.com Software GmbH
______________________________________________________________________
Company & Consulting: http://www.egenix.com/
Python Software: http://www.egenix.com/files/python/