[XML-SIG] Re: Issues with Unicode type

Martin v. Loewis martin@v.loewis.de
26 Sep 2002 19:47:22 +0200


Daniel Veillard <veillard@redhat.com> writes:

>   Hum, I think you would need a rewrite anyway for full conformance:
> the XML Schema regexps have more complex constructs than standard regexps.
> The quantifiers may be richer (not 100% sure, I didn't check fully),
> and the character classes/groups/categories/blocks are not part of
> "normal" regexps (well, I never saw any such description in regexp help
> or man pages before, so I doubt it appeared magically in Python).

You can get categories and blocks by mapping them onto "normal"
Unicode character classes, i.e. explicit lists of code point ranges.

For a specific version of the Unicode character database, this is a
fixed mapping.
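
A minimal sketch of such a mapping in Python, assuming the unicodedata
module that ships with the interpreter as the source of category data.
The names category_to_class, block_to_class and BLOCKS are made up for
illustration, the block table holds only two sample entries, and the scan
is limited to the BMP by default to keep it quick:

```python
import re
import unicodedata

# Illustrative block table (hypothetical name); a real one would be
# generated from Blocks.txt for the chosen Unicode version.
BLOCKS = {
    "BasicLatin": (0x0000, 0x007F),
    "Cyrillic": (0x0400, 0x04FF),
}

def _esc(cp):
    # \UXXXXXXXX escapes are understood by re in str patterns.
    return "\\U%08X" % cp

def category_to_class(category, max_cp=0xFFFF):
    """Build a plain character class for a general category.

    `category` is a full category ("Lu") or a one-letter prefix ("L").
    Only code points up to `max_cp` are scanned.
    """
    ranges = []
    start = None
    for cp in range(max_cp + 1):
        hit = unicodedata.category(chr(cp)).startswith(category)
        if hit and start is None:
            start = cp                      # a new run begins
        elif not hit and start is not None:
            ranges.append((start, cp - 1))  # the run ended at cp - 1
            start = None
    if start is not None:
        ranges.append((start, max_cp))
    if not ranges:
        raise ValueError("no code points in category %r" % category)
    parts = []
    for lo, hi in ranges:
        parts.append(_esc(lo) if lo == hi else _esc(lo) + "-" + _esc(hi))
    return "[" + "".join(parts) + "]"

def block_to_class(name):
    """Build a plain character class for a block from the table above."""
    lo, hi = BLOCKS[name]
    return "[" + _esc(lo) + "-" + _esc(hi) + "]"

# The result depends only on the database version in use.
print(unicodedata.unidata_version)
upper = re.compile(category_to_class("Lu"))
```

A translator for XML Schema patterns could then substitute the generated
class wherever it sees \p{Lu} or \p{IsBasicLatin}, and cache the result
per database version.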

Regards,
Martin