[XML-SIG] Re: Issues with Unicode type
Martin v. Loewis
martin@v.loewis.de
26 Sep 2002 19:49:54 +0200
Eric van der Vlist <vdv@dyomedea.com> writes:
> As for the character classes/group/category/blocks, I was wondering if
> they couldn't be described and generated with chargen.py.
No; this doesn't parse the Unicode character database;
Tools/unicode/makeunicodedata.py parses the Unicode character
database.
Generating regexes for classes is straight-forward from
that. Generating regexes for blocks is not possible, since the
standard Unicode database file does not list the blocks; that's a
different file (AFAIK).
Regards,
Martin