[XML-SIG] Re: Issues with Unicode type

Martin v. Loewis martin@v.loewis.de
26 Sep 2002 19:49:54 +0200


Eric van der Vlist <vdv@dyomedea.com> writes:

> As for the character classes/group/category/blocks, I was wondering if
> they couldn't be described and generated with chargen.py.

No; chargen.py doesn't parse the Unicode character database.
Tools/unicode/makeunicodedata.py is the script that parses it.

Generating regexes for classes is straightforward from that
data. Generating regexes for blocks is not possible from it, since the
main Unicode database file (UnicodeData.txt) does not list the blocks;
those are in a different file (Blocks.txt, AFAIK).
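
To illustrate the class case, here is a minimal sketch. It is an
assumption-laden example, not what makeunicodedata.py itself does: it
reads general categories through Python's unicodedata module (whose
tables are generated from the database) instead of parsing the database
file directly. The names category_ranges and category_regex are
hypothetical, and the scan is limited to the BMP by default to keep it
quick.

```python
import re
import unicodedata


def category_ranges(category, limit=0x10000):
    """Return inclusive (first, last) code point ranges whose general
    category starts with `category` (e.g. "Nd", or "L" for all letters).

    `limit` is an illustrative cut-off (BMP only by default).
    """
    ranges = []
    start = None
    for cp in range(limit):
        if unicodedata.category(chr(cp)).startswith(category):
            if start is None:
                start = cp          # open a new run
        elif start is not None:
            ranges.append((start, cp - 1))  # close the current run
            start = None
    if start is not None:
        ranges.append((start, limit - 1))
    return ranges


def _escape(cp):
    # \uXXXX / \UXXXXXXXX escapes keep the generated pattern pure ASCII.
    return "\\u%04X" % cp if cp <= 0xFFFF else "\\U%08X" % cp


def category_regex(category, limit=0x10000):
    """Build a regex character class matching one character of `category`."""
    parts = []
    for first, last in category_ranges(category, limit):
        if first == last:
            parts.append(_escape(first))
        else:
            parts.append(_escape(first) + "-" + _escape(last))
    return "[" + "".join(parts) + "]"


if __name__ == "__main__":
    digits = re.compile(category_regex("Nd"))
    print(bool(digits.fullmatch("7")), bool(digits.fullmatch("a")))
```

The same range-collapsing step would apply to blocks once their
start and end code points are read from the separate blocks file.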

Regards,
Martin