[XML-SIG] Entity managment question --

Martin v. Loewis martin@v.loewis.de
05 May 2002 09:57:32 +0200


Dennis Allison <allison@sumeru.stanford.EDU> writes:

> The problem is recapturing the HTML-ish entities that have been converted
> to unicode.  Does such a beast exist?  And where can it be found?

Why is it desirable to restore those entities? I can offer a number of
alternatives:

- generate UTF-8 on output, then you will never ever need to create
  references
- generate character reference instead of entity references

Either approach will create well-formed HTML. If you think you must
have entity references, you can use the htmlentitydefs module to
generate *all* entity references.

Regards,
Martin