[XML-SIG] Possible bug in sgmlop (character entityrefs in attributes)

Uche Ogbuji uche.ogbuji@fourthought.com
Fri, 27 Apr 2001 17:57:34 -0600


It seems that sgmlop.c does not handle character entity references in
attributes.  If the following HTML is parsed using 4DOM's HtmlLib reader,
which uses Sgmlop.py, which uses sgmlop.c from PyXML, the handle_entityref
callback is not invoked for the "&Auml;" in the attribute, although it is
invoked for the one in ordinary character data.

<HTML>
  <HEAD>
    <TITLE>Character entities in attributes</TITLE>
  </HEAD>
  <BODY>
    <FORM>
      &Auml;<INPUT VALUE='&Auml;RGERLICH!' NAME='murx' TYPE='text'>
    </FORM>
  </BODY>
</HTML>
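
One possible stopgap would be to expand the references by hand when the
start tag is handled.  The following is only a minimal sketch, assuming the
attribute values arrive as strings with the references still unexpanded
(I have not verified that against sgmlop).  expand_entityrefs and
expand_attrs are hypothetical helpers, not part of sgmlop, PyXML or 4DOM,
and the entity table is the standard library's html.entities
(htmlentitydefs in Python 2).

```python
import re
from html.entities import name2codepoint  # standard HTML 4 entity table

# Matches named references such as "&Auml;"; the trailing ";" is required.
_ENTITYREF = re.compile(r"&([A-Za-z][A-Za-z0-9]*);")


def expand_entityrefs(value):
    """Replace known named entity references in an attribute value.

    Hypothetical helper. Unknown names are left untouched.
    """
    def replace(match):
        name = match.group(1)
        if name in name2codepoint:
            return chr(name2codepoint[name])
        return match.group(0)  # not a known entity: keep the raw text

    return _ENTITYREF.sub(replace, value)


def expand_attrs(attrs):
    """Apply expand_entityrefs to each (name, value) pair of a start tag."""
    return [(name, expand_entityrefs(value)) for name, value in attrs]
```

A start-tag handler could pass its attribute list through expand_attrs
before building the node.  Numeric references (such as "&#196;") are not
covered by this sketch.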

I don't know enough about sgmlop to easily sort this out myself.

Any ideas?

Thanks


-- 
Uche Ogbuji                               Principal Consultant
uche.ogbuji@fourthought.com               +1 303 583 9900 x 101
Fourthought, Inc.                         http://Fourthought.com 
4735 East Walnut St, Ste. C, Boulder, CO 80301-2537, USA
Software-engineering, knowledge-management, XML, CORBA, Linux, Python