[XML-SIG] [ pyxml-Bugs-603325 ] ns_parse creates extra nested folders

noreply@sourceforge.net noreply@sourceforge.net
Sun, 01 Sep 2002 20:14:00 -0700


Bugs item #603325, was opened at 2002-09-01 20:14
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=106473&aid=603325&group_id=6473

Category: XBEL
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Nobody/Anonymous (nobody)
Summary: ns_parse creates extra nested folders

Initial Comment:
ns_parse creates unwanted extra nested folders when a
folder title contains &amp; (or any other character or
entity reference).  The cause is that characters()
assumes the character data for a title arrives in one
piece.  It may arrive in several chunks, and a new
folder is created for each chunk.
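
One way to avoid this is to buffer the chunks and build
the title only when the closing tag arrives.  The
following is a minimal sketch of that idea, not the
ns_parse code: it uses Python 3's html.parser as a
stand-in for the sgmllib driver, and FolderTitleCollector
and folder_titles are hypothetical names chosen for
illustration.

```python
from html import unescape
from html.parser import HTMLParser


class FolderTitleCollector(HTMLParser):
    """Collect one title per <H3>, however many chunks the text arrives in."""

    def __init__(self):
        # convert_charrefs=False makes the parser report entity and
        # character references separately from the surrounding text,
        # so a single title is delivered in several callbacks.
        super().__init__(convert_charrefs=False)
        self.titles = []     # finished folder titles
        self._chunks = None  # None means "not inside an <H3>"

    def handle_starttag(self, tag, attrs):
        if tag == "h3":      # tag names arrive lowercased
            self._chunks = []

    def handle_data(self, data):
        if self._chunks is not None:
            self._chunks.append(data)

    def handle_entityref(self, name):
        if self._chunks is not None:
            self._chunks.append(unescape("&%s;" % name))

    def handle_charref(self, name):
        if self._chunks is not None:
            self._chunks.append(unescape("&#%s;" % name))

    def handle_endtag(self, tag):
        # Only here, at </H3>, is the folder title complete.
        if tag == "h3" and self._chunks is not None:
            self.titles.append("".join(self._chunks))
            self._chunks = None


def folder_titles(html_text):
    """Return the list of <H3> titles found in html_text."""
    parser = FolderTitleCollector()
    parser.feed(html_text)
    parser.close()
    return parser.titles
```

With this approach, folder_titles('<H3>B&amp;B</H3>')
yields one title rather than one per chunk.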

This problem is best seen using the sgmllib driver,
because the current sgmlop driver (the usual default)
does not report the entity references, although it
does create multiple chunks when they occur.
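
The chunked delivery can be observed directly by logging
parser events.  sgmllib is no longer in the Python 3
standard library, so this sketch assumes html.parser
with convert_charrefs=False as a rough analogue of a
driver that reports entity references.  EventLogger and
trace are hypothetical names.

```python
from html.parser import HTMLParser


class EventLogger(HTMLParser):
    """Record each text and entity-reference callback in order."""

    def __init__(self):
        # Report entity references as separate events.
        super().__init__(convert_charrefs=False)
        self.events = []

    def handle_data(self, data):
        self.events.append(("data", data))

    def handle_entityref(self, name):
        self.events.append(("entityref", name))


def trace(text):
    """Parse text and return the recorded events."""
    parser = EventLogger()
    parser.feed(text)
    parser.close()
    return parser.events


print(trace("<H3>B&amp;B</H3>"))
# → [('data', 'B'), ('entityref', 'amp'), ('data', 'B')]
```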

Here is a short NS bookmarks file on which ns_parse
demonstrates the problem.  A folder that should be
titled "B&B" becomes three nested folders, titled "B",
"&", and "B":

<META HTTP-EQUIV="Content-Type" CONTENT="text/html;
charset=UTF-8">
<TITLE>Bookmarks</TITLE>
<H1>Bookmarks</H1>

<DL><p>
    <DT><H3 ADD_DATE="1027911711"
LAST_MODIFIED="1027912953"
ID="NC:BookmarksRoot#$8bfa2554">Travel</H3>
    <DL><p>
        <DT><H3 ADD_DATE="1027911720"
LAST_MODIFIED="1027911747"
ID="NC:BookmarksRoot#$8bfa2555">B&amp;B</H3>
        <DL><p>
            <DT><A
HREF="http://home.gci.net/~burl/pci/"
ADD_DATE="1027911731" LAST_VISIT="1027911379"
LAST_CHARSET="ISO-8859-1">Peters Creek Inn The Bed and
Breakfast of Distinction</A>
        </DL><p>
    </DL><p>
</DL><p>

