Is there a HTML parser who can reconstruct the original html EXACTLY?

kliu ioscas at gmail.com
Wed Jan 23 08:20:32 EST 2008


On Jan 23, 7:39 pm, "A.T.Hofkamp" <h... at se-162.se.wtb.tue.nl> wrote:
> On 2008-01-23, ios... at gmail.com <ios... at gmail.com> wrote:
>
> >     Hi, I am looking for a HTML parser who can parse a given page into
> > a DOM tree,  and can reconstruct the exact original html sources.
>
> Why not keep a copy of the original data instead?
>
> That would be VERY MUCH SIMPLER than trying to reconstruct a parsed tree back
> to original source text.
>
> sincerely,
> Albert

Thank u for your reply. but what I really need is the mapping between
each DOM nodes and
the corresponding original source segment.



More information about the Python-list mailing list