Is there a HTML parser who can reconstruct the original html EXACTLY?
kliu
ioscas at gmail.com
Wed Jan 23 08:20:32 EST 2008
On Jan 23, 7:39 pm, "A.T.Hofkamp" <h... at se-162.se.wtb.tue.nl> wrote:
> On 2008-01-23, ios... at gmail.com <ios... at gmail.com> wrote:
>
> > Hi, I am looking for a HTML parser who can parse a given page into
> > a DOM tree, and can reconstruct the exact original html sources.
>
> Why not keep a copy of the original data instead?
>
> That would be VERY MUCH SIMPLER than trying to reconstruct a parsed tree back
> to original source text.
>
> sincerely,
> Albert
Thank u for your reply. but what I really need is the mapping between
each DOM nodes and
the corresponding original source segment.
More information about the Python-list
mailing list