[Chicago] web page content scraper

Tom Printy tprinty at mail.edisonave.net
Tue Apr 8 16:25:59 CEST 2008


Wow this library is super cool. Anyone got slides or notes from the
talk?

On Tue, 2008-04-08 at 08:54 -0500, Lukasz Szybalski wrote:
> http://code.google.com/p/templatemaker/
> 
> On Tue, Apr 8, 2008 at 8:47 AM, Carl Karsten <carl at personnelware.com> wrote:
> > In the last year a parser was demoed that would first analyze examples of what
> >  was to be parsed, and create a template so that the static parts could be
> >  ignored.  The example was scraping wikipedia, and being able to get the title
> >  and content identified without having to wade thought the html.
> >
> >  Now I cant find it.  Anyone know what it was called?
> >
> >  Carl K
> >
> >
> >
> >  _______________________________________________
> >  Chicago mailing list
> >  Chicago at python.org
> >  http://mail.python.org/mailman/listinfo/chicago
> >
> 
> 
> 



More information about the Chicago mailing list