[Chicago] web page content scraper

Carl Karsten carl at personnelware.com
Tue Apr 8 15:47:36 CEST 2008


Sometime in the last year, a parser was demoed that first analyzes examples of 
what is to be parsed and builds a template from them, so that the static parts 
can be ignored.  The example was scraping Wikipedia: it identified the title 
and content without anyone having to wade through the HTML.
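
To make the idea concrete, here is a rough sketch in Python of that kind of
template learning.  It is not the demoed tool (whose name is the open question
here) and it is almost certainly simpler than whatever that tool does.  It
assumes two example pages are enough to learn from, and the function names
(tokenize, learn_template, extract_dynamic) are made up for illustration.  It
uses only the standard library's difflib and re modules.

```python
import difflib
import re


def tokenize(html):
    # Split into tags and text runs, dropping empty/whitespace-only pieces.
    return [t for t in re.split(r"(<[^>]+>)", html) if t.strip()]


def learn_template(example_a, example_b):
    """Return the tokens that both example pages share, in order.

    Assumption: whatever is identical across the examples is static
    boilerplate, and whatever differs is the content we care about.
    """
    a, b = tokenize(example_a), tokenize(example_b)
    matcher = difflib.SequenceMatcher(None, a, b, autojunk=False)
    template = []
    for block in matcher.get_matching_blocks():
        template.extend(a[block.a:block.a + block.size])
    return template


def extract_dynamic(template, page):
    """Return the tokens of `page` that are not covered by the template."""
    tokens = tokenize(page)
    matcher = difflib.SequenceMatcher(None, template, tokens, autojunk=False)
    dynamic = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag in ("insert", "replace"):
            dynamic.extend(tokens[j1:j2])
    return dynamic


if __name__ == "__main__":
    # Hypothetical pages standing in for real scraped HTML.
    layout = ("<html><head><title>%s</title></head>"
              "<body><h1>%s</h1><p>%s</p></body></html>")
    page1 = layout % ("Apple", "Apple", "A fruit.")
    page2 = layout % ("Banana", "Banana", "A yellow fruit.")
    page3 = layout % ("Cherry", "Cherry", "A small red fruit.")

    template = learn_template(page1, page2)
    print(extract_dynamic(template, page3))
    # prints ['Cherry', 'Cherry', 'A small red fruit.']
```

A real tool would presumably learn from more than two examples and label the
extracted pieces (title, body, and so on); this sketch only separates the
parts that vary from the parts that do not.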

Now I can't find it.  Does anyone know what it was called?

Carl K




