[Chicago] web page content scraper

Tim Gebhardt tim at gebhardtcomputing.com
Thu Apr 10 03:40:57 CEST 2008


On Wed, Apr 9, 2008 at 6:59 PM, Massimo Di Pierro <mdipierro at cs.depaul.edu>
wrote:

> Sorry typo
>
> http://mdp.cti.depaul.edu/examples/static/scraping.py
>
> _______________________________________________
> Chicago mailing list
> Chicago at python.org
> http://mail.python.org/mailman/listinfo/chicago
>

Not sure if it'll help you, but python comes with a difflib module built in
that you may be able to use for LCS, and it's implemented in C.

http://docs.python.org/lib/module-difflib.html

-Tim Gebhardt
tim at gebhardtcomputing.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.python.org/pipermail/chicago/attachments/20080409/3772c8eb/attachment.htm 


More information about the Chicago mailing list