extract news article from web

Zhang Le sigu4wa02 at sneakemail.com
Wed Dec 22 23:11:38 CET 2004


Thanks for the hint. The xml-rpc service is great, but I want some
general techniques to parse news information in the usual html pages.

Currently I'm looking at a script-based approach found at:
http://www.namo.com/products/handstory/manual/hsceditor/
User can write some simple template to extract certain fields from a
web page. Unfortunately, it is not open source, so I can not look
inside the blackbox.:-(

Zhang Le




More information about the Python-list mailing list