Generic web parser
nytrokiss at gmail.com
Sun May 17 05:51:55 EDT 2009
I don't see the issue with using urllib and SQLite for everything you mention.
On Sat, May 16, 2009 at 4:18 PM, S.Selvam <s.selvamsiva at gmail.com> wrote:
> Hi all,
> I have to design a web parser which will visit a given list of websites and
> fetch a particular set of details.
> It has to be generic enough that even if we add new websites, it still
> fetches those details if they are available anywhere.
> So it must be something like a framework.
> I have written some parsers, but they parse a given format (e.g. they get
> the data from the <title> tag). Here, each website may have a different
> format, and the information may be available within any tag.
> I know it's a tough task for me, but I feel with Python it should be
> My request is: if such a thing is already available, please let me know. Also,
> your suggestions are welcome.
> Note: I plan to use BeautifulSoup for parsing.
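For what it's worth, here is a rough sketch of how the pieces could fit together using only the standard library. It is a minimal illustration under assumptions, not a finished framework: the names TextCollector, PATTERNS, extract_details, fetch and store are made up for this example, the two regular expressions (email and price) are placeholders for whatever "details" you actually need, and html.parser stands in for BeautifulSoup only to keep the example self-contained. The idea is to ignore the page layout entirely: collect the text from every tag, then search that text for the fields you care about.

```python
import re
import sqlite3
import urllib.request
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Collect visible text from every tag, whatever the page layout is."""

    SKIP = {"script", "style"}  # text inside these tags is not page content

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


# Hypothetical field patterns: which details to look for is an assumption.
PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "price": re.compile(r"\$\d+(?:\.\d{2})?"),
}


def extract_details(html):
    """Return {field: [matches]} found anywhere in the page text."""
    collector = TextCollector()
    collector.feed(html)
    collector.close()
    text = " ".join(collector.chunks)
    return {name: pattern.findall(text) for name, pattern in PATTERNS.items()}


def fetch(url):
    """Download a page with urllib and decode it leniently."""
    with urllib.request.urlopen(url, timeout=10) as resp:
        return resp.read().decode("utf-8", errors="replace")


def store(conn, site, details):
    """Save one row per (site, field, value) in an SQLite table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS details (site TEXT, field TEXT, value TEXT)"
    )
    rows = [(site, field, value)
            for field, values in details.items()
            for value in values]
    conn.executemany("INSERT INTO details VALUES (?, ?, ?)", rows)
    conn.commit()
```

Adding a new website would then just mean adding its URL to the list you loop over with fetch(), extract_details() and store(); adding a new kind of detail would mean adding an entry to PATTERNS. Real sites will need more care (encodings, details that only make sense relative to nearby markup), which is where BeautifulSoup may earn its keep.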
More information about the Python-list