Generic web parser

James Matthews nytrokiss at gmail.com
Sun May 17 05:51:55 EDT 2009


I don't see the issue of using urllib and Sqllite for everything you mention
here.

On Sat, May 16, 2009 at 4:18 PM, S.Selvam <s.selvamsiva at gmail.com> wrote:

> Hi all,
>
> I have to design web parser which will visit the given list of websites and
> need to fetch a particular set of details.
> It has to be so generic that even if we add new websites, it must fetch
> those details if available anywhere.
> So it must be something like a framework.
>
> Though i have done some parsers ,but they will parse for a given
> format(For. eg It will get the data from <title> tag).But here each website
> may have different format and the information may available within any tags.
>
> I know its a tough task for me,but i feel with python it should be
> possible.
> My request is, if such thing is already available please let me know ,also
> your suggestions are welcome.
>
> Note: I planned to use BeautifulSoup for parsing.
>
> --
> Yours,
> S.Selvam
>
> --
> http://mail.python.org/mailman/listinfo/python-list
>
>


-- 
http://www.goldwatches.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-list/attachments/20090517/b1512f57/attachment.html>


More information about the Python-list mailing list