[BangPypers] How to compare the relevancy between news headlines?

Anand Balachandran Pillai abpillai at gmail.com
Wed Jun 15 09:12:28 CEST 2011


On Wed, Jun 15, 2011 at 8:25 AM, Gopalakrishnan Subramani <
gopalakrishnan.subramani at gmail.com> wrote:

> Thank you, Anand. OpenCalais seems more promising. I will play around with
> it this weekend.
>

If you want any code to start with, email me. I can share what I wrote
with you. python-calais is pretty much full featured, so you shouldn't
have any problems running it.
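
A rough illustration of the entity-based idea (a sketch only, not the
Calais code mentioned above): once each headline has a set of extracted
entities, the relevancy between two headlines can be approximated by how
much those sets overlap. The entity sets below are hand-written
placeholders standing in for what an extraction service would return,
and the names jaccard, rank_related and SAMPLE_ENTITIES are hypothetical.

```python
def jaccard(a, b):
    """Jaccard similarity of two collections: |a & b| / |a | b|."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        return 0.0  # two empty sets: treat as unrelated
    return len(a & b) / len(union)


# Hypothetical, hand-written entity sets. A real run would fill these in
# from an entity extraction service instead.
SAMPLE_ENTITIES = {
    "Apple unveils iCloud at WWDC": {"Apple", "iCloud", "WWDC"},
    "Steve Jobs introduces iCloud": {"Steve Jobs", "iCloud", "Apple"},
    "Monsoon reaches Kerala": {"Monsoon", "Kerala"},
}


def rank_related(headline, entity_map):
    """Return (score, other_headline) pairs, most related first."""
    target = entity_map[headline]
    scored = [
        (jaccard(target, entities), other)
        for other, entities in entity_map.items()
        if other != headline
    ]
    return sorted(scored, reverse=True)


if __name__ == "__main__":
    for score, other in rank_related("Apple unveils iCloud at WWDC", SAMPLE_ENTITIES):
        print("%.2f  %s" % (score, other))
```

Set overlap ignores how important each entity is, so it is only a
starting point for experimenting with the idea.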


>
> --
>
> Gopal
>
> On Tue, Jun 14, 2011 at 5:44 PM, Anand Balachandran Pillai <
> abpillai at gmail.com> wrote:
>
> > On Tue, Jun 14, 2011 at 5:16 PM, Gopalakrishnan Subramani <
> > gopalakrishnan.subramani at gmail.com> wrote:
> >
> > > Dear All,
> > >
> > > Thanks for the very informative answers. I will follow up on NLT and
> > > also try out Patra's advice. I will post my updates.
> > >
> >
> > You can also do this by using the entity extraction/categorization
> > provided by online Linked-data/clustering providers. One such service
> > is OpenCalais, provided by Thomson Reuters.
> >
> > I had played around with Calais one year back and found it very useful
> > to learn about Linked-data and entity extraction.
> >
> > In fact, there is an already existing news clustering service written
> > on top of OpenCalais.
> >
> > http://www.opencalais.com/applications/interceder
> >
> > Klezio.com was also a very good example of this, but now it seems
> > to be down.
> >
> > In fact, I cooked up an application to pull out the cost of mobile
> > phones using OpenCalais and demoed it in a BangPypers meeting
> > some time back. Here is an archive of it:
> >
> > http://web.archiveorange.com/archive/v/nwVbqtZ9jcT56TZEpxjw
> >
> > HTH.
> >
> > --Anand
> >
> >
> >
> > > --
> > >
> > > Gopal
> > >
> > > On Tue, Jun 14, 2011 at 4:55 PM, Gora Mohanty <gora at mimirtech.com> wrote:
> > >
> > > > On Tue, Jun 14, 2011 at 1:28 PM, Gopalakrishnan Subramani
> > > > <gopalakrishnan.subramani at gmail.com> wrote:
> > > > > Thanks for the suggestion. Can you give me a specific NLT
> > > > > toolset/approach with an example, if you have experience already?
> > > >
> > > > Try clustering with Python NLTK ( http://www.nltk.org/ ).
> > > > NLTK is well documented, and what you are looking for
> > > > will probably be available from nltk.cluster. The site has
> > > > a free, downloadable book, and extensive documentation.
> > > > See, e.g.,
> > > > http://www.opendocs.net/nltk/0.9.5/api/nltk.cluster-module.html
> > > >
> > > > If you are willing to go away from Python, there are various
> > > > options, such as:
> > > > * Carrot 2: http://search.carrot2.org/stable/search
> > > > * Mahout: http://lucene.apache.org/mahout/
> > > >
> > > > Regards,
> > > > Gora
> > > > _______________________________________________
> > > > BangPypers mailing list
> > > > BangPypers at python.org
> > > > http://mail.python.org/mailman/listinfo/bangpypers
> > > >
> > >
> >
> >
> >
> > --
> > --Anand
> >
>
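
For the clustering route suggested in the quoted thread, the underlying
idea can be sketched without any library: turn each headline into a bag
of words, compare headlines with cosine similarity, and group the ones
that are close. This is plain Python rather than nltk.cluster, and the
stop-word list, the 0.3 threshold and all function names are assumptions
made for illustration.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption, not a standard one).
STOP_WORDS = {"a", "an", "the", "of", "in", "on", "at", "to", "for", "and"}


def bag_of_words(headline):
    """Lowercase the headline and count its non-stop-word tokens."""
    words = re.findall(r"[a-z0-9]+", headline.lower())
    return Counter(w for w in words if w not in STOP_WORDS)


def cosine(c1, c2):
    """Cosine similarity between two word-count vectors."""
    dot = sum(count * c2[word] for word, count in c1.items() if word in c2)
    norm1 = math.sqrt(sum(v * v for v in c1.values()))
    norm2 = math.sqrt(sum(v * v for v in c2.values()))
    if norm1 == 0 or norm2 == 0:
        return 0.0
    return dot / (norm1 * norm2)


def group_headlines(headlines, threshold=0.3):
    """Greedy one-pass grouping.

    Each headline joins the first group whose first member is at least
    `threshold` similar to it; otherwise it starts a new group.
    """
    groups = []
    for headline in headlines:
        vector = bag_of_words(headline)
        for group in groups:
            if cosine(vector, bag_of_words(group[0])) >= threshold:
                group.append(headline)
                break
        else:
            groups.append([headline])
    return groups


if __name__ == "__main__":
    sample = [
        "Apple unveils iCloud at WWDC",
        "Apple iCloud launch draws crowds at WWDC",
        "Monsoon reaches Kerala coast",
    ]
    for group in group_headlines(sample):
        print(group)
```

The greedy pass depends on input order and on the chosen threshold, so a
proper clustering algorithm would be the next step once the similarity
measure looks reasonable.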



-- 
--Anand

