[BangPypers] Newspaper in Python

Dhawal Joharapurkar dmjan21 at gmail.com
Fri Apr 24 09:56:30 CEST 2015


import newspaper
outfile = open('outfile.txt', 'a+)  # Create a file handle

Sc_paper = newspaper.build(u'http://scroll.in/')
for article in Sc_paper.articles:
    print(article.url)
    outfile.write(article.url)  # Write to file
    outfile.write('\n')

outfile.close()  # Close the file handle

This should work.

On Fri, Apr 24, 2015 at 1:23 PM, Nimish Dalal <nimish.s.dalal at gmail.com>
wrote:

> On Fri, Apr 24, 2015 at 12:48 PM, sshabinesh at gmail.com <
> sshabinesh at gmail.com
> > wrote:
>
> > -OR-
> >
> > import newspaper
> >
> > articles = [ ]
> > Sc_paper = newspaper.build(u'http://scroll.in/')
> >
> > for article in Sc_paper.articles:
> >     articles.append(article)
> >
> > with open("filename", "w") as f:
> >     f.write( "\n".join(articles) )
> >
> > Thanks  Shabinesh,
>
> I tried this code but gives me an error
> TypeError: sequence item 0: expected string, Article found
>
> On Fri, Apr 24, 2015 at 12:29 PM, Prince Sharma <prince09cs at gmail.com>
> > wrote:
> >
> > > By export you mean you want to create a text file for every URL?
> > > Hi, I am new to python and need help with newspaper.
> > > I am using this module as I find it easier to extract the urls from the
> > > website.
> > >
> > > Here's my code:
> > >
> > > import newspaper
> > > Sc_paper = newspaper.build(u'http://scroll.in/')
> > > for article in Sc_paper.articles:
> > >     print(article.url)
> > > http://scroll.in/...
> > > http://scroll.in/...
> > >
> > > Instead of  print I want the urls to export as a .txt file or .csv
> file.
> > >
> > > Thanks in advance.
> > >
> > >
> > > --
> > > Nimish S. Dalal
> > > Cell: 9819670499
> > > Facebook: http://www.facebook.com/nimish.s.dalal
> > > Twitter: http://twitter.com/nimishdalal <
> http://twitter.com/nimishsdalal
> > >
> > > Linkedin:http://in.linkedin.com/in/nimishsdll
> > > Url: http://www.nimishdalal.me
> > >
> > > Our generation has had no Great Depression, no Great War. Our war is a
> > > spiritual war. Our great depression is our lives.
> > > _______________________________________________
> > > BangPypers mailing list
> > > BangPypers at python.org
> > > https://mail.python.org/mailman/listinfo/bangpypers
> > > _______________________________________________
> > > BangPypers mailing list
> > > BangPypers at python.org
> > > https://mail.python.org/mailman/listinfo/bangpypers
> > >
> > _______________________________________________
> > BangPypers mailing list
> > BangPypers at python.org
> > https://mail.python.org/mailman/listinfo/bangpypers
> >
>
>
>
> --
> Nimish S. Dalal
> Cell: 9819670499
> Facebook: http://www.facebook.com/nimish.s.dalal
> Twitter: http://twitter.com/nimishdalal <http://twitter.com/nimishsdalal>
> Linkedin:http://in.linkedin.com/in/nimishsdll
> Url: http://www.nimishdalal.me
>
> Our generation has had no Great Depression, no Great War. Our war is a
> spiritual war. Our great depression is our lives.
> _______________________________________________
> BangPypers mailing list
> BangPypers at python.org
> https://mail.python.org/mailman/listinfo/bangpypers
>


More information about the BangPypers mailing list