[Distutils] dump of all PyPI project metadata available?

Wes Turner wes.turner at gmail.com
Wed Jul 22 23:19:46 CEST 2015



- [ ] a flat bigquery w/ pandas.io.gbq ala GitHub Archive would be great
- [ ] it's probably worth it to add RDFa to PyPi and warehouse pages (in
addition to the auxiliary executed/extracted JSON) for #search
On Jul 22, 2015 4:08 PM, "Brett Cannon" <bcannon at gmail.com> wrote:

> When I wrote https://nothingbutsnark.svbtle.com/python-3-support-on-pypi
> I wrote a script to download every project's JSON metadata by scraping the
> simple index and then making the appropriate GET request for the JSON
> metadata. It worked, but somewhat of a hassle.
> Is there some dump somewhere that is built daily, weekly, or monthly of
> all the metadata on PyPI for offline analysis?
> _______________________________________________
> Distutils-SIG maillist  -  Distutils-SIG at python.org
> https://mail.python.org/mailman/listinfo/distutils-sig
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/distutils-sig/attachments/20150722/bee53e0e/attachment.html>

More information about the Distutils-SIG mailing list