[Catalog-sig] How about a dedicated web service mirror?

Daniel Greenfeld pydanny at gmail.com
Tue Apr 19 07:22:13 CEST 2011


Thanks for this link but I'm not sure about what the entries mean.
Using some entries from SqlAlchemy as an example:


I guess the first column is the package identifier, the second is the
file + ???, and the fourth column the download count? Am I close?

Is there any formal documentation for this portion of PyPI? I'm know
this is easier on PyP's server but I'm hesitant to interact with an
undocumented data source. If documentation doesn't exist for it, I'm
happy enough to write the formal documentation once I understand it.

Also, other tools are using the XMLRPC API to make large requests
against PyPI. http://pypi.python.org/pypi/vanity and
http://pypi.appspot.com come to mind.



On Mon, Apr 18, 2011 at 1:30 PM, "Martin v. Löwis" <martin at v.loewis.de> wrote:
> Am 18.04.2011 22:20, schrieb Daniel Greenfeld:
>> All the packages need to have their download numbers checked for metrics.
> Please do that incrementally, downloading a single file per day from
> http://pypi.python.org/stats/days/
> All files displayed there have been incorporated in the total stats,
> so if you download the total stats once, then you can start downloading
> the incremental changes the next day. Notice that some of the mirrors
> are a few days behind in generating these files, so even the older files
> may still change after first being generated (so you may need to fetch
> three or so files, or the monthly one).
> Regards,
> Martin

'Knowledge is Power'
Daniel Greenfeld

More information about the Catalog-SIG mailing list