[Catalog-sig] Proposal: Move PyPI static data to the cloud for better availability
Terry Reedy
tjreedy at udel.edu
Wed Jun 16 21:27:09 CEST 2010
On 6/16/2010 8:20 AM, M.-A. Lemburg wrote:
> Antoine Pitrou wrote:
>> Tarek Ziadé<ziade.tarek<at> gmail.com> writes:
>>>
>>> And we happen to have this network already: lots of people
>>> will host a PyPI mirror as soon as it's easy to set one imho.
>>
>> You must be careful that the mirrors are properly managed and administered,
>> though. Having stale/dysfunctioning mirrors is worse than having no mirrors at
>> all.
>> It is likely that some people will setup a mirror and then "forget" to take care
>> about it. Like our buildbots really.
>
> Right, it's that administration overhead I was referring to.
>
> Perhaps we should just let the users decide:
>
> a) they use the default PyPI access (which we then enhance by
> caching the content in the cloud)
>
> b) they setup their easy_install or zc.buildout to pull data
> from a mirror network by enabling a configuration option
>
> Since implementing option b) will require updating existing
> package tools on the client side anyway, the extra configuration
> shouldn't be a problem.
>
> Option a) requires no changes whatsoever on the client side.
It seems to me that:
If the problem of availability with pypi is anything like the problems
with bugs... and extending 'pypi...' with cloud service could be done
relatively quickly (within a month), then that seems reasonable.
If 'free to psf' mirrors are feasible and needed, then they will still
be useful, especially is high-download regions. Since Amazon's cloud
service is metered on a region by region basis, any off-loading of
demand to regional mirrors will reduce PSF charges. Based on what I have
read in the thread, I would not be surprised if full mirror deployment
takes a year. After that, the cloud service could remain to pick up
slack in a region should the mirror in a region go down.
Any move to incremental update from time-based replacement will benefit
either system.
Terry Jan Reedy
More information about the Catalog-SIG
mailing list