[Catalog-sig] distribute D.C. sprint tasks

Tarek Ziadé ziade.tarek at gmail.com
Tue Oct 21 15:11:23 CEST 2008


On Tue, Oct 21, 2008 at 3:07 PM, Chris Withers <chris at simplistix.co.uk> wrote:
> Tarek Ziadé wrote:
>>
>> On Tue, Oct 21, 2008 at 2:59 PM, Chris Withers <chris at simplistix.co.uk>
>> wrote:
>>>
>>> Martin v. Löwis wrote:
>>>>
>>>> Hmm. Yesterday, there were 199250 accesses to PyPI through wget.
>>>> Of those, 169971 requests came from a single address (from Dedibox in
>>>> France), 28966 requests from a second one (from Sakura in Japan).
>>>>
>>>> So it *is* wget mirrors that make the whole traffic in PyPI.
>>>
>>> If it were me, I'd just IP firewall the offendors. There's not need for
>>> this
>>> kind of behaviour if there's an acceptable mirror protocol available...
>>
>> Well not yet... but the PEP should be finished sometimes this week,
>
> I'm pretty sure that Martin said something was already available...

I am not sure what you are talking about, the only protocol published
is pje's documentation
on Peak, that explains how a package index should be layered + some
insights from Martin in this thread afaik.

now, both Andreas and I have worked on the topic, and even if our mirrors have
created too much hits on PyPI lately, the "clean" protocol, and the
right client behavior
that has been described by Martin will be described in the PEP and
applied in all clients
at some point. And a User-agent request header will be added to
identify clients.

Cheers
Tarek
>
> Chris
>
> --
> Simplistix - Content Management, Zope & Python Consulting
>           - http://www.simplistix.co.uk
>



-- 
Tarek Ziadé | Association AfPy | www.afpy.org
Blog FR | http://programmation-python.org
Blog EN | http://tarekziade.wordpress.com/


More information about the Catalog-SIG mailing list