Looking for some PyPI query help

Travis Griggs travisgriggs at gmail.com
Wed Feb 23 01:53:51 EST 2011


Howdy Python brethren. I'm a Smalltalker doing a bit of research on packaging ecospaces in other languages and environments (I just finished examining Debian, for example). I found what seems to be the big repository at PyPI. What would be enough for me is to enumerate all of the packages there and associate each one with the number of prerequisites/dependencies/imports it specifies. I don't need to recurse through the dependencies, just the immediate count.

My goal is to histogram the data and answer questions along the following lines:
1) What's the average/median dependency count in the Python ecospace?
2) For a given threshold (say 90%) when sorted by count, what's the count (e.g. "90% of Python packages depend directly on 5 or fewer other packages")?
3) What's the largest number of dependencies any package has ever specified?
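
To make the three questions concrete, here is a minimal sketch of the computation I have in mind. It assumes I already have a mapping from package name to direct dependency count; getting that mapping out of PyPI is the part I need help with. The function name `summarize`, the nearest-rank percentile rule, and the toy data are my own assumptions for illustration, not real PyPI numbers.

```python
import math
import statistics


def summarize(dep_counts, threshold=0.90):
    """Summarize direct-dependency counts (package name -> count)."""
    counts = sorted(dep_counts.values())
    if not counts:
        raise ValueError("no packages to summarize")
    # Nearest-rank percentile: the smallest count such that at least
    # `threshold` of the packages have that many dependencies or fewer.
    rank = max(1, math.ceil(threshold * len(counts)))
    return {
        "mean": statistics.mean(counts),       # question 1 (average)
        "median": statistics.median(counts),   # question 1 (median)
        "threshold_count": counts[rank - 1],   # question 2
        "max": counts[-1],                     # question 3
    }


# Made-up placeholder data, NOT real PyPI numbers.
toy = {"alpha": 0, "beta": 1, "gamma": 1, "delta": 2, "epsilon": 7}
summary = summarize(toy)
```

With real data, the same sorted list of counts would also feed the histogram directly.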

Thanks for any help or pointers or hints or data you can give me.

(I apologize if this is overly naive.)

--
Travis Griggs
Objologist
"I did not have time to write you a short program, so I wrote you a long one instead."



