On Wed, Feb 8, 2017, at 11:06 PM, Wes Turner wrote:
> So, IIUC,
> you're looking to emit
> ((URL, release, platform), namespaces_odict)
> for each new and all existing packages;
> by uncompressing every package and running every setup.py (hopefully in a container)?
Something like that, yes. For packages that publish wheels, we can analyse those directly without needing to run setup.py. Of course there are many packages with only sdists published.
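
As a rough sketch of what analysing a wheel directly could look like: a wheel is a zip archive, so the top-level import names can be read off the file listing without executing anything. The function name `top_level_names` is made up for illustration, and this ignores subtleties such as namespace packages split across distributions and `.pth` files.

```python
import zipfile
from pathlib import PurePosixPath


def top_level_names(wheel):
    """Return the sorted top-level import names found in a wheel.

    `wheel` is a filesystem path or a binary file object.
    Only the archive listing is read; no code from the package is run.
    """
    names = set()
    with zipfile.ZipFile(wheel) as zf:
        for entry in zf.namelist():
            parts = PurePosixPath(entry).parts
            if not parts:
                continue
            first = parts[0]
            # Skip the wheel's own metadata and data directories.
            if first.endswith((".dist-info", ".data")):
                continue
            if len(parts) == 1:
                # Top-level entry: keep only plain or extension modules,
                # and strip the suffix (and any ABI tag) from the name.
                if not first.endswith((".py", ".so", ".pyd")):
                    continue
                first = first.split(".", 1)[0]
            names.add(first)
    return names and sorted(names) or []
```

Calling it on a downloaded `.whl` file gives a list of names that could go into the namespaces part of the tuple above. Sdists don't have a predictable layout like this, which is why they are the harder case.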
> Could this (namespace extraction) be added to 'setup.py build' for the future?
Potentially. As I mentioned, there is a place in the metadata to put this information - the 'Provides' field. However, relying on package uploaders would take a long time to build up decent coverage of the available packages, so I'm inclined to focus on scanning PyPI, similar to the tool Chris already showed.

Thomas