[Catalog-sig] simple package index has links back into the human interface

Jim Fulton jim at zope.com
Tue Aug 28 00:59:53 CEST 2007

A while ago, I created an experimental PyPI mirror:


Recently, I've been working on a mirror of the new simple index:


This mirrors the pages at:


In experimenting with this, I found that buildouts were taking much  
longer (e.g. 70 second vs 40 seconds) using the simpler mirror than  
using the ppix mirror.  I added some additional logging and found  
that when using the simple index, buildout was getting a lot of non- 
simple pages.

A common practice is to use the package index page for a project as  
the project home page.  There's no point in a simple page including a  
link to the non-simple page as it contains the same or less  
information.  I filter these pages out in the ppix index.  The simple  
index doesn't. For example, the simple page for zc.buildout:


has home page links to http://www.python.org/pypi/zc.buildout.

Martin, can you filter links like this out of the simple output?  (If  
not, I'll filter them out when I mirror.)


Jim Fulton			mailto:jim at zope.com		Python Powered!
CTO 				(540) 361-1714			http://www.python.org
Zope Corporation	http://www.zope.com		http://www.zope.org

More information about the Catalog-SIG mailing list