[Catalog-sig] Ickiness

Phillip J. Eby pje at telecommunity.com
Sat Jul 8 20:33:48 CEST 2006


At 06:24 PM 7/8/2006 +1000, richardjones at optusnet.com.au wrote:
>With the new display formatting implemented yesterday, there is now a 
><table style="display: none"> in the package view page so that setuptools 
>may scrape "<th>Download URL" etc.

You can also do this:  <!-- <th>Download URL --> right before the 
link.  EasyInstall is very stupid and doesn't actually parse any HTML.
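
As a rough illustration of why the comment trick works, here is a minimal sketch of marker-based scraping. It is not EasyInstall's actual code: the helper name `find_link_after_marker`, the `HREF` pattern, and the example.com URL are all made up for this example. The only assumption taken from the message is that the page is searched as plain text for the marker string and the link that follows it.

```python
import re

# Hypothetical pattern for illustration: grab the target of an href attribute.
HREF = re.compile(r'href="([^"#]+)"')


def find_link_after_marker(page, marker="<th>Download URL"):
    """Return the first href that follows `marker` in `page`, or None.

    The page is treated as a plain string; no HTML parsing happens, so the
    marker is found even when it sits inside an HTML comment.
    """
    pos = page.find(marker)
    if pos == -1:
        return None  # marker absent: nothing to scrape
    match = HREF.search(page, pos)  # start looking right at the marker
    return match.group(1) if match else None


page = '<!-- <th>Download URL --> <a href="http://example.com/foo-1.0.tar.gz">foo</a>'
print(find_link_after_marker(page))
```

Because the search never looks at tag structure, a hidden table, a comment, or a visible header cell all behave the same way.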

The same thing is true for the "Index of Packages</title>" string, which 
just has to appear somewhere in the page if it's a multi-package list.
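
The multi-package check can be pictured the same way. This is only a sketch under the assumption that a plain substring test is all that happens; `INDEX_MARKER` and `looks_like_package_index` are hypothetical names, not anything from setuptools.

```python
# Hypothetical names for illustration only.
INDEX_MARKER = "Index of Packages</title>"


def looks_like_package_index(page):
    """True if the marker string appears anywhere in the page text."""
    return INDEX_MARKER in page


print(looks_like_package_index("<title>Index of Packages</title>"))
print(looks_like_package_index("<!-- Index of Packages</title> --><h1>foo</h1>"))
```

Both calls report a match, since the position of the marker in the page does not matter.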

While these strings obviously aren't as "clean" as using rel="" info in the 
links, they will work for now and allow you to do whatever you want with 
the visual appearance.

The only other visual dependency is on MD5 information, which is extracted 
using this pattern:

import re

# Groups: (1) download URL, (2) link text, (3) 32-digit hex MD5 digest.
PYPI_MD5 = re.compile(
    r'<a href="([^"#]+)">([^<]+)</a>\n\s+\(<a href="[^?]+\?:action=show_md5'
    r'&amp;digest=([0-9a-f]{32})">md5</a>\)'
)

However, if this pattern doesn't match, then EasyInstall will simply 
proceed without MD5 verification.
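
To make the fallback concrete, here is a hedged sketch of how the pattern above might be applied. The sample page, the example.com URLs, and the helpers `md5_digests` and `verify` are assumptions for illustration; EasyInstall's real code paths may differ. The pattern is repeated, written with raw strings, so the block runs on its own.

```python
import hashlib
import re

# Same pattern as in the message, written with raw strings.
PYPI_MD5 = re.compile(
    r'<a href="([^"#]+)">([^<]+)</a>\n\s+\(<a href="[^?]+\?:action=show_md5'
    r'&amp;digest=([0-9a-f]{32})">md5</a>\)'
)


def md5_digests(page):
    """Map each download URL to its MD5 digest, for links the pattern matches."""
    return {m.group(1): m.group(3) for m in PYPI_MD5.finditer(page)}


def verify(data, digest):
    """Check `data` against `digest`; with no digest, skip verification."""
    if digest is None:
        return True  # pattern did not match: proceed without an MD5 check
    return hashlib.md5(data).hexdigest() == digest


# Made-up page fragment in the layout the pattern expects.
page = (
    '<a href="http://example.com/foo-1.0.tar.gz">foo-1.0.tar.gz</a>\n'
    '    (<a href="http://example.com/pypi?:action=show_md5'
    '&amp;digest=0123456789abcdef0123456789abcdef">md5</a>)'
)

digests = md5_digests(page)
print(digests.get("http://example.com/foo-1.0.tar.gz"))
print(md5_digests("<p>restyled page with no md5 link</p>"))
```

Note that the pattern depends on the newline and whitespace between the download link and the "(md5)" link, so a layout change there silently turns verification off rather than breaking installs.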


