[Distutils] python eggs and lxml
Phillip J. Eby
pje at telecommunity.com
Sat Sep 24 17:23:52 CEST 2005
At 11:13 AM 9/24/2005 +0200, Martijn Faassen wrote:
>I guess for Windows, I'd have make distutils run the configure script,
>then extract the dlls it produces and stuff them in the egg somehow. Any
>direction you'd point me towards for this?
You'll need to have your setup script determine programmatically that it's
running on Windows, and then add data files and 'eager_resources' to the
project accordingly. In other words, to include shared libraries that
aren't distutils-built extensions, you have to treat them as data files,
but you must also list them in the 'eager_resources' setup keyword, as
described here:
http://peak.telecommunity.com/DevCenter/setuptools#automatic-resource-extraction
so that the shared libraries will be automatically extracted to disk before
they're linked to by any C extensions.
You can do this on Unix platforms, too, of course, but the filenames will
naturally be different.
>In this case, on Linux, I'd want to run the configure script when the egg
>is installed instead of when it's created, and stuff the .so files in the
>same place the egg is being installed to.
Eggs are not source distributions - they're prebuilt binaries for a
particular platform. EasyInstall will find and use source distributions if
there's no binary for the system, so from a user point of view it certainly
happens when the egg is "installed", but that's only because the actual egg
is being *built* locally.
So, in any situation where you're not using the platform libraries, you'll
need to follow the same embedding steps as for Windows - i.e., build the
libraries and include them as data files.
If you're linking with the platform libraries, you don't need to include
them as data files.
It's best, however, if you not think of this as install vs. create. The
only installation steps eggs have are:
* adding them to sys.path
* creating local wrappers to run programs contained in them
Everything else occurs when the egg is built, it's just that on Unix-y
platforms you're more likely to be building the egg locally from source,
rather than downloading a pre-built egg, if it contains C extensions. For
Python-only projects, of course, eggs are cross-platform and ready-to-use
binaries.
>>Not all libraries can be bundled by source, of course. Sometimes you
>>really need to use whatever the "system version" is, for one reason or
>>another. Database clients, for example, are something you really really
>>want to use the local version for.
>
>Right, there are competing use cases here. What I'd like is an easy
>install for lxml that just works for people, without them having to worry
>about the right lxml2 versions being installed, etc. On Windows this means
>binaries, and on Linux this likely means it'll just compile upon install.
Or more precisely, it means compiling on egg build, regardless of
platform. It's just that for some platforms, you'll build the egg and
distribute it instead of the end user building their own.
Probably the simplest way to do this is to subclass setuptools 'build_ext'
command and extend run() to 'configure' and 'make' the libraries before
proceeding normally.
>Some classes of people, like distributors and some sysadmins, care about
>using the platform version of libxml2, and I'd also want to create an egg
>that allows you to install against the platform libraries. Would this be
>possible to be the same egg or would a different egg be needed? If a
>different egg, how does this work with the dependency system? I.e. these
>two eggs would be alternatives of each other dependency-wise.
An egg is not a project, or vice versa. What you mean is that you want
your project to be able to be built against different targets. The eggs
are what gets built. If you've only used EasyInstall as a user, not a
developer, this line is more blurry because EasyInstall downloads projects
and builds the eggs for you if the author did not provide any eggs usable
on your platform. But there is a distinction, and on the authoring side
you should focus on the project and its ability to be built for different
targets. Providing actual eggs is a convenience for your users, and does
not enter into your development process. It's strictly a
deployment/publishing step.
Anyway, since the audience that wants to do this are more advanced users,
and probably building your project individually rather than as part of a
multi-project EasyInstall, I think it's reasonable to just have some value
in a configuration file or environment variable or the standard distutils
setup.cfg that allows skipping the configure+make. For that matter, just
documenting how to patch the setup script to use the platform libraries
might be sufficient for that audience in the case of this kind of embedding.
More information about the Distutils-SIG
mailing list