[Numpy-discussion] StarCluster 0.91 - NumPy/SciPy Clusters on EC2

Justin Riley justin.t.riley at gmail.com
Thu May 27 13:53:31 EDT 2010


This is a one-time message to announce the availability of version 0.91
of the StarCluster package.

Why should you care? StarCluster allows you to create NumPy/SciPy
clusters configured with NFS-shared filesystems and the Sun Grid Engine
queueing system out of the box on Amazon's Elastic Compute Cloud (EC2).
The NumPy/SciPy installations have been compiled against a
custom-compiled ATLAS for the larger EC2 instances.

About
-----

There is an article about StarCluster on www.hpcinthecloud.com:

http://www.hpcinthecloud.com/features/StarCluster-Brings-HPC-to-the-Amazon-Cloud-94099324.html

There is also a screencast of installing, configuring, launching, and
terminating an HPC cluster on Amazon EC2:

http://www.hpcinthecloud.com/blogs/MITs-StarCluster-An-Update-with-Screencast-94599554.html

Project description from PyPI:

StarCluster is a utility for creating and managing scientific computing
clusters hosted on Amazon's Elastic Compute Cloud (EC2). StarCluster
utilizes Amazon's EC2 web service to create and destroy clusters of
Linux virtual machines on demand.

To get started, the user creates a simple configuration file with their
AWS account details and a few cluster preferences (e.g. number of
machines, machine type, ssh keypairs, etc). After creating the
configuration file and running StarCluster's "start" command, a cluster
of Linux machines configured with the Sun Grid Engine queuing system, an
NFS-shared /home directory, and OpenMPI with password-less ssh is
created and ready to go out-of-the-box. Running StarCluster's "stop"
command will shutdown the cluster and stop paying for service. This
allows the user to only pay for what they use.

StarCluster provides a Ubuntu-based Amazon Machine Image (AMI) in 32bit
and 64bit architectures. The AMI contains an optimized
NumPy/SciPy/Atlas/Blas/Lapack installation compiled for the larger
Amazon EC2 instance types. The AMI also comes with Sun Grid Engine (SGE)
and OpenMPI compiled with SGE support. The public AMI can easily be
customized by launching a single instance of the public AMI, installing
additional software on the instance, and then using

StarCluster can also utilize Amazon's Elastic Block Storage (EBS)
volumes to provide persistent data storage for a cluster. EBS volumes
allow you to store large amounts of data in the Amazon cloud and are
also easy to back-up and replicate in the cloud. StarCluster will mount
and NFS-share any volumes specified in the config. StarCluster's
"createvolume" command provides the ability to automatically create,
format, and partition new EBS volumes for use with StarCluster.


Download
--------
StarCluster is available on PyPI
(http://pypi.python.org/pypi/StarCluster) and also on the project's website:

http://web.mit.edu/starcluster

You will find the docs as well as links to the StarCluster mailing list
on the website.


New in this version:
--------------------

* support for launching and managing multiple clusters on EC2

* added "listclusters" command for showing all active clusters on EC2

* support for attaching and NFS-sharing multiple EBS volumes

* added createimage and createvolume commands for easily creating new
AMIs and EBS volumes for use with StarCluster

* experimental support for launching clusters using spot instances

* added support for StarCluster "plugins" that provide the ability to
perform additional configuration/setup routines on top of StarCluster's
default cluster configuration

* added "listpublic" command for listing all available public StarCluser
AMIs that can be used with StarCluster

* bash/zsh command line completion for StarCluster's command line interface






More information about the NumPy-Discussion mailing list