[Numpy-discussion] ANN: HDF5 for Python 1.1

Stephen Simmons mail at stevesimmons.com
Tue Feb 10 01:30:41 EST 2009


Hi Andrew,

Do you have any plans to support LZO compression in h5py?
 
I have lots of LZO-compressed datasets created with PyTables.
There's a real barrier to using both h5py and PyTables if the fast
decompressor options are just LZF on h5py and LZO on PyTables.

Many thanks
Stephen


Andrew Collette wrote:
> =====================================
> Announcing HDF5 for Python (h5py) 1.1
> =====================================
>
> What is h5py?
> -------------
>
> HDF5 for Python (h5py) is a general-purpose Python interface to the
> Hierarchical Data Format library, version 5.  HDF5 is a versatile,
> mature scientific software library designed for the fast, flexible
> storage of enormous amounts of data.
>
> >From a Python programmer's perspective, HDF5 provides a robust way to
> store data, organized by name in a tree-like fashion.  You can create
> datasets (arrays on disk) hundreds of gigabytes in size, and perform
> random-access I/O on desired sections.  Datasets are organized in a
> filesystem-like hierarchy using containers called "groups", and
> accesed using the tradional POSIX /path/to/resource syntax.
>
> In addition to providing interoperability with existing HDF5 datasets
> and platforms, h5py is a convienient way to store and retrieve
> arbitrary NumPy data and metadata.
>
>
> New features in 1.1
> -------------------
>
>   - A new compression filter based on the LZF library, which provides
>     transparent compression many times faster than the standard HDF5
>     GZIP filter.
>
>   - Efficient broadcasting using HDF5 hyperslab selections; for example,
>     you can write to a (2000 x 100 x 50) selection from a (100 x 50)
>     source array.
>
>   - Now supports the NumPy boolean type
>
>   - Auto-completion for IPython 0.9.X (contributed by Darren Dale)
>
>   - Installable via easy_install
>
>
> Standard features
> -----------------
>
>   - Supports storage of NumPy data of the following types:
>
>     * Integer/Unsigned Integer
>     * Float/Double
>     * Complex/Double Complex
>     * Compound ("recarray")
>     * Strings
>     * Boolean
>     * Array (as members of a compound type only)
>     * Void
>
>   - Random access to datasets using the standard NumPy slicing syntax,
>     including fancy indexing and point-based selection
>
>   - Transparent compression of datasets using GZIP, LZF or SZIP,
>     and error-detection using Fletcher32
>
>   - "Pythonic" interface supporting dictionary and NumPy-array metaphors
>     for the high-level HDF5 abstrations like groups and datasets
>
>   - A comprehensive, object-oriented wrapping of the HDF5 low-level C API
>     via Cython, in addition to the NumPy-like high-level interface.
>
>   - Supports many new features of HDF5 1.8, including recursive iteration
>     over entire files and in-library copy operations on the file tree
>
>   - Thread-safe
>
>
> Where to get it
> ---------------
>
> * Main website, documentation:  http://h5py.alfven.org
>
> * Downloads, bug tracker:       http://h5py.googlecode.com
>
>
> Requires
> --------
>
> * Linux, Mac OS-X or Windows
>
> * Python 2.5 (Windows), Python 2.5 or 2.6 (Linux/Mac OS-X)
>
> * NumPy 1.0.3 or later
>
> * HDF5 1.6.5 or later (including 1.8); HDF5 is included with
>   the Windows version.
>
>
> Thanks
> ------
>
> Thanks to D. Dale, E. Lawrence and other for their continued support
> and comments.  Also thanks to the Francesc Alted and the PyTables project,
> for inspiration and generously providing their code to the community. Thanks
> to everyone at the HDF Group for creating such a useful piece of software.
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>
>
>   




More information about the NumPy-Discussion mailing list