[Numpy-discussion] Re: ENH: Introducing a pipe Method for Numpy arrays

15 Feb 2024

      Hi Oyibo,
...
I'm proposing the introduction of a `pipe` method for NumPy arrays to enhance their usability and expressiveness.
I think it is an interesting idea, but agree with Robert that it is
unlikely to fly on its own.  Part of the logic of even frowning on
methods like .mean() and .sum() is that ndarray is really a data
container, and should have methods related to that, as much as possible
independent of the meaning of those data (which is given by the dtype).

A bit more generally, your example is nice, but a pipe can have just one
input, while of course many operations require two or more.
...
- Optimization: While NumPy may not currently optimize chained
expressions, the introduction of pipe lays the groundwork for
potential future optimizations with lazy evaluation.
Optimization might indeed be made possible, though I would think that
for that one may be better off with something like dask.

That said, I've been playing with the ability to chain ufuncs to
optimize their execution, by applying the ufuncs in series on small
pieces of larger arrays, thus avoiding large temporaries (a bit like
numexpr but with the idea of defining a fast function rather than giving
an expression as a string); see https://github.com/mhvk/chain_ufunc

All the best,

Marten

[Numpy-discussion] Re: ENH: Introducing a pipe Method for Numpy arrays

Marten van Kerkwijk