Mailman 3 June 2020 - NumPy-Discussion

`np.array()`, array-likes, nested sequences and subclasses
by Sebastian Berg June 18, 2020

June 18, 2020

Hi all, tl;dr: `np.array()` is somewhat ill-defined, also creating issues for Quantities. In a recent PR I am cementing, and slightly broadening, its definition. So we have to decide how we wish to handle code such as in the long run: np.array([array-like, array-like]) --- Traditionally, we have two meanings of "array-like" as understood by `np.array()` (In the text I use array-like for the second point here): 1. Nested sequences of scalars. 2. A single array-like object, … [View More]meaning a buffer-interface, an array subclass, a pandas dataframe (`__array__()`), etc. However, the boundaries between these are fuzzy, and over the years became more fuzzy. The reason is that a NumPy array (and many array- likes) are also nested sequences of scalars. I defined the current behaviour slightly clearer in my PR, but by that also subtly broadened it up [0]: 1. Any array-like embedded in the nested-sequences is converted to a NumPy array. [1] (Any array-like is never interpreted as a sequence) 2. Any array-like's elements will be elements of the output. We never enter array-likes recursively (including object arrays). 3. The `subok=True` parameter is implicitly ignored, unless the input is a single ndarray sublcass. Now to the issues at hand: * We should make sure those defintions are good, they mainly cement current behaviour, but if we want to roll back on features, we should do it now. * There are some issues around Quantity and masked arrays, because their "scalars" are (sometimes) 0-D arrays. And they currently rely on NumPy considering them to be scalars. This has its own set of long term issues [2]. For now, I can simply roll the changes to 0-D array behaviour back. But in the mid-to-long run, we have to make a decision, or perpetually live with array subclasses being subtly broken: 1. Define Quantity and Masked arrays as wrong. They must use a special DType, which consistently tells NumPy that the elements cannot simply be copied by converting the Quantity to an array. The up-side is, that it generalizes to N-D. 2. Independently, but partially addressing the Quantity issue, we have to decide what `np.array()` should actually do. A sequence containing array-likes, in most cases is better written using `np.stack()`, but due to the fuzzy boundaries, code like `np.array([dataframe, dataframe])` is probably common. We could try to deprecate though. The downsides to deprecation seem to me that I feel we have to reject viewing array-likes as sequences. To me doing that has its own set of issues. If just that `np.array([arraylike])` seems perfectly reasonable, but may be very slow. - Sebastian [0] It is hard to list how exactly it is broadened up, because the current behaviour has very subtle behaviours, such as actually iterating a `memoryview()`, which does always the same thing, but only works for 1-D memoryviews, and fails for both 0-D and N-D. [1] There are some subtleties which are not important here, such that I do anticipate the possibility of having array-likes which are considered scalars with respect to a given dtype, such as `np.array([poly], dtype=Polynomial)` where a poly object itself is an array-like. [2] Basically: np.array([0d_array], dtype=user_dtype) works, by ending up calling: res[0] = float(0d_array) # quantity.__float__ is used! which works nice for the typical float/int dtype, is tricky to get right for general dtypes (e.g. longdouble/clongdouble). This is a small issue now, but it could become a problem when more user-dtypes are defined. [View Less]

1 0

help translating Hindi into English
by Inessa Pawson June 18, 2020

June 18, 2020

The initial stage of the NumPy community survey project in partnership with the students and faculty from the Master’s program in Survey Methodology at the University of Michigan and the University of Maryland has been successfully completed. Currently, we are looking for a volunteer to help with the back translation of the Hindi version of the survey questionnaire (Hindi into English). If you are available, or you know someone who would be interested to help, please leave a comment here: … [View More]

2 2

NumPy Development Meeting Tomorrow - Triage Focus
by Sebastian Berg June 16, 2020

June 16, 2020

Hi all, Our bi-weekly triage-focused NumPy development meeting is tomorrow (Wednesday, June 17th) at 11 am Pacific Time (18:00 UTC). Everyone is invited to join in and edit the work-in-progress meeting topics and notes: https://hackmd.io/68i_JvOYQfy9ERiHgXMPvg I encourage everyone to notify us of issues or PRs that you feel should be prioritized or simply discussed briefly. Just comment on it so we can label it, or add your PR/issue to this weeks topics for discussion. Best regards Sebastian

1 0

ANN: SciPy 1.5.0rc2 -- please test
by Tyler Reddy June 13, 2020

June 13, 2020

Hi all, On behalf of the SciPy development team I'm pleased to announce the release candidate SciPy 1.5.0rc2. Please help us test this pre-release. Sources and binary wheels can be found at: https://pypi.org/project/scipy/ and at: https://github.com/scipy/scipy/releases/tag/v1.5.0rc2 One of a few ways to install the release candidate with pip: pip install scipy==1.5.0rc2 ========================== SciPy 1.5.0 Release Notes ========================== .. note:: Scipy 1.5.0 is not released … [View More]

1 0

Deprecating python type aliases (np.int, np.long, np.str, ...)
by Sebastian Berg June 11, 2020

June 11, 2020

Hi all, In the pull request: https://github.com/numpy/numpy/pull/14882 Eric proposes to deprecate the type aliases which NumPy imports into its main namespace (e.g. np.int, np.bool, see table below [1]). Right now there seems to be a consensus to move this forward and I plan on doing that, so this is a heads-up in case anyone has a differing opinion. The deprecation should not be very noisy as such, but I expect it will require many projects to update their code in the long run (although … [View More]

1 1

Armv8 server donation
by ChunLin Fang June 11, 2020

June 11, 2020

Hi, all: I noticed that the shippable CI always skipped after PR submitted , The reason why it's skip seems to be "No active nodes found in shared node pool "shippable_shared_aarch64"" Potential bugs may buried through out numpy without shippable CI. I happened to own an idle armv8 server that can donate to numpy community, please let me know if that can improve numpy's CI/CD environment , also need somebody help me set up the CI/CD environment on that server. Best wishes Fang ChunLin.… [View More]

3 2

NumPy Community Meeting Wednesday
by Sebastian Berg June 9, 2020

June 9, 2020

Hi all, There will be a NumPy Community meeting Wednesday May 27th at 1pm Pacific Time (20:00 UTC [0]). Everyone is invited and encouraged to join in and edit the work-in-progress meeting topics and notes at: https://hackmd.io/76o-IxCjQX2mOXO_wwkcpg?both Best wishes Sebastian

1 0

Re: [Numpy-discussion] Numpy Documentation: How-to content
by Ryan C. Cooper June 8, 2020

June 8, 2020

> This sounds fantastic. Great! > In what context would the students be creating the notebooks -- as > part of one of your existing ME courses, as a for-credit project, as a > supervised but non-credit project? These would be supervised projects either for work-study or credit. > What were your thoughts on submission workflow? You review initially, > then the student directly submits a PR? My plan was to mentor the initial idea and creation and help the students submit … [View More]

2 1

introducing autoreg and autoregnn
by rondall jones June 5, 2020

June 5, 2020

Hello! I have supported constrained solvers for linear matrix problems for about 10 years in C++, but have now switched to Python. I am going to submit a couple of new routines for linalg called autoreg(A,b) and autoregnn(A,b). They work just like lstsq(A,b) normally, but when they detect that the problem is dominated by noise they revert to an automatic regularization scheme that returns a better behaved result than one gets from lstsq. In addition, autoregnn enforces a nonnegativity … [View More]

3 2

Call for expertise: Blocked iteration
by Sebastian Berg June 5, 2020

June 5, 2020

Hi all, I am curious about exploring whether or not we could add simple blocked iteration to NumPy. It seems like a long standing small deficiency in NumPy that we do not support blocked iteration. I do not know how much speed gain we would actually have in real world code, but I assume some bad-memory-order copies could be drastically faster. Implementing blocked iteration for NumPy seems pretty complicated on first sight due to the complexity of the iterator and the fact that almost no-one … [View More]

1 0