From josh.craig.wilson at gmail.com  Sun Nov  1 09:37:39 2020
From: josh.craig.wilson at gmail.com (Joshua Wilson)
Date: Sun, 1 Nov 2020 06:37:39 -0800
Subject: [Numpy-discussion] Ndarray static typing: Order of generic types
In-Reply-To: 
References: 
Message-ID: 

> Just to speak for myself, I don't think the precise choice matters very
> much. There are arguments for consistency both ways.

I agree with this. In the absence of strong theoretical considerations I'd
fall back to a practical one: we can make ndarray generic over dtype
_right now_, while for shape we will need to wait 1+ years for the
variadic type variable PEP to settle etc. To me that suggests:

- Do ndarray[DType] now
- When the shape stuff is ready, do ndarray[DType, ShapeStuff] (or however
  ShapeStuff ends up being spelled)
- Write a mypy plugin that rewrites ndarray[DType] to
  ndarray[DType, AnyShape] (or whatever) for backwards compatibility

On Thu, Oct 29, 2020 at 1:37 PM Stephan Hoyer wrote:
>
> On Wed, Oct 28, 2020 at 2:44 PM bas van beek wrote:
>>
>> Hey all,
>>
>> With the recent merging of numpy/numpy#16759 we're at the point where
>> `ndarray` can be made generic w.r.t. its dtype and shape.
>>
>> An open question which still remains is the order in which these two
>> parameters should appear (numpy/numpy#16547):
>>
>> - `ndarray[Dtype, Shape]`
>> - `ndarray[Shape, Dtype]`
>
> Hi Bas,
>
> Thanks for driving this forward!
>
> Just to speak for myself, I don't think the precise choice matters very
> much. There are arguments for consistency both ways. In the end Dtype
> and Shape are different enough that I doubt it will be a point of
> confusion.
>
> Also, I would guess many users will define their own type aliases, so
> can write something more succinct like Float64[shape] rather than
> ndarray[float64, shape]. We might even consider including some of these
> in numpy.typing.
>
> Cheers,
> Stephan
>
>> There has been some discussion about this question in issue 16547, but
>> a consensus has not yet been reached. Most people seem to slightly
>> prefer one option over the other.
>>
>> Are there any further thoughts on this subject?
>>
>> Regards,
>> Bas van Beek
>>
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at python.org
>> https://mail.python.org/mailman/listinfo/numpy-discussion
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From mark.harfouche at gmail.com  Sun Nov  1 18:27:30 2020
From: mark.harfouche at gmail.com (Mark Harfouche)
Date: Sun, 1 Nov 2020 18:27:30 -0500
Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks
In-Reply-To: 
References: 
Message-ID: 

I know it seems silly, but would an amendment to NEP29 be reasonable?

Many downstream packages look to numpy to understand what versions should
be supported, and NEP29 gave some good guidance. That said, if it is worth
ignoring, or revisiting, some clarity on how to apply NEP29 given recent
developments would be appreciated.

Best,

Mark

On Sat, Oct 31, 2020 at 8:24 AM Ralf Gommers wrote:

>
>
> On Thu, Oct 29, 2020 at 2:25 PM Charles R Harris <
> charlesr.harris at gmail.com> wrote:
>
>> Hi All,
>>
>> Time to start planning for the 1.20.x branch. These are my thoughts at
>> the moment:
>>
>> - Keep support for Python 3.6. Python 3.7 came out in June 2018,
>>   which seems too recent to be our oldest supported version.
>> - Drop Python 3.6 for 1.21.x, that will make the oldest supported
>>   version about three years old.
>> - Drop manylinux1 for 1.21.x. It would be nice to drop earlier, but
>>   manylinux2010 is pretty recent.
>>
>> There were 33 wheels in the 1.19.3 release, I think we can live with that
>> for 1.20.x. I'm more worried about our tools aging out. After Python has
>> settled into its yearly release cycle, I think we will end up supporting
>> the latest 4 versions.
>>
>> Thoughts?
>>
>
> Seems reasonable to me.
>
> Cheers,
> Ralf
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From currurant at gmail.com  Sun Nov  1 18:54:46 2020
From: currurant at gmail.com (Currurant)
Date: Sun, 1 Nov 2020 16:54:46 -0700 (MST)
Subject: [Numpy-discussion] Efficient way to draw multinomial distribution random samples
Message-ID: <1604274886591-0.post@n7.nabble.com>

I realized that neither numpy.random.multinomial nor rng.multinomial has
the ability to draw from different multinomial distributions at the same
time, like what MATLAB mnrnd() does here:

https://www.mathworks.com/help/stats/mnrnd.html

Also, I have asked this question on Stack Overflow:

https://stackoverflow.com/questions/64529620/is-there-an-efficient-way-to-generate-multinomial-random-variables-in-parallel?noredirect=1#comment114131565_64529620

It seems like this is something good to add to numpy.random, since it
would be much faster than using loops when you have many multinomial
distributions to draw from.

--
Sent from: http://numpy-discussion.10968.n7.nabble.com/

From charlesr.harris at gmail.com  Sun Nov  1 19:50:59 2020
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sun, 1 Nov 2020 17:50:59 -0700
Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks
In-Reply-To: 
References: 
Message-ID: 

On Sun, Nov 1, 2020 at 4:28 PM Mark Harfouche wrote:

> I know it seems silly, but would an amendment to NEP29 be reasonable?
>
> Many downstream packages look to numpy to understand what versions should
> be supported, and NEP29 gave some good guidance. That said, if it is worth
> ignoring, or revisiting, some clarity on how to apply NEP29 given recent
> developments would be appreciated.
>
> Best,
>
> Mark
>
Do you think the proposal is not in compliance? There is no requirement
that we drop anything more than 42 months old, it is just recommended. The
change in the Python release cycle has created some difficulty. With the
yearly cycle, 4 yearly Python releases will cover 3-4 years, which seems
reasonable and we can probably drop to 3 releases towards the end, but
with 3.7 coming 18 months after 3.6, four releases is on the long side,
and three releases on the short side, so keeping 3.6 is the conservative
choice. Once the yearly cycle sets in I think we will be fine.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From kevin.k.sheppard at gmail.com  Sun Nov  1 19:58:27 2020
From: kevin.k.sheppard at gmail.com (Kevin Sheppard)
Date: Mon, 2 Nov 2020 00:58:27 +0000
Subject: [Numpy-discussion] Efficient way to draw multinomial distribution random samples
In-Reply-To: <1604274886591-0.post@n7.nabble.com>
References: <1604274886591-0.post@n7.nabble.com>
Message-ID: 

This is in the pending PR. Hopefully out in 1.20.
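In the meantime, looping over the probability vectors with the existing
`Generator.multinomial` API is the usual workaround. A minimal sketch of
that loop (the trial count and probability rows below are made up for
illustration):

    import numpy as np

    rng = np.random.default_rng(12345)

    # Hypothetical inputs: one trial count and a (2, 3) stack of
    # probability vectors, mirroring MATLAB's mnrnd(n, P).
    n = 100
    pvals = np.array([[0.2, 0.3, 0.5],
                      [0.6, 0.3, 0.1]])

    # Workaround: one multinomial draw per probability vector.
    counts = np.stack([rng.multinomial(n, p) for p in pvals])
    print(counts.shape)        # (2, 3)
    print(counts.sum(axis=1))  # [100 100], one total per row

The pending PR should make this per-row Python loop unnecessary.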
Kevin On Sun, Nov 1, 2020, 23:55 Currurant wrote: > I realized that neither numpy.random.multinomial nor rng.multinomial has > the > ability to draw from different multinomial distributions at the same time > like what MATLAB mnrnd() does here: > > https://www.mathworks.com/help/stats/mnrnd.html > > Also, I have asked this question on StackOverFlow: > > > https://stackoverflow.com/questions/64529620/is-there-an-efficient-way-to-generate-multinomial-random-variables-in-parallel?noredirect=1#comment114131565_64529620 > > It seems like this is something good to add to numpy.random, since it would > be much more faster when you have many multinomial distributions to draw > from---using loops. > > > > -- > Sent from: http://numpy-discussion.10968.n7.nabble.com/ > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From mark.harfouche at gmail.com Sun Nov 1 20:47:25 2020 From: mark.harfouche at gmail.com (Mark Harfouche) Date: Sun, 1 Nov 2020 20:47:25 -0500 Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks In-Reply-To: References: Message-ID: > > > Do you think the proposal is not in compliance? There is no requirement > that we drop anything more than 42 months old, it is just recommended. The > change in the Python release cycle has created some difficulty. With the > yearly cycle, 4 python yearly releases will cover 3-4 years, which seems > reasonable and we can probably drop to 3 releases towards the end, but with > 3.7 coming 18 months after 3.6, four releases is on the long side, and > three releases on the short side, so keeping 3.6 is the conservative > choice. Once the yearly cycle sets in I think we will be fine. > > Chuck > I believe that it really helps to "lead by example". I don't mean to reference threads that you have all participated in, but the discussion in: https://mail.python.org/pipermail/scipy-dev/2020-August/024336.html Makes it clear to me at least, that downstream will follow the example that numpy sets. At the time of writing, it was anticipated that Python 3.7, 3.8, and maybe 3.9 would exist in Nov 1st. The support table https://numpy.org/neps/nep-0029-deprecation_policy.html#support-table suggests that any release July 23 should only support 3.7. Barring COVID delays, it seems natural that in Nov 2020, support for Python 3.6 be dropped or that the NEP be revised. These decisions are hard, and take up alot of mental capacity, if the support window needs revisiting, that is fine, it just really helps to be able to point to a document (which is what NEP29 seemed to do). -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sun Nov 1 21:03:38 2020 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 1 Nov 2020 19:03:38 -0700 Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks In-Reply-To: References: Message-ID: On Sun, Nov 1, 2020 at 6:48 PM Mark Harfouche wrote: > >> Do you think the proposal is not in compliance? There is no requirement >> that we drop anything more than 42 months old, it is just recommended. The >> change in the Python release cycle has created some difficulty. 
With the >> yearly cycle, 4 python yearly releases will cover 3-4 years, which seems >> reasonable and we can probably drop to 3 releases towards the end, but with >> 3.7 coming 18 months after 3.6, four releases is on the long side, and >> three releases on the short side, so keeping 3.6 is the conservative >> choice. Once the yearly cycle sets in I think we will be fine. >> >> Chuck >> > > I believe that it really helps to "lead by example". > > I don't mean to reference threads that you have all participated in, but > the discussion in: > https://mail.python.org/pipermail/scipy-dev/2020-August/024336.html > > Makes it clear to me at least, that downstream will follow the example > that numpy sets. > > At the time of writing, it was anticipated that Python 3.7, 3.8, and maybe > 3.9 would exist in Nov 1st. > The support table > https://numpy.org/neps/nep-0029-deprecation_policy.html#support-table > suggests that any release July 23 should only support 3.7. > > Barring COVID delays, it seems natural that in Nov 2020, support for > Python 3.6 be dropped or that the NEP be revised. > > These decisions are hard, and take up alot of mental capacity, if the > support window needs revisiting, that is fine, it just really helps to be > able to point to a document (which is what NEP29 seemed to do). > > The problem is that if we drop 3.6 the oldest version of Python will only be 30 months old, not 36. Dropping 3.6 for 1.20.x will make it 36 months, which is the recommended minimum coverage. I made sure that the language did not preclude longer support periods in any case. It would be helpful here if more people would comment, I would be happy to go with the shorter period if a majority of downstream projects want to go that way. It's not that I love 3.6, but there is no compelling reason to drop it, as there was for 3.5, at least that I am aware of. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From jeffreback at gmail.com Sun Nov 1 21:07:52 2020 From: jeffreback at gmail.com (Jeff Reback) Date: Sun, 1 Nov 2020 21:07:52 -0500 Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks In-Reply-To: References: Message-ID: pandas has already dropped 3.6 support in our coming 1.2 release (nov 2020); 1.1.x supports 3.6 > On Nov 1, 2020, at 9:04 PM, Charles R Harris wrote: > > ? > > > On Sun, Nov 1, 2020 at 6:48 PM Mark Harfouche wrote: >>> >>> Do you think the proposal is not in compliance? There is no requirement that we drop anything more than 42 months old, it is just recommended. The change in the Python release cycle has created some difficulty. With the yearly cycle, 4 python yearly releases will cover 3-4 years, which seems reasonable and we can probably drop to 3 releases towards the end, but with 3.7 coming 18 months after 3.6, four releases is on the long side, and three releases on the short side, so keeping 3.6 is the conservative choice. Once the yearly cycle sets in I think we will be fine. >>> >>> Chuck >> >> I believe that it really helps to "lead by example". >> >> I don't mean to reference threads that you have all participated in, but the discussion in: >> https://mail.python.org/pipermail/scipy-dev/2020-August/024336.html >> >> Makes it clear to me at least, that downstream will follow the example that numpy sets. >> >> At the time of writing, it was anticipated that Python 3.7, 3.8, and maybe 3.9 would exist in Nov 1st. 
>> The support table https://numpy.org/neps/nep-0029-deprecation_policy.html#support-table >> suggests that any release July 23 should only support 3.7. >> >> Barring COVID delays, it seems natural that in Nov 2020, support for Python 3.6 be dropped or that the NEP be revised. >> >> These decisions are hard, and take up alot of mental capacity, if the support window needs revisiting, that is fine, it just really helps to be able to point to a document (which is what NEP29 seemed to do). >> > > The problem is that if we drop 3.6 the oldest version of Python will only be 30 months old, not 36. Dropping 3.6 for 1.20.x will make it 36 months, which is the recommended minimum coverage. I made sure that the language did not preclude longer support periods in any case. > > It would be helpful here if more people would comment, I would be happy to go with the shorter period if a majority of downstream projects want to go that way. It's not that I love 3.6, but there is no compelling reason to drop it, as there was for 3.5, at least that I am aware of. > > Chuck > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From millman at berkeley.edu Sun Nov 1 21:44:03 2020 From: millman at berkeley.edu (Jarrod Millman) Date: Sun, 1 Nov 2020 18:44:03 -0800 Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks In-Reply-To: References: Message-ID: NetworkX is currently planning to support 3.6 for our coming 2.6 release (dec 2020) and 3.0 release (early 2021). We had originally thought about following NEP 29. But I assumed it had been abandoned, since neither NumPy nor SciPy dropped Python 3.6 on Jun 23, 2020. NetworkX is likely to continue supporting whatever versions of Python both NumPy and SciPy support regardless of what NEP 29 says. I wouldn't be surprised if other projects do the same thing. Jarrod From millman at berkeley.edu Sun Nov 1 21:54:53 2020 From: millman at berkeley.edu (Jarrod Millman) Date: Sun, 1 Nov 2020 18:54:53 -0800 Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks In-Reply-To: References: Message-ID: I also misunderstood the purpose of the NEP. I assumed it was intended to encourage projects to drop old versions of Python. Other people have viewed the NEP similarly: https://github.com/networkx/networkx/issues/4027 If the intention of the NEP is to specify that projects not drop old version of Python too early, I don't think it is obvious from the NEP. It would be helpful if you added a simple motivation statement near the top of the document. Something like: ## Motivation and Scope The purpose of the NEP is to ensure projects in the scientific Python ecosystem don't drop support for old version of Python and NumPy too soon. On Sun, Nov 1, 2020 at 6:44 PM Jarrod Millman wrote: > > NetworkX is currently planning to support 3.6 for our coming 2.6 > release (dec 2020) and 3.0 release (early 2021). We had originally > thought about following NEP 29. But I assumed it had been abandoned, > since neither NumPy nor SciPy dropped Python 3.6 on Jun 23, 2020. > > NetworkX is likely to continue supporting whatever versions of Python > both NumPy and SciPy support regardless of what NEP 29 says. I > wouldn't be surprised if other projects do the same thing. 
>
> Jarrod

From stefanv at berkeley.edu  Sun Nov  1 22:47:04 2020
From: stefanv at berkeley.edu (Stefan van der Walt)
Date: Sun, 01 Nov 2020 19:47:04 -0800
Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks
In-Reply-To: 
References: 
Message-ID: 

On Sun, Nov 1, 2020, at 18:54, Jarrod Millman wrote:
> I also misunderstood the purpose of the NEP. I assumed it was
> intended to encourage projects to drop old versions of Python. Other
> people have viewed the NEP similarly:
> https://github.com/networkx/networkx/issues/4027

Of all the packages, it makes sense for NumPy to behave most
conservatively with deprecations. The NEP suggests allowable support
periods, but as far as I recall does not enforce minimal support.

Stephan Hoyer had a good recommendation on how we can clarify the NEP to
be easier to intuit. Stephan, shall we make an amendment to the NEP with
your idea?

Best regards,
Stéfan

From kevin.k.sheppard at gmail.com  Mon Nov  2 02:12:34 2020
From: kevin.k.sheppard at gmail.com (Kevin Sheppard)
Date: Mon, 2 Nov 2020 07:12:34 +0000
Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks
In-Reply-To: 
References: ,
Message-ID: 

An HTML attachment was scrubbed...
URL: 

From shoyer at gmail.com  Mon Nov  2 02:47:18 2020
From: shoyer at gmail.com (Stephan Hoyer)
Date: Sun, 1 Nov 2020 23:47:18 -0800
Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks
In-Reply-To: 
References: 
Message-ID: 

On Sun, Nov 1, 2020 at 7:47 PM Stefan van der Walt 
wrote:

> On Sun, Nov 1, 2020, at 18:54, Jarrod Millman wrote:
> > I also misunderstood the purpose of the NEP. I assumed it was
> > intended to encourage projects to drop old versions of Python. Other
> > people have viewed the NEP similarly:
> > https://github.com/networkx/networkx/issues/4027
>
> Of all the packages, it makes sense for NumPy to behave most
> conservatively with deprecations. The NEP suggests allowable support
> periods, but as far as I recall does not enforce minimal support.
>
> Stephan Hoyer had a good recommendation on how we can clarify the NEP to
> be easier to intuit. Stephan, shall we make an amendment to the NEP with
> your idea?
>

For reference, here was my proposed revision:
https://github.com/numpy/numpy/pull/14086#issuecomment-649287648

Specifically, rather than saying "the latest release of NumPy supports all
versions of Python released in the 42 months before NumPy's release", it
says "NumPy will only require versions of Python that were released more
than 24 months ago". In practice, this works out to the same thing (at
least given Python's old 18 month release cycle).

This changes the definition of the support window (in a way that I think
is clearer and that works better for infrequent releases), but there is
still the question of how large that window should be for NumPy. My
personal opinion is that somewhere in the range of 24-36 months would be
appropriate.

> Best regards,
> Stéfan
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ralf.gommers at gmail.com  Mon Nov  2 03:01:38 2020
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Mon, 2 Nov 2020 08:01:38 +0000
Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks
In-Reply-To: 
References: 
Message-ID: 

On Mon, Nov 2, 2020 at 7:47 AM Stephan Hoyer wrote:

> On Sun, Nov 1, 2020 at 7:47 PM Stefan van der Walt 
> wrote:
>
>> On Sun, Nov 1, 2020, at 18:54, Jarrod Millman wrote:
>> > I also misunderstood the purpose of the NEP. I assumed it was
>> > intended to encourage projects to drop old versions of Python.
> > It was. It is. I think the NEP is very clear on that. Honestly we should just follow the NEP and drop 3.6 now for both NumPy and SciPy, I just am tired of arguing for it - which the NEP should have prevented being necessary, and I don't want to do again right now, so this will probably be my last email on this thread. Other >> > people have viewed the NEP similarly: >> > https://github.com/networkx/networkx/issues/4027 >> >> Of all the packages, it makes sense for NumPy to behave most >> conservatively with depreciations. The NEP suggests allowable support >> periods, but as far as I recall does not enforce minimal support. >> > It doesn't *enforce* it, but the recommendation is very clear. It would be good to follow it. >> Stephan Hoyer had a good recommendation on how we can clarify the NEP to >> be easier to intuit. Stephan, shall we make an ammendment to the NEP with >> your idea? >> > > For reference, here was my proposed revision: > https://github.com/numpy/numpy/pull/14086#issuecomment-649287648 > > Specifically, rather than saying "the latest release of NumPy supports all > versions of Python released in the 42 months before NumPy's release", it > says "NumPy will only require versions of Python that were released more > than 24 months ago". In practice, this works out to the same thing (at > least given Python's old 18 month release cycle). > > This changes the definition of the support window (in a way that I think > is clearer and that works better for infrequent releases), but there is > still the question of how large that window should be for NumPy. > I'm not sure it's clearer, the current NEP has a nice graphic and literally says "a project with a major or minor version release in November 2020 should support Python 3.7 and newer."). However happy to adopt it if it makes others happy - in the end it comes down to the same thing: it's recommended to drop Python 3.6 now. My personal opinion is that somewhere in the range of 24-36 months would be > appropriate. > +1 Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From deak.andris at gmail.com Mon Nov 2 07:22:06 2020 From: deak.andris at gmail.com (Andras Deak) Date: Mon, 2 Nov 2020 13:22:06 +0100 Subject: [Numpy-discussion] Do not understand what f2py is reporting In-Reply-To: References: Message-ID: On Sun, Nov 1, 2020 at 2:33 AM Samuel Dupree wrote: > > I'm attempting to build wrappers around two Fortran routines. One is a > Fortran 77 subroutine (see file gravity_derivs.f) that calls a Fortran > 90 package that performs automatic differentiation (see file > auto_deriv.f90). > > I'm running he Anaconda distribution for Python 3.7.6 on a Mac Pro > (2019) under Mac OS X Catalina (ver. 10.15.6). The version of NumPy I'm > running is 1.18.3. The commands I used to attempt the build are > contained in the file auto_deriv_build. The messages output by f2py are > captured in the file auto_derivs_build_report.txt. > > I don't understand the cause behind the error messages I got, so any > advice would be welcomed. > > Sam Dupree. Hi Sam, I've got a partial solution. I haven't used f2py yet but at least the error from your first `f2py` call seems straightforward. Near the top: Line #119 in gravity_derivs.f:" integer * 4 degree" updatevars: no name pattern found for entity='*4degree'. Skipping. This shows that the fortran code gets parsed as `(integer) (*4degree)`. That can't be right. 
There might be a way to tell f2py to do this right, but anyway I could make your code compile by replacing every such declaration with `integer * 4 :: degree` etc (i.e. adding double colons everywhere). Once that's fixed your first f2py call raises another error: Fatal Error: Cannot open module file ?deriv_class.mod? for reading at (1): No such file or directory I could generate these mod files by manually running `gfortran -c auto_deriv.f90`. After that the .mod files appear and your first `f2py` call will succed. You can now `import gravity_derivs`, but of course this will lead to an error because `auto_deriv` is not available in python. Unfortunately your _second_` f2py` call also dies on `auto_deriv.f90`, with such offending lines: In: :auto_deriv:auto_deriv.f90:ad_auxiliary get_parameters: got "invalid syntax (, line 1)" on '(/((i, i=j,n), j=1,n)/)' I'm guessing that again f2py can't parse that syntax. My hunch is that if you can get f2py to work with `auto_deriv.f90` you should first run that. This should hopefully generate the .mod files after which the second call to `f2py` with `gravity_derivs.f` should work. If `f2py` doesn't generate the .mod files you could at worst run your fortran compiler yourself between the two calls to `f2py`. Cheers, Andr?s > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From jni at fastmail.com Mon Nov 2 07:49:45 2020 From: jni at fastmail.com (Juan Nunez-Iglesias) Date: Mon, 02 Nov 2020 06:49:45 -0600 Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks In-Reply-To: References: Message-ID: <65b3fe9e-6f6a-4c5e-943e-e5747076eb0d@www.fastmail.com> I like Ralf's email, and most of all I agree that the existing wording is clearer. My view on the NEP is that it does not mandate dropping support, but encourage it. In my projects I would drop it if I had use for Python 3.7+ features. It so happens that we want to use PEP-593 so we were grateful for NEP-29 giving us "permission" to drop 3.6. I would suggest that 3.6 be dropped immediately if there are any open PRs that would benefit from it, or code cleanups that it would enable. The point of the NEP is to short-circuit discussion about whether it's "worth" dropping 3.6. If it's valuable at all, do it. Thanks all, Juan. On Mon, 2 Nov 2020, at 2:01 AM, Ralf Gommers wrote: > > > On Mon, Nov 2, 2020 at 7:47 AM Stephan Hoyer wrote: >> On Sun, Nov 1, 2020 at 7:47 PM Stefan van der Walt wrote: >>> On Sun, Nov 1, 2020, at 18:54, Jarrod Millman wrote: >>> > I also misunderstood the purpose of the NEP. I assumed it was >>> > intended to encourage projects to drop old versions of Python. > > It was. It is. I think the NEP is very clear on that. Honestly we should just follow the NEP and drop 3.6 now for both NumPy and SciPy, I just am tired of arguing for it - which the NEP should have prevented being necessary, and I don't want to do again right now, so this will probably be my last email on this thread. > > >>> Other >>> > people have viewed the NEP similarly: >>> > https://github.com/networkx/networkx/issues/4027 >>> >>> Of all the packages, it makes sense for NumPy to behave most conservatively with depreciations. The NEP suggests allowable support periods, but as far as I recall does not enforce minimal support. > > It doesn't *enforce* it, but the recommendation is very clear. It would be good to follow it. 
> >>> >>> Stephan Hoyer had a good recommendation on how we can clarify the NEP to be easier to intuit. Stephan, shall we make an ammendment to the NEP with your idea? >> >> For reference, here was my proposed revision: >> https://github.com/numpy/numpy/pull/14086#issuecomment-649287648 >> Specifically, rather than saying "the latest release of NumPy supports all versions of Python released in the 42 months before NumPy's release", it says "NumPy will only require versions of Python that were released more than 24 months ago". In practice, this works out to the same thing (at least given Python's old 18 month release cycle). >> >> This changes the definition of the support window (in a way that I think is clearer and that works better for infrequent releases), but there is still the question of how large that window should be for NumPy. > > I'm not sure it's clearer, the current NEP has a nice graphic and literally says "a project with a major or minor version release in November 2020 should support Python 3.7 and newer."). However happy to adopt it if it makes others happy - in the end it comes down to the same thing: it's recommended to drop Python 3.6 now. > >> My personal opinion is that somewhere in the range of 24-36 months would be appropriate. > > +1 > > Cheers, > Ralf > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Nov 2 11:37:28 2020 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 2 Nov 2020 09:37:28 -0700 Subject: [Numpy-discussion] NumPy 1.19.4 release Message-ID: Hi All, On behalf of the NumPy team I am pleased to announce the release of NumPy 1.19.4. NumPy 1.19.4 is a quick release to revert the OpenBLAS library version. It was hoped that the 0.3.12 OpenBLAS version used in 1.19.3 would work around the Microsoft fmod bug, but problems in some docker environments turned up. Instead, 1.19.4 will use the older library and run a sanity check on import, raising an error if the problem is detected. Microsoft is aware of the problem and has promised a fix, users should upgrade when it becomes available. This release supports Python 3.6-3.9. NumPy Wheels for this release can be downloaded from PyPI , source archives, release notes, and wheel hashes are available on Github . Linux users will need pip >= 0.19.3 in order to install manylinux2010 and manylinux2014 wheels. *Contributors* A total of 1 people contributed to this release. People with a "+" by their names contributed a patch for the first time. - Charles Harris *Pull requests merged* A total of 2 pull requests were merged for this release. - #17679: MAINT: Add check for Windows 10 version 2004 bug. - #17680: REV: Revert OpenBLAS to 1.19.2 version for 1.19.4 Cheers, Charles Harris -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Mon Nov 2 13:48:46 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 02 Nov 2020 12:48:46 -0600 Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks In-Reply-To: <65b3fe9e-6f6a-4c5e-943e-e5747076eb0d@www.fastmail.com> References: <65b3fe9e-6f6a-4c5e-943e-e5747076eb0d@www.fastmail.com> Message-ID: On Mon, 2020-11-02 at 06:49 -0600, Juan Nunez-Iglesias wrote: > I like Ralf's email, and most of all I agree that the existing > wording is clearer. 
> > My view on the NEP is that it does not mandate dropping support, but > encourage it. In my projects I would drop it if I had use for Python > 3.7+ features. It so happens that we want to use PEP-593 so we were > grateful for NEP-29 giving us "permission" to drop 3.6. > > I would suggest that 3.6 be dropped immediately if there are any open > PRs that would benefit from it, or code cleanups that it would > enable. The point of the NEP is to short-circuit discussion about > whether it's "worth" dropping 3.6. If it's valuable at all, do it. > Probably the only thing that requires 3.7 in NumPy at this time is the module level `__getattr__`, which is used only for deprecations (and to make the financial removal slightly more gentle). I am not sure if PyPy already has stable support for 3.7 yet? Although PyPy is maybe not a big priority. We don't have to support 3.6 and I don't care if we do. Until this discussion my assumption was we would probably drop it. But, current master is tested against 3.6, so the main work seems release related. If Chuck thinks that is no hassle I don't mind if NumPy is a bit more conservative than NEP 29. Or is there a danger of setting a precedent where projects are wrongly expected to keep support just because NumPy still has it, so that NumPy not being conservative actually helps everyone? - Sebastian > Thanks all, > > Juan. > > On Mon, 2 Nov 2020, at 2:01 AM, Ralf Gommers wrote: > > > > On Mon, Nov 2, 2020 at 7:47 AM Stephan Hoyer > > wrote: > > > On Sun, Nov 1, 2020 at 7:47 PM Stefan van der Walt < > > > stefanv at berkeley.edu> wrote: > > > > On Sun, Nov 1, 2020, at 18:54, Jarrod Millman wrote: > > > > > I also misunderstood the purpose of the NEP. I assumed it > > > > > was > > > > > intended to encourage projects to drop old versions of > > > > > Python. > > > > It was. It is. I think the NEP is very clear on that. Honestly we > > should just follow the NEP and drop 3.6 now for both NumPy and > > SciPy, I just am tired of arguing for it - which the NEP should > > have prevented being necessary, and I don't want to do again right > > now, so this will probably be my last email on this thread. > > > > > > > > Other > > > > > people have viewed the NEP similarly: > > > > > https://github.com/networkx/networkx/issues/4027 > > > > > > > > Of all the packages, it makes sense for NumPy to behave most > > > > conservatively with depreciations. The NEP suggests allowable > > > > support periods, but as far as I recall does not enforce > > > > minimal support. > > > > It doesn't *enforce* it, but the recommendation is very clear. It > > would be good to follow it. > > > > > > Stephan Hoyer had a good recommendation on how we can clarify > > > > the NEP to be easier to intuit. Stephan, shall we make an > > > > ammendment to the NEP with your idea? > > > > > > For reference, here was my proposed revision: > > > https://github.com/numpy/numpy/pull/14086#issuecomment-649287648 > > > Specifically, rather than saying "the latest release of NumPy > > > supports all versions of Python released in the 42 months before > > > NumPy's release", it says "NumPy will only require versions of > > > Python that were released more than 24 months ago". In practice, > > > this works out to the same thing (at least given Python's old 18 > > > month release cycle). 
> > > > > > This changes the definition of the support window (in a way that > > > I think is clearer and that works better for infrequent > > > releases), but there is still the question of how large that > > > window should be for NumPy. > > > > I'm not sure it's clearer, the current NEP has a nice graphic and > > literally says "a project with a major or minor version release in > > November 2020 should support Python 3.7 and newer."). However happy > > to adopt it if it makes others happy - in the end it comes down to > > the same thing: it's recommended to drop Python 3.6 now. > > > > > My personal opinion is that somewhere in the range of 24-36 > > > months would be appropriate. > > > > +1 > > > > Cheers, > > Ralf > > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From sdupree at speakeasy.net Mon Nov 2 23:26:00 2020 From: sdupree at speakeasy.net (Samuel Dupree) Date: Mon, 2 Nov 2020 23:26:00 -0500 Subject: [Numpy-discussion] Do not understand what f2py is reporting In-Reply-To: References: Message-ID: <7426aa38-b667-36ec-46cd-66c6935eb928@speakeasy.net> Andras, Thank you for respond to my post. I sincerely appreciate it. Following your advice, I replaced "integer * 4" with "integer" and I was able to generate the signature files for gravity_derivs.f. The problem now is generating the signature file for auto_deriv.f90. I agree that f2py has a problem with In: :auto_deriv:auto_deriv.f90:ad_auxiliary get_parameters: got "invalid syntax (, line 1)" on '(/((i,i=j,n), j=1,n)/)' I'm not sure I understand why f2py has a problem with this syntax. Is there documentation that talks to what Fortran77, Fortran 90/95 syntax f2py will and will not accept? Sam Dupree. On November/02/2020 07:22:06, Andras Deak wrote: > On Sun, Nov 1, 2020 at 2:33 AM Samuel Dupree wrote: >> I'm attempting to build wrappers around two Fortran routines. One is a >> Fortran 77 subroutine (see file gravity_derivs.f) that calls a Fortran >> 90 package that performs automatic differentiation (see file >> auto_deriv.f90). >> >> I'm running he Anaconda distribution for Python 3.7.6 on a Mac Pro >> (2019) under Mac OS X Catalina (ver. 10.15.6). The version of NumPy I'm >> running is 1.18.3. The commands I used to attempt the build are >> contained in the file auto_deriv_build. The messages output by f2py are >> captured in the file auto_derivs_build_report.txt. >> >> I don't understand the cause behind the error messages I got, so any >> advice would be welcomed. >> >> Sam Dupree. > Hi Sam, > > I've got a partial solution. > I haven't used f2py yet but at least the error from your first `f2py` > call seems straightforward. Near the top: > > Line #119 in gravity_derivs.f:" integer * 4 degree" > updatevars: no name pattern found for entity='*4degree'. Skipping. > > This shows that the fortran code gets parsed as `(integer) > (*4degree)`. That can't be right. 
There might be a way to tell f2py to > do this right, but anyway I could make your code compile by replacing > every such declaration with `integer * 4 :: degree` etc (i.e. adding > double colons everywhere). > Once that's fixed your first f2py call raises another error: > > Fatal Error: Cannot open module file ?deriv_class.mod? for reading > at (1): No such file or directory > > I could generate these mod files by manually running `gfortran -c > auto_deriv.f90`. After that the .mod files appear and your first > `f2py` call will succed. > You can now `import gravity_derivs`, but of course this will lead to > an error because `auto_deriv` is not available in python. > Unfortunately your _second_` f2py` call also dies on `auto_deriv.f90`, > with such offending lines: > > In: :auto_deriv:auto_deriv.f90:ad_auxiliary > get_parameters: got "invalid syntax (, line 1)" on '(/((i, > i=j,n), j=1,n)/)' > > I'm guessing that again f2py can't parse that syntax. > My hunch is that if you can get f2py to work with `auto_deriv.f90` you > should first run that. This should hopefully generate the .mod files > after which the second call to `f2py` with `gravity_derivs.f` should > work. If `f2py` doesn't generate the .mod files you could at worst run > your fortran compiler yourself between the two calls to `f2py`. > Cheers, > > Andr?s > >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > From mark.harfouche at gmail.com Tue Nov 3 09:17:54 2020 From: mark.harfouche at gmail.com (Mark Harfouche) Date: Tue, 3 Nov 2020 09:17:54 -0500 Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks In-Reply-To: References: <65b3fe9e-6f6a-4c5e-943e-e5747076eb0d@www.fastmail.com> Message-ID: Juan made a pretty good argument for keeping 3.6 support in the next scikit-image release, let me try to paraphrase: - Since nobody has made the PR to explicitly drop python 3.6 from the scikit-image build matrix, we will continue to support it, but if somebody were to make the PR, I (Juan) would support it. As for supporting PyPy: it already exists in the build matrix AFAICT. Breaking PyPy would be a deliberate action, as opposed to an accidental byproduct of dropping CPython 3.6. On Mon, Nov 2, 2020, 13:50 Sebastian Berg wrote: > On Mon, 2020-11-02 at 06:49 -0600, Juan Nunez-Iglesias wrote: > > I like Ralf's email, and most of all I agree that the existing > > wording is clearer. > > > > My view on the NEP is that it does not mandate dropping support, but > > encourage it. In my projects I would drop it if I had use for Python > > 3.7+ features. It so happens that we want to use PEP-593 so we were > > grateful for NEP-29 giving us "permission" to drop 3.6. > > > > I would suggest that 3.6 be dropped immediately if there are any open > > PRs that would benefit from it, or code cleanups that it would > > enable. The point of the NEP is to short-circuit discussion about > > whether it's "worth" dropping 3.6. If it's valuable at all, do it. > > > > Probably the only thing that requires 3.7 in NumPy at this time is the > module level `__getattr__`, which is used only for deprecations (and to > make the financial removal slightly more gentle). > I am not sure if PyPy already has stable support for 3.7 yet? 
Although > PyPy is maybe not a big priority. > > We don't have to support 3.6 and I don't care if we do. Until this > discussion my assumption was we would probably drop it. > > But, current master is tested against 3.6, so the main work seems > release related. If Chuck thinks that is no hassle I don't mind if > NumPy is a bit more conservative than NEP 29. > > Or is there a danger of setting a precedent where projects are wrongly > expected to keep support just because NumPy still has it, so that NumPy > not being conservative actually helps everyone? > > - Sebastian > > > > Thanks all, > > > > Juan. > > > > On Mon, 2 Nov 2020, at 2:01 AM, Ralf Gommers wrote: > > > > > > On Mon, Nov 2, 2020 at 7:47 AM Stephan Hoyer > > > wrote: > > > > On Sun, Nov 1, 2020 at 7:47 PM Stefan van der Walt < > > > > stefanv at berkeley.edu> wrote: > > > > > On Sun, Nov 1, 2020, at 18:54, Jarrod Millman wrote: > > > > > > I also misunderstood the purpose of the NEP. I assumed it > > > > > > was > > > > > > intended to encourage projects to drop old versions of > > > > > > Python. > > > > > > It was. It is. I think the NEP is very clear on that. Honestly we > > > should just follow the NEP and drop 3.6 now for both NumPy and > > > SciPy, I just am tired of arguing for it - which the NEP should > > > have prevented being necessary, and I don't want to do again right > > > now, so this will probably be my last email on this thread. > > > > > > > > > > > Other > > > > > > people have viewed the NEP similarly: > > > > > > https://github.com/networkx/networkx/issues/4027 > > > > > > > > > > Of all the packages, it makes sense for NumPy to behave most > > > > > conservatively with depreciations. The NEP suggests allowable > > > > > support periods, but as far as I recall does not enforce > > > > > minimal support. > > > > > > It doesn't *enforce* it, but the recommendation is very clear. It > > > would be good to follow it. > > > > > > > > Stephan Hoyer had a good recommendation on how we can clarify > > > > > the NEP to be easier to intuit. Stephan, shall we make an > > > > > ammendment to the NEP with your idea? > > > > > > > > For reference, here was my proposed revision: > > > > https://github.com/numpy/numpy/pull/14086#issuecomment-649287648 > > > > Specifically, rather than saying "the latest release of NumPy > > > > supports all versions of Python released in the 42 months before > > > > NumPy's release", it says "NumPy will only require versions of > > > > Python that were released more than 24 months ago". In practice, > > > > this works out to the same thing (at least given Python's old 18 > > > > month release cycle). > > > > > > > > This changes the definition of the support window (in a way that > > > > I think is clearer and that works better for infrequent > > > > releases), but there is still the question of how large that > > > > window should be for NumPy. > > > > > > I'm not sure it's clearer, the current NEP has a nice graphic and > > > literally says "a project with a major or minor version release in > > > November 2020 should support Python 3.7 and newer."). However happy > > > to adopt it if it makes others happy - in the end it comes down to > > > the same thing: it's recommended to drop Python 3.6 now. > > > > > > > My personal opinion is that somewhere in the range of 24-36 > > > > months would be appropriate. 
> > > > > > +1 > > > > > > Cheers, > > > Ralf > > > > > > > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matti.picus at gmail.com Tue Nov 3 10:54:24 2020 From: matti.picus at gmail.com (Matti Picus) Date: Tue, 3 Nov 2020 17:54:24 +0200 Subject: [Numpy-discussion] New package to speed up ufunc inner loops Message-ID: Hi. On behalf of Quansight and RTOSHoldings, I would like to introduce "pnumpy", a package to speed up NumPy. https://quansight.github.io/numpy-threading-extensions/stable/index.html What is in it? - use "PyUFunc_ReplaceLoopBySignature" to hook all the UFunc inner loops - When the inner loop is called with a large enough array, chunk the data and perform the iteration via a thread pool - Add a different memory allocator for "ndarray" data (will require an appropriate API from NumPy) - Allow using optimized loops above and beyond what NumPy provides - Allow logging inner loop calls and parameters to learn about the current process and perhaps tune the performance accordingly The first release contains the hooking mechanism and the thread pool, the rest has been prototyped but is not ready for release. The idea behind the package is that a third-party package can try things out and iterate much faster than NumPy. If some of the ideas bear fruit, and do not add an undue maintenance burden to NumPy, the code can be ported to NumPy. I am not sure NumPy wishes to take upon itself the burden of managing threads, but a third-party package may be able to. I am writing to the mailing list both to announce the pre-release under the wrong name, and, in accordance with the fair play rules[1], to request use of the "numpy" name in the package. We had considered many options, in the end would like to propose "pnumpy" (the p is either "parallel" or "performant" or "preliminary", whatever you desire). Matti [1] https://numpy.org/neps/nep-0036-fair-play.html#fair-play-rules From tcaswell at gmail.com Tue Nov 3 13:49:39 2020 From: tcaswell at gmail.com (Thomas Caswell) Date: Tue, 3 Nov 2020 13:49:39 -0500 Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks In-Reply-To: References: <65b3fe9e-6f6a-4c5e-943e-e5747076eb0d@www.fastmail.com> Message-ID: I am in favor of dropping py36 for np1.20, I think it would be good to lead by example. Similar to pandas, the next Matplotlib release (3.4 targeted for Dec/Jan) will not support py36. Tom On Tue, Nov 3, 2020 at 9:18 AM Mark Harfouche wrote: > Juan made a pretty good argument for keeping 3.6 support in the next > scikit-image release, let me try to paraphrase: > > - Since nobody has made the PR to explicitly drop python 3.6 from the > scikit-image build matrix, we will continue to support it, but if somebody > were to make the PR, I (Juan) would support it. > > As for supporting PyPy: it already exists in the build matrix AFAICT. > Breaking PyPy would be a deliberate action, as opposed to an accidental > byproduct of dropping CPython 3.6. 
> > On Mon, Nov 2, 2020, 13:50 Sebastian Berg > wrote: > >> On Mon, 2020-11-02 at 06:49 -0600, Juan Nunez-Iglesias wrote: >> > I like Ralf's email, and most of all I agree that the existing >> > wording is clearer. >> > >> > My view on the NEP is that it does not mandate dropping support, but >> > encourage it. In my projects I would drop it if I had use for Python >> > 3.7+ features. It so happens that we want to use PEP-593 so we were >> > grateful for NEP-29 giving us "permission" to drop 3.6. >> > >> > I would suggest that 3.6 be dropped immediately if there are any open >> > PRs that would benefit from it, or code cleanups that it would >> > enable. The point of the NEP is to short-circuit discussion about >> > whether it's "worth" dropping 3.6. If it's valuable at all, do it. >> > >> >> Probably the only thing that requires 3.7 in NumPy at this time is the >> module level `__getattr__`, which is used only for deprecations (and to >> make the financial removal slightly more gentle). >> I am not sure if PyPy already has stable support for 3.7 yet? Although >> PyPy is maybe not a big priority. >> >> We don't have to support 3.6 and I don't care if we do. Until this >> discussion my assumption was we would probably drop it. >> >> But, current master is tested against 3.6, so the main work seems >> release related. If Chuck thinks that is no hassle I don't mind if >> NumPy is a bit more conservative than NEP 29. >> >> Or is there a danger of setting a precedent where projects are wrongly >> expected to keep support just because NumPy still has it, so that NumPy >> not being conservative actually helps everyone? >> >> - Sebastian >> >> >> > Thanks all, >> > >> > Juan. >> > >> > On Mon, 2 Nov 2020, at 2:01 AM, Ralf Gommers wrote: >> > > >> > > On Mon, Nov 2, 2020 at 7:47 AM Stephan Hoyer >> > > wrote: >> > > > On Sun, Nov 1, 2020 at 7:47 PM Stefan van der Walt < >> > > > stefanv at berkeley.edu> wrote: >> > > > > On Sun, Nov 1, 2020, at 18:54, Jarrod Millman wrote: >> > > > > > I also misunderstood the purpose of the NEP. I assumed it >> > > > > > was >> > > > > > intended to encourage projects to drop old versions of >> > > > > > Python. >> > > >> > > It was. It is. I think the NEP is very clear on that. Honestly we >> > > should just follow the NEP and drop 3.6 now for both NumPy and >> > > SciPy, I just am tired of arguing for it - which the NEP should >> > > have prevented being necessary, and I don't want to do again right >> > > now, so this will probably be my last email on this thread. >> > > >> > > >> > > > > Other >> > > > > > people have viewed the NEP similarly: >> > > > > > https://github.com/networkx/networkx/issues/4027 >> > > > > >> > > > > Of all the packages, it makes sense for NumPy to behave most >> > > > > conservatively with depreciations. The NEP suggests allowable >> > > > > support periods, but as far as I recall does not enforce >> > > > > minimal support. >> > > >> > > It doesn't *enforce* it, but the recommendation is very clear. It >> > > would be good to follow it. >> > > >> > > > > Stephan Hoyer had a good recommendation on how we can clarify >> > > > > the NEP to be easier to intuit. Stephan, shall we make an >> > > > > ammendment to the NEP with your idea? 
>> > > > >> > > > For reference, here was my proposed revision: >> > > > https://github.com/numpy/numpy/pull/14086#issuecomment-649287648 >> > > > Specifically, rather than saying "the latest release of NumPy >> > > > supports all versions of Python released in the 42 months before >> > > > NumPy's release", it says "NumPy will only require versions of >> > > > Python that were released more than 24 months ago". In practice, >> > > > this works out to the same thing (at least given Python's old 18 >> > > > month release cycle). >> > > > >> > > > This changes the definition of the support window (in a way that >> > > > I think is clearer and that works better for infrequent >> > > > releases), but there is still the question of how large that >> > > > window should be for NumPy. >> > > >> > > I'm not sure it's clearer, the current NEP has a nice graphic and >> > > literally says "a project with a major or minor version release in >> > > November 2020 should support Python 3.7 and newer."). However happy >> > > to adopt it if it makes others happy - in the end it comes down to >> > > the same thing: it's recommended to drop Python 3.6 now. >> > > >> > > > My personal opinion is that somewhere in the range of 24-36 >> > > > months would be appropriate. >> > > >> > > +1 >> > > >> > > Cheers, >> > > Ralf >> > > >> > > >> > > >> > > _______________________________________________ >> > > NumPy-Discussion mailing list >> > > NumPy-Discussion at python.org >> > > https://mail.python.org/mailman/listinfo/numpy-discussion >> > > >> > >> > _______________________________________________ >> > NumPy-Discussion mailing list >> > NumPy-Discussion at python.org >> > https://mail.python.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -- Thomas Caswell tcaswell at gmail.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From b.sipocz+numpylist at gmail.com Tue Nov 3 14:58:11 2020 From: b.sipocz+numpylist at gmail.com (Brigitta Sipocz) Date: Tue, 3 Nov 2020 11:58:11 -0800 Subject: [Numpy-discussion] NumPy 1.20.x branch in two weeks In-Reply-To: References: <65b3fe9e-6f6a-4c5e-943e-e5747076eb0d@www.fastmail.com> Message-ID: Hi, For what it's worth, python 3.6 is also dropped for astropy 4.2 (RC1 to be released in the next few days). We haven't yet formally adopted NEP29, but are very close to it peding some word smithing, and no one from the dev team was fighting for keeping support for 3.6. or numpy 1.16. Cheers, Brigitta On Tue, 3 Nov 2020 at 10:53, Thomas Caswell wrote: > I am in favor of dropping py36 for np1.20, I think it would be good to > lead by example. > > Similar to pandas, the next Matplotlib release (3.4 targeted for Dec/Jan) > will not support py36. > > Tom > > > > On Tue, Nov 3, 2020 at 9:18 AM Mark Harfouche > wrote: > >> Juan made a pretty good argument for keeping 3.6 support in the next >> scikit-image release, let me try to paraphrase: >> >> - Since nobody has made the PR to explicitly drop python 3.6 from the >> scikit-image build matrix, we will continue to support it, but if somebody >> were to make the PR, I (Juan) would support it. >> >> As for supporting PyPy: it already exists in the build matrix AFAICT. 
>> Breaking PyPy would be a deliberate action, as opposed to an accidental >> byproduct of dropping CPython 3.6. >> >> On Mon, Nov 2, 2020, 13:50 Sebastian Berg >> wrote: >> >>> On Mon, 2020-11-02 at 06:49 -0600, Juan Nunez-Iglesias wrote: >>> > I like Ralf's email, and most of all I agree that the existing >>> > wording is clearer. >>> > >>> > My view on the NEP is that it does not mandate dropping support, but >>> > encourage it. In my projects I would drop it if I had use for Python >>> > 3.7+ features. It so happens that we want to use PEP-593 so we were >>> > grateful for NEP-29 giving us "permission" to drop 3.6. >>> > >>> > I would suggest that 3.6 be dropped immediately if there are any open >>> > PRs that would benefit from it, or code cleanups that it would >>> > enable. The point of the NEP is to short-circuit discussion about >>> > whether it's "worth" dropping 3.6. If it's valuable at all, do it. >>> > >>> >>> Probably the only thing that requires 3.7 in NumPy at this time is the >>> module level `__getattr__`, which is used only for deprecations (and to >>> make the financial removal slightly more gentle). >>> I am not sure if PyPy already has stable support for 3.7 yet? Although >>> PyPy is maybe not a big priority. >>> >>> We don't have to support 3.6 and I don't care if we do. Until this >>> discussion my assumption was we would probably drop it. >>> >>> But, current master is tested against 3.6, so the main work seems >>> release related. If Chuck thinks that is no hassle I don't mind if >>> NumPy is a bit more conservative than NEP 29. >>> >>> Or is there a danger of setting a precedent where projects are wrongly >>> expected to keep support just because NumPy still has it, so that NumPy >>> not being conservative actually helps everyone? >>> >>> - Sebastian >>> >>> >>> > Thanks all, >>> > >>> > Juan. >>> > >>> > On Mon, 2 Nov 2020, at 2:01 AM, Ralf Gommers wrote: >>> > > >>> > > On Mon, Nov 2, 2020 at 7:47 AM Stephan Hoyer >>> > > wrote: >>> > > > On Sun, Nov 1, 2020 at 7:47 PM Stefan van der Walt < >>> > > > stefanv at berkeley.edu> wrote: >>> > > > > On Sun, Nov 1, 2020, at 18:54, Jarrod Millman wrote: >>> > > > > > I also misunderstood the purpose of the NEP. I assumed it >>> > > > > > was >>> > > > > > intended to encourage projects to drop old versions of >>> > > > > > Python. >>> > > >>> > > It was. It is. I think the NEP is very clear on that. Honestly we >>> > > should just follow the NEP and drop 3.6 now for both NumPy and >>> > > SciPy, I just am tired of arguing for it - which the NEP should >>> > > have prevented being necessary, and I don't want to do again right >>> > > now, so this will probably be my last email on this thread. >>> > > >>> > > >>> > > > > Other >>> > > > > > people have viewed the NEP similarly: >>> > > > > > https://github.com/networkx/networkx/issues/4027 >>> > > > > >>> > > > > Of all the packages, it makes sense for NumPy to behave most >>> > > > > conservatively with depreciations. The NEP suggests allowable >>> > > > > support periods, but as far as I recall does not enforce >>> > > > > minimal support. >>> > > >>> > > It doesn't *enforce* it, but the recommendation is very clear. It >>> > > would be good to follow it. >>> > > >>> > > > > Stephan Hoyer had a good recommendation on how we can clarify >>> > > > > the NEP to be easier to intuit. Stephan, shall we make an >>> > > > > ammendment to the NEP with your idea? 
>>> > > > >>> > > > For reference, here was my proposed revision: >>> > > > https://github.com/numpy/numpy/pull/14086#issuecomment-649287648 >>> > > > Specifically, rather than saying "the latest release of NumPy >>> > > > supports all versions of Python released in the 42 months before >>> > > > NumPy's release", it says "NumPy will only require versions of >>> > > > Python that were released more than 24 months ago". In practice, >>> > > > this works out to the same thing (at least given Python's old 18 >>> > > > month release cycle). >>> > > > >>> > > > This changes the definition of the support window (in a way that >>> > > > I think is clearer and that works better for infrequent >>> > > > releases), but there is still the question of how large that >>> > > > window should be for NumPy. >>> > > >>> > > I'm not sure it's clearer, the current NEP has a nice graphic and >>> > > literally says "a project with a major or minor version release in >>> > > November 2020 should support Python 3.7 and newer."). However happy >>> > > to adopt it if it makes others happy - in the end it comes down to >>> > > the same thing: it's recommended to drop Python 3.6 now. >>> > > >>> > > > My personal opinion is that somewhere in the range of 24-36 >>> > > > months would be appropriate. >>> > > >>> > > +1 >>> > > >>> > > Cheers, >>> > > Ralf >>> > > >>> > > >>> > > >>> > > _______________________________________________ >>> > > NumPy-Discussion mailing list >>> > > NumPy-Discussion at python.org >>> > > https://mail.python.org/mailman/listinfo/numpy-discussion >>> > > >>> > >>> > _______________________________________________ >>> > NumPy-Discussion mailing list >>> > NumPy-Discussion at python.org >>> > https://mail.python.org/mailman/listinfo/numpy-discussion >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >>> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > > > -- > Thomas Caswell > tcaswell at gmail.com > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Wed Nov 4 01:49:45 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 04 Nov 2020 00:49:45 -0600 Subject: [Numpy-discussion] NumPy Development Meeting Wednesday - Triage Focus (US switched times last week) Message-ID: Hi all, Our bi-weekly triage-focused NumPy development meeting is today (Wednesday, November 4th) at 11 am Pacific Time (18:00 UTC). Everyone is invited to join in and edit the work-in-progress meeting topics and notes: https://hackmd.io/68i_JvOYQfy9ERiHgXMPvg I encourage everyone to notify us of issues or PRs that you feel should be prioritized, discussed, or reviewed. Best regards Sebastian -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From ralf.gommers at gmail.com Wed Nov 4 16:43:16 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 4 Nov 2020 21:43:16 +0000 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: On Tue, Nov 3, 2020 at 3:54 PM Matti Picus wrote: > Hi. On behalf of Quansight and RTOSHoldings, I would like to introduce > "pnumpy", a package to speed up NumPy. > > https://quansight.github.io/numpy-threading-extensions/stable/index.html > > > What is in it? > > - use "PyUFunc_ReplaceLoopBySignature" to hook all the UFunc inner loops > > - When the inner loop is called with a large enough array, chunk the > data and perform the iteration via a thread pool > > - Add a different memory allocator for "ndarray" data (will require an > appropriate API from NumPy) > > - Allow using optimized loops above and beyond what NumPy provides > > - Allow logging inner loop calls and parameters to learn about the > current process and perhaps tune the performance accordingly > > > The first release contains the hooking mechanism and the thread pool, > the rest has been prototyped but is not ready for release. The idea > behind the package is that a third-party package can try things out and > iterate much faster than NumPy. If some of the ideas bear fruit, and do > not add an undue maintenance burden to NumPy, the code can be ported to > NumPy. I am not sure NumPy wishes to take upon itself the burden of > managing threads, but a third-party package may be able to. > > > I am writing to the mailing list both to announce the pre-release under > the wrong name, and, in accordance with the fair play rules[1], to > request use of the "numpy" name in the package. We had considered many > options; in the end we would like to propose "pnumpy" (the p is either > "parallel" or "performant" or "preliminary", whatever you desire). > Thanks Matti! Obviously as another Quansight employee I have a conflict of interest here, so let me just say I wasn't involved with choosing the `pnumpy` name, but I already commented internally that using "numpy" as part of the package name would probably be fine, given that Matti is the main author and the intent is to migrate the useful parts into NumPy itself. Hopefully someone else can comment, maybe Stéfan as the "fair play" NEP author? Cheers, Ralf > > Matti > > > [1] https://numpy.org/neps/nep-0036-fair-play.html#fair-play-rules > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From asmeurer at gmail.com Wed Nov 4 16:47:40 2020 From: asmeurer at gmail.com (Aaron Meurer) Date: Wed, 4 Nov 2020 14:47:40 -0700 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: I hope this isn't too off topic, but this "fair play" NEP reads like it is a set of additional restrictions on the NumPy license, which if it is, would make NumPy no longer open source by the OSI definition. I think the NEP should be much clearer that these are requests but not requirements. Aaron Meurer On Wed, Nov 4, 2020 at 2:44 PM Ralf Gommers wrote: > > > On Tue, Nov 3, 2020 at 3:54 PM Matti Picus wrote: >> >> Hi. 
On behalf of Quansight and RTOSHoldings, I would like to introduce >> "pnumpy", a package to speed up NumPy. >> >> https://quansight.github.io/numpy-threading-extensions/stable/index.html >> >> >> What is in it? >> >> - use "PyUFunc_ReplaceLoopBySignature" to hook all the UFunc inner loops >> >> - When the inner loop is called with a large enough array, chunk the >> data and perform the iteration via a thread pool >> >> - Add a different memory allocator for "ndarray" data (will require an >> appropriate API from NumPy) >> >> - Allow using optimized loops above and beyond what NumPy provides >> >> - Allow logging inner loop calls and parameters to learn about the >> current process and perhaps tune the performance accordingly >> >> >> The first release contains the hooking mechanism and the thread pool, >> the rest has been prototyped but is not ready for release. The idea >> behind the package is that a third-party package can try things out and >> iterate much faster than NumPy. If some of the ideas bear fruit, and do >> not add an undue maintenance burden to NumPy, the code can be ported to >> NumPy. I am not sure NumPy wishes to take upon itself the burden of >> managing threads, but a third-party package may be able to. >> >> >> I am writing to the mailing list both to announce the pre-release under >> the wrong name, and, in accordance with the fair play rules[1], to >> request use of the "numpy" name in the package. We had considered many >> options; in the end we would like to propose "pnumpy" (the p is either >> "parallel" or "performant" or "preliminary", whatever you desire). > > > Thanks Matti! > > Obviously as another Quansight employee I have a conflict of interest here, so let me just say I wasn't involved with choosing the `pnumpy` name, but I already commented internally that using "numpy" as part of the package name would probably be fine, given that Matti is the main author and the intent is to migrate the useful parts into NumPy itself. > > Hopefully someone else can comment, maybe Stéfan as the "fair play" NEP author? > > Cheers, > Ralf > > >> >> >> Matti >> >> >> [1] https://numpy.org/neps/nep-0036-fair-play.html#fair-play-rules >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From robert.kern at gmail.com Wed Nov 4 17:01:29 2020 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 4 Nov 2020 17:01:29 -0500 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: On Wed, Nov 4, 2020 at 4:49 PM Aaron Meurer wrote: > I hope this isn't too off topic, but this "fair play" NEP reads like > it is a set of additional restrictions on the NumPy license, which if > it is, would make NumPy no longer open source by the OSI definition. I > think the NEP should be much clearer that these are requests but not > requirements. > FWIW, I don't read the NEP like that. Aside from the trademark on the name "NumPy", which _are_ enforceable requirements but are orthogonal to the copyright license, I see enough "request-like" language on everything else. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Wed Nov 4 17:15:37 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 04 Nov 2020 16:15:37 -0600 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: <615e34a86cd0fd6bedb4c4d053f7c9006e1a28c5.camel@sipsolutions.net> On Tue, 2020-11-03 at 17:54 +0200, Matti Picus wrote: > Hi. On behalf of Quansight and RTOSHoldings, I would like to > introduce > "pnumpy", a package to speed up NumPy. > > https://quansight.github.io/numpy-threading-extensions/stable/index.html > Nice to see these efforts, especially with the intention of possible upstreaming. I hope we can improve the NumPy infrastructure to make experiments like these much easier and more powerful in the future! (And as I mentioned, I had such things in mind with NEP 43, albeit as a possible later extension, not an explicit goal.) I am a bit curious about the actual performance improvements, even without allowing more flexibility on the NumPy side. My gut feeling is that there will be fairly large variations, with sometimes big improvements due to parallelization but often only added overheads, due to NumPy not giving you deep enough control.
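A quick way to get a feeling for this would be to time a large ufunc call before and after enabling the hooks. A minimal sketch follows; note the enabling call is hypothetical here, since I have not checked the exact pnumpy entry point, so consult the package docs for the real name:

import timeit
import numpy as np

a = np.random.rand(10_000_000)

# Baseline: plain NumPy inner loops on a large array.
print("baseline:", timeit.timeit(lambda: np.sin(a), number=10))

import pnumpy
pnumpy.initialize()  # hypothetical enabling call; see the pnumpy docs

# The same timing should now exercise the chunked, threaded inner loops.
print("hooked:  ", timeit.timeit(lambda: np.sin(a), number=10))

Comparing a few dtypes and array sizes this way should show where the thread pool wins and where the chunking overhead dominates.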
As to the name, I don't have an issue with using `pnumpy`, although I was never hugely concerned about it. Initially I thought a longer name might be nicer, but the old(?) accelerated-numpy or fast_numpy_loops doesn't seem that much clearer to me. I guess in the end, I think it's just important to be clear that this type of project patches/modifies NumPy and is not associated with it directly. It seems `pnumpy` is already taken on PyPI, though, with a small number of downloads: https://pypistats.org/packages/pnumpy (although I wonder how many are actual users). Cheers, Sebastian > > What is in it? > > - use "PyUFunc_ReplaceLoopBySignature" to hook all the UFunc inner > loops > > - When the inner loop is called with a large enough array, chunk the > data and perform the iteration via a thread pool > > - Add a different memory allocator for "ndarray" data (will require > an > appropriate API from NumPy) > > - Allow using optimized loops above and beyond what NumPy provides > > - Allow logging inner loop calls and parameters to learn about the > current process and perhaps tune the performance accordingly > > > The first release contains the hooking mechanism and the thread > pool, > the rest has been prototyped but is not ready for release. The idea > behind the package is that a third-party package can try things out > and > iterate much faster than NumPy. If some of the ideas bear fruit, and > do > not add an undue maintenance burden to NumPy, the code can be ported > to > NumPy. I am not sure NumPy wishes to take upon itself the burden of > managing threads, but a third-party package may be able to. > > > I am writing to the mailing list both to announce the pre-release > under > the wrong name, and, in accordance with the fair play rules[1], to > request use of the "numpy" name in the package. We had considered > many > options; in the end we would like to propose "pnumpy" (the p is either > "parallel" or "performant" or "preliminary", whatever you desire). > > > Matti > > > [1] https://numpy.org/neps/nep-0036-fair-play.html#fair-play-rules > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From stefanv at berkeley.edu Wed Nov 4 17:20:22 2020 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Wed, 04 Nov 2020 14:20:22 -0800 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: On Wed, Nov 4, 2020, at 13:47, Aaron Meurer wrote: > I hope this isn't too off topic, but this "fair play" NEP reads like > it is a set of additional restrictions on the NumPy license, which if > it is, would make NumPy no longer open source by the OSI definition. I > think the NEP should be much clearer that these are requests but not > requirements. Specifically, the NEP is worded as follows: """ This document aims to define a minimal set of rules that, when followed, will be considered good-faith efforts in line with the expectations of the NumPy developers. ... When in doubt, please talk to us first. We may suggest an alternative; at minimum, we'll be prepared. """ There is no language of forced restriction. The heading in question is "Do not reuse the NumPy name for projects not developed by the NumPy community". Matti is a member of our community, and while the project may be sponsored by others, he is doing exactly what the NEP recommends: discussing the issue with the community. Community members should weigh in if they see an issue with the naming. I don't think this is a particularly good name for a package (not easy to pronounce, does not indicate functionality of the package), but I don't personally have an issue with it either. Best regards, Stéfan From asmeurer at gmail.com Wed Nov 4 17:54:18 2020 From: asmeurer at gmail.com (Aaron Meurer) Date: Wed, 4 Nov 2020 15:54:18 -0700 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: On Wed, Nov 4, 2020 at 3:02 PM Robert Kern wrote: > > On Wed, Nov 4, 2020 at 4:49 PM Aaron Meurer wrote: >> >> I hope this isn't too off topic, but this "fair play" NEP reads like >> it is a set of additional restrictions on the NumPy license, which if >> it is, would make NumPy no longer open source by the OSI definition. I >> think the NEP should be much clearer that these are requests but not >> requirements. > > > FWIW, I don't read the NEP like that. Aside from the trademark on the name "NumPy", which _are_ enforceable requirements but are orthogonal to the copyright license, I see enough "request-like" language on everything else. To be clear, I don't read it like that either. But I also implicitly understand that this is the intention of the document, because I know that NumPy wouldn't actually place restrictions like these on its license. My point is just that the document ought to be clearer about this, as I can easily see someone misinterpreting it, especially if they aren't close enough to the community that they would implicitly understand that it is only a set of guidelines. > There is no language of forced restriction. The language you quoted reads ambiguously to me. It isn't forced, but it also isn't obviously nonforced. "Please talk to us first" is the sort of language I would expect to see for software that is commercially licensed and can only be used with permission. All the bullet points say "do not", which sounds forced to me. And the trademark thing makes it even more confusing because even if you read the rest as "only guidelines", it isn't clear if this is somehow an exception. 
Again, *I* understand the purpose of this document, but I think the way it is currently written it could easily be misinterpreted by someone else. Aaron Meurer > > -- > Robert Kern > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From stefanv at berkeley.edu Wed Nov 4 18:27:09 2020 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Wed, 04 Nov 2020 15:27:09 -0800 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: <31e9ec2c-2c6a-4757-910f-a68f2bf328f3@www.fastmail.com> On Wed, Nov 4, 2020, at 14:54, Aaron Meurer wrote: > Again, *I* understand the purpose of this document, but I think the > way it is currently written it could easily be misinterpreted by > someone else. Misinterpreted in what way? That they would think we have an ability to enforce the guidelines? We *are* trying to encourage certain behavior here. If they read it and, out of abundant caution, reach out to us, that's a fine outcome. What negative outcomes do you foresee? Stéfan From robert.kern at gmail.com Wed Nov 4 18:29:31 2020 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 4 Nov 2020 18:29:31 -0500 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: On Wed, Nov 4, 2020 at 5:55 PM Aaron Meurer wrote: > On Wed, Nov 4, 2020 at 3:02 PM Robert Kern wrote: > > > > On Wed, Nov 4, 2020 at 4:49 PM Aaron Meurer wrote: > >> > >> I hope this isn't too off topic, but this "fair play" NEP reads like > >> it is a set of additional restrictions on the NumPy license, which if > >> it is, would make NumPy no longer open source by the OSI definition. I > >> think the NEP should be much clearer that these are requests but not > >> requirements. > > > > > > FWIW, I don't read the NEP like that. Aside from the trademark on the > name "NumPy", which _are_ enforceable requirements but are orthogonal to > the copyright license, I see enough "request-like" language on everything > else. > > To be clear, I don't read it like that either. But I also implicitly > understand that this is the intention of the document, because I know > that NumPy wouldn't actually place restrictions like these on its > license. My point is just that the document ought to be clearer about > this, as I can easily see someone misinterpreting it, especially if > they aren't close enough to the community that they would implicitly > understand that it is only a set of guidelines. > > > There is no language of forced restriction. > > The language you quoted reads ambiguously to me. It isn't forced, but > it also isn't obviously nonforced. "Please talk to us first" is the > sort of language I would expect to see for software that is > commercially licensed and can only be used with permission. All the > bullet points say "do not", which sounds forced to me. And the > trademark thing makes it even more confusing because even if you read > the rest as "only guidelines", it isn't clear if this is somehow an > exception. > If you pick out an individual sentence and consider it in isolation, sure. But there's a significant amount of context in the Abstract, Motivation, and Scope sections that preface the rules. And the discussion of many of the rules explicitly covers ways to "break" the rules if you have to. We use "rule" language in many contexts besides legally-enforceable contracts and licenses. 
Again, *I* understand the purpose of this document, but I think the > way it is currently written it could easily be misinterpreted by > someone else. > I'm willing to wait for someone to actually misinterpret it. That's not to say that there isn't clearer language that could be drafted. The NEP is still in Draft stage. But if you think it could be clearer, please propose specific edits to the draft. Like with unclear documentation, it's the person who finds the current docs insufficient/confusing/unclear that is in the best position to recommend the language that would have helped them. Collaboration helps. -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From asmeurer at gmail.com Wed Nov 4 19:21:20 2020 From: asmeurer at gmail.com (Aaron Meurer) Date: Wed, 4 Nov 2020 17:21:20 -0700 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: > Misinterpreted in what way? That they would think we have an ability to enforce the guidelines? We *are* trying to encourage certain behavior here. If they read it and, our of abundant caution reach out to us, that's a fine outcome. > What negative outcomes do you foresee? That it is a legal requirement, as part of the license to use NumPy. The negative outcome is that someone reads the document and believes NumPy to not actually be open source software. > That's not to say that there isn't clearer language that could be drafted. The NEP is still in Draft stage. But if you think it could be clearer, please propose specific edits to the draft. Like with unclear documentation, it's the person who finds the current docs insufficient/confusing/unclear that is in the best position to recommend the language that would have helped them. Collaboration helps. I disagree. The best person to write documentation is the person who actually understands the package. I already noted that I don't actually understand the actual situation with the trademark, for instance. I don't really understand why there is pushback for making NEP clearer. Also "like with unclear documentation", if someone says that documentation is unclear, you should take their word for it that it actually is, and improve it, rather than somehow trying to argue that they actually aren't confused. But as I noted, this is already off topic for the original discussion here, and since there's apparently no interest in improving the NEP wording, I'll drop it. Aaron Meurer From stefanv at berkeley.edu Wed Nov 4 19:29:41 2020 From: stefanv at berkeley.edu (Stefan van der Walt) Date: Wed, 04 Nov 2020 16:29:41 -0800 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: <7932850c-f115-4b57-b7e2-251e10e5a9dc@www.fastmail.com> On Wed, Nov 4, 2020, at 16:21, Aaron Meurer wrote: > But as I noted, this is already off topic for the original discussion > here, and since there's apparently no interest in improving the NEP > wording, I'll drop it. I was trying to understand where, specifically, the language falls short, and what to do about improving it. Perhaps a sentence making it clear that this is not a licensing issue will assuage your concerns? If not, please help me understand where statements are overly strong, unclear, or insufficient in coverage. 
Best regards, Stéfan From robert.kern at gmail.com Wed Nov 4 19:42:17 2020 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 4 Nov 2020 19:42:17 -0500 Subject: [Numpy-discussion] New package to speed up ufunc inner loops In-Reply-To: References: Message-ID: On Wed, Nov 4, 2020 at 7:22 PM Aaron Meurer wrote: > > > That's not to say that there isn't clearer language that could be > drafted. The NEP is still in Draft stage. But if you think it could be > clearer, please propose specific edits to the draft. Like with unclear > documentation, it's the person who finds the current docs > insufficient/confusing/unclear that is in the best position to recommend > the language that would have helped them. Collaboration helps. > > I disagree. The best person to write documentation is the person who > actually understands the package. I already noted that I don't > actually understand the actual situation with the trademark, for > instance. > Rather, I meant that the best person to fix confusing language is the person who was confused, after consulting with the authors/experts to come to a consensus about what was intended. > I don't really understand why there is pushback for making NEP > clearer. Also "like with unclear documentation", if someone says that > documentation is unclear, you should take their word for it that it > actually is, and improve it, rather than somehow trying to argue that > they actually aren't confused. > I'm not. I'm saying that I don't know how to make it more clear to those people because I'm not experiencing it like they are. The things I could think to add are the same kinds of things that were already stated explicitly in the Abstract, Motivation, and Scope. It seems like Stefan is in the same boat. Authors need editors, but the editor can't just say "rewrite!" I don't know what kind of assumptions and context this hypothetical reader is bringing to this reading that are leading to confusion. Sometimes it's clear, but not for me here (and, more relevantly, not for Stefan). Do you think this needs a complete revamp? Or just an additional sentence to explicitly state that this does not add additional legal restrictions to the copyright license? -- Robert Kern -------------- next part -------------- An HTML attachment was scrubbed... URL: From tyler.je.reddy at gmail.com Wed Nov 4 22:25:24 2020 From: tyler.je.reddy at gmail.com (Tyler Reddy) Date: Wed, 4 Nov 2020 20:25:24 -0700 Subject: [Numpy-discussion] ANN: SciPy 1.5.4 Message-ID: Hi all, On behalf of the SciPy development team I'm pleased to announce the release of SciPy 1.5.4, which is a bug-fix release that includes Python 3.9 wheels and a more complete fix for build issues on Xcode 12. Sources and binary wheels can be found at: https://pypi.org/project/scipy/ and at: https://github.com/scipy/scipy/releases/tag/v1.5.4 One of a few ways to install this release with pip: pip install scipy==1.5.4 ===================== SciPy 1.5.4 Release Notes ===================== SciPy 1.5.4 is a bug-fix release with no new features compared to 1.5.3. Importantly, wheels are now available for Python 3.9 and a more complete fix has been applied for issues building with Xcode 12. Authors ====== * Peter Bell * CJ Carey * Andrew McCluskey + * Andrew Nelson * Tyler Reddy * Eli Rykoff + * Ian Thomas + A total of 7 people contributed to this release. People with a "+" by their names contributed a patch for the first time. This list of names is automatically generated, and may not be fully complete. 
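For anyone who wants to check a downloaded artifact against the checksum lists at the end of this announcement, a minimal sketch (the filename is just an example):

import hashlib

# Compute the SHA256 of a downloaded file and compare it by eye with the
# matching value in the "Checksums" section below.
with open("scipy-1.5.4.tar.gz", "rb") as f:
    print(hashlib.sha256(f.read()).hexdigest())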
Issues closed for 1.5.4 ------------------------------- * `#12763 `__: ndimage.fourier_ellipsoid segmentation fault * `#12789 `__: TestConvolve2d.test_large_array failing on Windows ILP64 CI job * `#12857 `__: sparse A[0,:] = ndarray is ok, A[:,0] = ndarray ValueError from... * `#12860 `__: BUG: Build failure with Xcode 12 * `#12935 `__: Failure to build with Python 3.9.0 on macOS * `#12966 `__: MAINT: lint_diff.py on some backport PRs * `#12988 `__: BUG: Highly multi-dimensional \`gaussian_kde\` giving \`-inf\`... Pull requests for 1.5.4 ------------------------------ * `#12790 `__: TST: Skip TestConvolve2d.test_large_array if not enough memory * `#12851 `__: BUG: sparse: fix inner indexed assignment of a 1d array * `#12875 `__: BUG: segfault in ndimage.fourier_ellipsoid with length-1 dims * `#12937 `__: CI: macOS3.9 testing * `#12957 `__: MAINT: fixes XCode 12/ python 3.9.0 build for 1.5.x maint branch * `#12959 `__: CI: add Windows Python 3.9 to CI * `#12974 `__: MAINT: Run lint_diff.py against the merge target and only for... * `#12978 `__: DOC: next_fast_len output doesn't match docstring * `#12979 `__: BUG: fft.next_fast_len should accept keyword arguments * `#12989 `__: BUG: improved the stability of kde for highly (1000s) multi-dimension... * `#13017 `__: BUG: Add explicit cast to _tmp sum. * `#13022 `__: TST: xfail test_maxiter_worsening() Checksums ========= MD5 ~~~ 09a446e10033c3132f1f257e3f4d9735 scipy-1.5.4-cp36-cp36m-macosx_10_9_x86_64.whl 25e58fde2fd4eb6c7717719db85e368b scipy-1.5.4-cp36-cp36m-manylinux1_i686.whl 2c9705cd57788ad79ea0c1015208f41f scipy-1.5.4-cp36-cp36m-manylinux1_x86_64.whl d0fb84f3ff45e4149698fbc662ac4d47 scipy-1.5.4-cp36-cp36m-manylinux2014_aarch64.whl f94f0e274cd2960ecb2d8751632e098c scipy-1.5.4-cp36-cp36m-win32.whl f56f4d5b67fccc49fb64331c28bdf7d1 scipy-1.5.4-cp36-cp36m-win_amd64.whl 33e0843f8619b78547866579134a733b scipy-1.5.4-cp37-cp37m-macosx_10_9_x86_64.whl 6720a406d82bd08c4370b665d5eddeb9 scipy-1.5.4-cp37-cp37m-manylinux1_i686.whl eafc3bc8a12d41cb348c73b54ad25ad5 scipy-1.5.4-cp37-cp37m-manylinux1_x86_64.whl 1174418ae0614d621acdb49faeaadcb8 scipy-1.5.4-cp37-cp37m-manylinux2014_aarch64.whl 5ca53c5cd6828498c0a41c3ae747a34b scipy-1.5.4-cp37-cp37m-win32.whl cdb91a7db9cf79b7446680f8d106aabc scipy-1.5.4-cp37-cp37m-win_amd64.whl 02a29a4eec9c61c30aef7439138fe1b3 scipy-1.5.4-cp38-cp38-macosx_10_9_x86_64.whl ce8e02167763493374c4bea807139a1b scipy-1.5.4-cp38-cp38-manylinux1_i686.whl 65ec027bfa6bed805dac62744b45c693 scipy-1.5.4-cp38-cp38-manylinux1_x86_64.whl c358b4b332cc9dbcd1eadc229d8b019e scipy-1.5.4-cp38-cp38-manylinux2014_aarch64.whl 492ec3bfe082229076a83d74cfa51d7e scipy-1.5.4-cp38-cp38-win32.whl d5d12211502429f3bc3074b12ca1f541 scipy-1.5.4-cp38-cp38-win_amd64.whl da25e7ac777e8b1b6cd7f117f163e6d2 scipy-1.5.4-cp39-cp39-macosx_10_9_x86_64.whl 12275e3578eb17065081d83d329d18db scipy-1.5.4-cp39-cp39-manylinux1_i686.whl 6778d670f75f536921c3d38e44517280 scipy-1.5.4-cp39-cp39-manylinux1_x86_64.whl efda61c74b29ffe714b6b842ec369a19 scipy-1.5.4-cp39-cp39-manylinux2014_aarch64.whl 107204c14328df879c5fc941e7829389 scipy-1.5.4-cp39-cp39-win32.whl ed6970f7538d38dd91a42950bd6843b7 scipy-1.5.4-cp39-cp39-win_amd64.whl 293401ee7ac354a2f2313373b497f40e scipy-1.5.4.tar.gz d446ec7a6b0bc44484389ab7589eccf5 scipy-1.5.4.tar.xz 47d0dabdc684475bc2aac7e8db9eea6f scipy-1.5.4.zip SHA256 ~~~~~~ 4f12d13ffbc16e988fa40809cbbd7a8b45bc05ff6ea0ba8e3e41f6f4db3a9e47 scipy-1.5.4-cp36-cp36m-macosx_10_9_x86_64.whl a254b98dbcc744c723a838c03b74a8a34c0558c9ac5c86d5561703362231107d 
scipy-1.5.4-cp36-cp36m-manylinux1_i686.whl 368c0f69f93186309e1b4beb8e26d51dd6f5010b79264c0f1e9ca00cd92ea8c9 scipy-1.5.4-cp36-cp36m-manylinux1_x86_64.whl 4598cf03136067000855d6b44d7a1f4f46994164bcd450fb2c3d481afc25dd06 scipy-1.5.4-cp36-cp36m-manylinux2014_aarch64.whl e98d49a5717369d8241d6cf33ecb0ca72deee392414118198a8e5b4c35c56340 scipy-1.5.4-cp36-cp36m-win32.whl 65923bc3809524e46fb7eb4d6346552cbb6a1ffc41be748535aa502a2e3d3389 scipy-1.5.4-cp36-cp36m-win_amd64.whl 9ad4fcddcbf5dc67619379782e6aeef41218a79e17979aaed01ed099876c0e62 scipy-1.5.4-cp37-cp37m-macosx_10_9_x86_64.whl f87b39f4d69cf7d7529d7b1098cb712033b17ea7714aed831b95628f483fd012 scipy-1.5.4-cp37-cp37m-manylinux1_i686.whl 25b241034215247481f53355e05f9e25462682b13bd9191359075682adcd9554 scipy-1.5.4-cp37-cp37m-manylinux1_x86_64.whl fa789583fc94a7689b45834453fec095245c7e69c58561dc159b5d5277057e4c scipy-1.5.4-cp37-cp37m-manylinux2014_aarch64.whl d6d25c41a009e3c6b7e757338948d0076ee1dd1770d1c09ec131f11946883c54 scipy-1.5.4-cp37-cp37m-win32.whl 2c872de0c69ed20fb1a9b9cf6f77298b04a26f0b8720a5457be08be254366c6e scipy-1.5.4-cp37-cp37m-win_amd64.whl e360cb2299028d0b0d0f65a5c5e51fc16a335f1603aa2357c25766c8dab56938 scipy-1.5.4-cp38-cp38-macosx_10_9_x86_64.whl 3397c129b479846d7eaa18f999369a24322d008fac0782e7828fa567358c36ce scipy-1.5.4-cp38-cp38-manylinux1_i686.whl 168c45c0c32e23f613db7c9e4e780bc61982d71dcd406ead746c7c7c2f2004ce scipy-1.5.4-cp38-cp38-manylinux1_x86_64.whl 213bc59191da2f479984ad4ec39406bf949a99aba70e9237b916ce7547b6ef42 scipy-1.5.4-cp38-cp38-manylinux2014_aarch64.whl 634568a3018bc16a83cda28d4f7aed0d803dd5618facb36e977e53b2df868443 scipy-1.5.4-cp38-cp38-win32.whl b03c4338d6d3d299e8ca494194c0ae4f611548da59e3c038813f1a43976cb437 scipy-1.5.4-cp38-cp38-win_amd64.whl 3d5db5d815370c28d938cf9b0809dade4acf7aba57eaf7ef733bfedc9b2474c4 scipy-1.5.4-cp39-cp39-macosx_10_9_x86_64.whl 6b0ceb23560f46dd236a8ad4378fc40bad1783e997604ba845e131d6c680963e scipy-1.5.4-cp39-cp39-manylinux1_i686.whl ed572470af2438b526ea574ff8f05e7f39b44ac37f712105e57fc4d53a6fb660 scipy-1.5.4-cp39-cp39-manylinux1_x86_64.whl 8c8d6ca19c8497344b810b0b0344f8375af5f6bb9c98bd42e33f747417ab3f57 scipy-1.5.4-cp39-cp39-manylinux2014_aarch64.whl d84cadd7d7998433334c99fa55bcba0d8b4aeff0edb123b2a1dfcface538e474 scipy-1.5.4-cp39-cp39-win32.whl cc1f78ebc982cd0602c9a7615d878396bec94908db67d4ecddca864d049112f2 scipy-1.5.4-cp39-cp39-win_amd64.whl 4a453d5e5689de62e5d38edf40af3f17560bfd63c9c5bd228c18c1f99afa155b scipy-1.5.4.tar.gz 5c87347bfe2db6e23d391aa226584f6b280248c0ca71e08f26f1faf9d7a76bc9 scipy-1.5.4.tar.xz e0bcc10c133a151937550bb42301c56439d34098b1b8f9dd18c5919d604edd37 scipy-1.5.4.zip -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Thu Nov 5 10:21:44 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 05 Nov 2020 09:21:44 -0600 Subject: [Numpy-discussion] Officially drop Python 3.6 from NumPy 1.20 (was: NumPy 1.20.x branch in two weeks) In-Reply-To: References: <65b3fe9e-6f6a-4c5e-943e-e5747076eb0d@www.fastmail.com> Message-ID: <1e974b9d8d746dac2a79ecb507ea180788e04404.camel@sipsolutions.net> Hi all, just to note: We discussed this yesterday briefly and decided to drop official support for 3.6 in the 1.20 release. We never had ambition to support 1.20 and there seems advantage in dropping it, if mainly for clarity and consistency with many other projects. If you disagree with this decision, please just bring it up so we can reconsider. 
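For downstream projects that want to mirror the change, the usual mechanism is the `python_requires` package metadata. A minimal sketch, assuming a setuptools-based setup.py (the package name is hypothetical):

from setuptools import setup

setup(
    name="example-project",    # hypothetical package name
    version="1.0",
    python_requires=">=3.7",   # mirrors NumPy 1.20 dropping Python 3.6
    install_requires=["numpy"],
)

With that metadata in place, pip on Python 3.6 will fall back to the last compatible release rather than installing one that no longer supports it.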
Cheers, Sebastian PS: We may keep testing on 3.6 for the moment, at least for PyPy for technical reasons. On Tue, 2020-11-03 at 11:58 -0800, Brigitta Sipocz wrote: > Hi, > > For what it's worth, python 3.6 is also dropped for astropy 4.2 (RC1 > to be > released in the next few days). We haven't yet formally adopted > NEP29, but > are very close to it peding some word smithing, and no one from the > dev > team was fighting for keeping support for 3.6. or numpy 1.16. > > Cheers, > Brigitta > > On Tue, 3 Nov 2020 at 10:53, Thomas Caswell > wrote: > > > I am in favor of dropping py36 for np1.20, I think it would be good > > to > > lead by example. > > > > Similar to pandas, the next Matplotlib release (3.4 targeted for > > Dec/Jan) > > will not support py36. > > > > Tom > > > > > > > > On Tue, Nov 3, 2020 at 9:18 AM Mark Harfouche < > > mark.harfouche at gmail.com> > > wrote: > > > > > Juan made a pretty good argument for keeping 3.6 support in the > > > next > > > scikit-image release, let me try to paraphrase: > > > > > > - Since nobody has made the PR to explicitly drop python 3.6 from > > > the > > > scikit-image build matrix, we will continue to support it, but if > > > somebody > > > were to make the PR, I (Juan) would support it. > > > > > > As for supporting PyPy: it already exists in the build matrix > > > AFAICT. > > > Breaking PyPy would be a deliberate action, as opposed to an > > > accidental > > > byproduct of dropping CPython 3.6. > > > > > > On Mon, Nov 2, 2020, 13:50 Sebastian Berg < > > > sebastian at sipsolutions.net> > > > wrote: > > > > > > > On Mon, 2020-11-02 at 06:49 -0600, Juan Nunez-Iglesias wrote: > > > > > I like Ralf's email, and most of all I agree that the > > > > > existing > > > > > wording is clearer. > > > > > > > > > > My view on the NEP is that it does not mandate dropping > > > > > support, but > > > > > encourage it. In my projects I would drop it if I had use for > > > > > Python > > > > > 3.7+ features. It so happens that we want to use PEP-593 so > > > > > we were > > > > > grateful for NEP-29 giving us "permission" to drop 3.6. > > > > > > > > > > I would suggest that 3.6 be dropped immediately if there are > > > > > any open > > > > > PRs that would benefit from it, or code cleanups that it > > > > > would > > > > > enable. The point of the NEP is to short-circuit discussion > > > > > about > > > > > whether it's "worth" dropping 3.6. If it's valuable at all, > > > > > do it. > > > > > > > > > > > > > Probably the only thing that requires 3.7 in NumPy at this time > > > > is the > > > > module level `__getattr__`, which is used only for deprecations > > > > (and to > > > > make the financial removal slightly more gentle). > > > > I am not sure if PyPy already has stable support for 3.7 yet? > > > > Although > > > > PyPy is maybe not a big priority. > > > > > > > > We don't have to support 3.6 and I don't care if we do. Until > > > > this > > > > discussion my assumption was we would probably drop it. > > > > > > > > But, current master is tested against 3.6, so the main work > > > > seems > > > > release related. If Chuck thinks that is no hassle I don't mind > > > > if > > > > NumPy is a bit more conservative than NEP 29. > > > > > > > > Or is there a danger of setting a precedent where projects are > > > > wrongly > > > > expected to keep support just because NumPy still has it, so > > > > that NumPy > > > > not being conservative actually helps everyone? > > > > > > > > - Sebastian > > > > > > > > > > > > > Thanks all, > > > > > > > > > > Juan. 
> > > > > > > > > > On Mon, 2 Nov 2020, at 2:01 AM, Ralf Gommers wrote: > > > > > > On Mon, Nov 2, 2020 at 7:47 AM Stephan Hoyer < > > > > > > shoyer at gmail.com> > > > > > > wrote: > > > > > > > On Sun, Nov 1, 2020 at 7:47 PM Stefan van der Walt < > > > > > > > stefanv at berkeley.edu> wrote: > > > > > > > > On Sun, Nov 1, 2020, at 18:54, Jarrod Millman wrote: > > > > > > > > > I also misunderstood the purpose of the NEP. I > > > > > > > > > assumed it > > > > > > > > > was > > > > > > > > > intended to encourage projects to drop old versions > > > > > > > > > of > > > > > > > > > Python. > > > > > > > > > > > > It was. It is. I think the NEP is very clear on that. > > > > > > Honestly we > > > > > > should just follow the NEP and drop 3.6 now for both NumPy > > > > > > and > > > > > > SciPy, I just am tired of arguing for it - which the NEP > > > > > > should > > > > > > have prevented being necessary, and I don't want to do > > > > > > again right > > > > > > now, so this will probably be my last email on this thread. > > > > > > > > > > > > > > > > > > > > Other > > > > > > > > > people have viewed the NEP similarly: > > > > > > > > > https://github.com/networkx/networkx/issues/4027 > > > > > > > > > > > > > > > > Of all the packages, it makes sense for NumPy to behave > > > > > > > > most > > > > > > > > conservatively with depreciations. The NEP suggests > > > > > > > > allowable > > > > > > > > support periods, but as far as I recall does not > > > > > > > > enforce > > > > > > > > minimal support. > > > > > > > > > > > > It doesn't *enforce* it, but the recommendation is very > > > > > > clear. It > > > > > > would be good to follow it. > > > > > > > > > > > > > > Stephan Hoyer had a good recommendation on how we can > > > > > > > > clarify > > > > > > > > the NEP to be easier to intuit. Stephan, shall we make > > > > > > > > an > > > > > > > > ammendment to the NEP with your idea? > > > > > > > > > > > > > > For reference, here was my proposed revision: > > > > > > > https://github.com/numpy/numpy/pull/14086#issuecomment-649287648 > > > > > > > Specifically, rather than saying "the latest release of > > > > > > > NumPy > > > > > > > supports all versions of Python released in the 42 months > > > > > > > before > > > > > > > NumPy's release", it says "NumPy will only require > > > > > > > versions of > > > > > > > Python that were released more than 24 months ago". In > > > > > > > practice, > > > > > > > this works out to the same thing (at least given Python's > > > > > > > old 18 > > > > > > > month release cycle). > > > > > > > > > > > > > > This changes the definition of the support window (in a > > > > > > > way that > > > > > > > I think is clearer and that works better for infrequent > > > > > > > releases), but there is still the question of how large > > > > > > > that > > > > > > > window should be for NumPy. > > > > > > > > > > > > I'm not sure it's clearer, the current NEP has a nice > > > > > > graphic and > > > > > > literally says "a project with a major or minor version > > > > > > release in > > > > > > November 2020 should support Python 3.7 and newer."). > > > > > > However happy > > > > > > to adopt it if it makes others happy - in the end it comes > > > > > > down to > > > > > > the same thing: it's recommended to drop Python 3.6 now. > > > > > > > > > > > > > My personal opinion is that somewhere in the range of 24- > > > > > > > 36 > > > > > > > months would be appropriate. 
> > > > > > > > > > > > +1 > > > > > > > > > > > > Cheers, > > > > > > Ralf > > > > > > > > > > > > > > > > > > > > > > > > _______________________________________________ > > > > > > NumPy-Discussion mailing list > > > > > > NumPy-Discussion at python.org > > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > > > > > > > _______________________________________________ > > > > > NumPy-Discussion mailing list > > > > > NumPy-Discussion at python.org > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at python.org > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > -- > > Thomas Caswell > > tcaswell at gmail.com > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From sebastian at sipsolutions.net Thu Nov 5 11:55:08 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 05 Nov 2020 10:55:08 -0600 Subject: [Numpy-discussion] Add sliding_window_view method to numpy In-Reply-To: References: Message-ID: Hi all, just a brief note that I merged this proposal: https://github.com/numpy/numpy/pull/17394 adding `np.sliding_window_view` into the 1.20 release of NumPy. There was only one public API change, and that is that the `shape` argument is now called `window_shape`. This is still a good time for feedback in case you have a better idea e.g. for the function or parameter names. Cheers, Sebastian On Mon, 2020-10-12 at 08:39 +0000, Zimmermann Klaus wrote: > Hello, > > I would like to draw the attention of this list to PR #17394 [1] that > adds the implementation of a sliding window view to numpy. > > Having a sliding window view in numpy is a longstanding open issue > (cf > #7753 [2] from 2016). A brief summary of the discussions surrounding > it > can be found in the description of the PR. > > This PR implements a sliding window view based on stride tricks. > Following the discussion in issue #7753, a first implementation was > provided by Fanjin Zeng in PR #10771. After some discussion, that PR > stalled and I picked up the issue in the present PR #17394. It is > based > on the first implementation, but follows the changed API as suggested > by > Eric Wieser. > > Code reviews have been provided by Bas van Beek, Stephen Hoyer, and > Eric > Wieser. Sebastian Berg added the "62 - Python API" label. > > > Do you think this is suitable for inclusion in numpy? > > Do you consider the PR ready? > > Do you have suggestions or requests? > > > Thanks for your time and consideration! 
> Klaus > > > [1] https://github.com/numpy/numpy/pull/17394 > [2] https://github.com/numpy/numpy/issues/7753 > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From ralf.gommers at gmail.com Thu Nov 5 14:15:17 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 5 Nov 2020 19:15:17 +0000 Subject: [Numpy-discussion] Add sliding_window_view method to numpy In-Reply-To: References: Message-ID: On Thu, Nov 5, 2020 at 4:56 PM Sebastian Berg wrote: > Hi all, > > just a brief note that I merged this proposal: > > https://github.com/numpy/numpy/pull/17394 > > adding `np.sliding_window_view` into the 1.20 release of NumPy. > > There was only one public API change, and that is that the `shape` > argument is now called `window_shape`. > > This is still a good time for feedback in case you have a better idea > e.g. for the function or parameter names. > The old PR had this in the lib.stride_tricks namespace. Seeing it in the main namespace is unexpected and likely will lead to issues/questions, given that such an overlapping view is going to behave in ways the average user will be surprised by. It may also lead to requests for other array/tensor libraries to implement this. I don't see any discussion on this in PR 17394, it looks like a decision by the PR author that no one commented on - reconsider that? Cheers, Ralf > > Cheers, > > Sebastian > > > > On Mon, 2020-10-12 at 08:39 +0000, Zimmermann Klaus wrote: > > Hello, > > > > I would like to draw the attention of this list to PR #17394 [1] that > > adds the implementation of a sliding window view to numpy. > > > > Having a sliding window view in numpy is a longstanding open issue > > (cf > > #7753 [2] from 2016). A brief summary of the discussions surrounding > > it > > can be found in the description of the PR. > > > > This PR implements a sliding window view based on stride tricks. > > Following the discussion in issue #7753, a first implementation was > > provided by Fanjin Zeng in PR #10771. After some discussion, that PR > > stalled and I picked up the issue in the present PR #17394. It is > > based > > on the first implementation, but follows the changed API as suggested > > by > > Eric Wieser. > > > > Code reviews have been provided by Bas van Beek, Stephan Hoyer, and > > Eric > > Wieser. Sebastian Berg added the "62 - Python API" label. > > > > > > Do you think this is suitable for inclusion in numpy? > > > > Do you consider the PR ready? > > > > Do you have suggestions or requests? > > > > > > Thanks for your time and consideration! > > Klaus > > > > > > [1] https://github.com/numpy/numpy/pull/17394 > > [2] https://github.com/numpy/numpy/issues/7753 > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From noamraph at gmail.com Thu Nov 5 15:12:24 2020 From: noamraph at gmail.com (Noam Yorav-Raphael) Date: Thu, 5 Nov 2020 22:12:24 +0200 Subject: [Numpy-discussion] datetime64: Remove deprecation warning when constructing with timezone Message-ID: Hi, I suggest removing the deprecation warning when constructing a datetime64 with a timezone. For example, this is the current behavior: >>> np.datetime64('2020-11-05 16:00+0200') :1: DeprecationWarning: parsing timezone aware datetimes is deprecated; this will raise an error in the future numpy.datetime64('2020-11-05T14:00') I suggest removing the deprecation warning because I find this to be a useful behavior, and because it is a correct behavior. The manual says: "The datetime object represents a single moment in time... Datetimes are always stored based on POSIX time, with an epoch of 1970-01-01T00:00Z." So 2020-11-05T16:00+0200 is indeed the moment in time represented by np.datetime64('2020-11-05T14:00'). I just used this to restrict my data set to records created after a certain moment. It was easier for me to write the moment in my local time and add "+0200" than to figure out the moment representation in UTC. So this is my simple suggestion: remove the deprecation warning. Beyond that, I have 3 ideas for changing the repr of datetime64 that I would like to discuss. 1. Add "Z" at the end, for example, numpy.datetime64('2020-11-05T14:00Z'). This will make it clear to which moment it refers. I think this is significant - I had to dig quite a bit to realize that datetime64('2020-11-05T14:00') means 14:00 UTC. 2. Replace the 'T' with a space. I just find it much easier to read '2020-11-05 14:00Z' than '2020-11-05T14:00Z'. The long sequence of characters makes it hard for my brain to parse. 3. This will require discussion, but will be very convenient: have the repr display the time using the environment time zone, including a time offset. So, in my specific time zone (+0200), I will have: repr(np.datetime64('2020-11-05 14:00Z')) == "numpy.datetime64('2020-11-05T16:00+0200')" I'm sure the pros and cons of having an environment-dependent repr should be discussed. But I will list some pros: 1. It's very convenient - it's immediately obvious to me to which moment 2020-11-05 16:00+0200 refers. 2. It's well defined - I may collect timestamps from machines with different time zones, and I will be able to know to which exact moment each timestamp refers. 3. It's very simple - I could compare any two timestamps, I don't have to worry about time zones. I would be happy to hear your thoughts. Thanks, Noam -------------- next part -------------- An HTML attachment was scrubbed... URL: From shoyer at gmail.com Thu Nov 5 15:51:36 2020 From: shoyer at gmail.com (Stephan Hoyer) Date: Thu, 5 Nov 2020 12:51:36 -0800 Subject: [Numpy-discussion] Add sliding_window_view method to numpy In-Reply-To: References: Message-ID: On Thu, Nov 5, 2020 at 11:16 AM Ralf Gommers wrote: > > > On Thu, Nov 5, 2020 at 4:56 PM Sebastian Berg > wrote: > >> Hi all, >> >> just a brief note that I merged this proposal: >> >> https://github.com/numpy/numpy/pull/17394 >> >> adding `np.sliding_window_view` into the 1.20 release of NumPy. >> >> There was only one public API change, and that is that the `shape` >> argument is now called `window_shape`. >> >> This is still a good time for feedback in case you have a better idea >> e.g. for the function or parameter names. >> > > The old PR had this in the lib.stride_tricks namespace. 
Seeing it in the > main namespace is unexpected and likely will lead to issues/questions, > given that such an overlapping view is going to do behave in ways the > average user will be surprised by. It may also lead to requests for other > array/tensor libraries to implement this. I don't see any discussion on > this in PR 17394, it looks like a decision by the PR author that no one > commented on - reconsider that? > > Cheers, > Ralf > +1 let's keep this in the lib.stride_tricks namespace. > > > > >> >> Cheers, >> >> Sebastian >> >> >> >> On Mon, 2020-10-12 at 08:39 +0000, Zimmermann Klaus wrote: >> > Hello, >> > >> > I would like to draw the attention of this list to PR #17394 [1] that >> > adds the implementation of a sliding window view to numpy. >> > >> > Having a sliding window view in numpy is a longstanding open issue >> > (cf >> > #7753 [2] from 2016). A brief summary of the discussions surrounding >> > it >> > can be found in the description of the PR. >> > >> > This PR implements a sliding window view based on stride tricks. >> > Following the discussion in issue #7753, a first implementation was >> > provided by Fanjin Zeng in PR #10771. After some discussion, that PR >> > stalled and I picked up the issue in the present PR #17394. It is >> > based >> > on the first implementation, but follows the changed API as suggested >> > by >> > Eric Wieser. >> > >> > Code reviews have been provided by Bas van Beek, Stephen Hoyer, and >> > Eric >> > Wieser. Sebastian Berg added the "62 - Python API" label. >> > >> > >> > Do you think this is suitable for inclusion in numpy? >> > >> > Do you consider the PR ready? >> > >> > Do you have suggestions or requests? >> > >> > >> > Thanks for your time and consideration! >> > Klaus >> > >> > >> > [1] https://github.com/numpy/numpy/pull/17394 >> > [2] https://github.com/numpy/numpy/issues/7753 >> > _______________________________________________ >> > NumPy-Discussion mailing list >> > NumPy-Discussion at python.org >> > https://mail.python.org/mailman/listinfo/numpy-discussion >> > >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From wieser.eric+numpy at gmail.com Thu Nov 5 16:04:21 2020 From: wieser.eric+numpy at gmail.com (Eric Wieser) Date: Thu, 5 Nov 2020 21:04:21 +0000 Subject: [Numpy-discussion] datetime64: Remove deprecation warning when constructing with timezone In-Reply-To: References: Message-ID: Without weighing in yet on how I feel about the deprecation, you can see some discussion about why this was originally deprecated in the PR that introduced the warning: https://github.com/numpy/numpy/pull/6453 Eric On Thu, Nov 5, 2020, 20:13 Noam Yorav-Raphael wrote: > Hi, > > I suggest removing the deprecation warning when constructing a datetime64 > with a timezone. For example, this is the current behavior: > > >>> np.datetime64('2020-11-05 16:00+0200') > :1: DeprecationWarning: parsing timezone aware datetimes is > deprecated; this will raise an error in the future > numpy.datetime64('2020-11-05T14:00') > > I suggest removing the deprecation warning because I find this to be a > useful behavior, and because it is a correct behavior. 
The manual says: > "The datetime object represents a single moment in time... Datetimes are > always stored based on POSIX time, with an epoch of 1970-01-01T00:00Z." > So 2020-11-05T16:00+0200 is indeed the moment in time represented by > np.datetime64('2020-11-05T14:00'). > > I just used this to restrict my data set to records created after a > certain moment. It was easier for me to write the moment in my local time > and add "+0200" than to figure out the moment representation in UTC. > > So this is my simple suggestion: remove the deprecation warning. > > > Beyond that, I have 3 ideas for changing the repr of datetime64 that I > would like to discuss. > > 1. Add "Z" at the end, for example, numpy.datetime64('2020-11-05T14:00Z'). > This will make it clear to which moment it refers. I think this is > significant - I had to dig quite a bit to realize that > datetime64('2020-11-05T14:00') means 14:00 UTC. > > 2. Replace the 'T' with a space. I just find it much easier to read > '2020-11-05 14:00Z' than '2020-11-05T14:00Z'. The long sequence of > characters makes it hard for my brain to parse. > > 3. This will require discussion, but will be very convenient: have the > repr display the time using the environment time zone, including a time > offset. So, in my specific time zone (+0200), I will have: > > repr(np.datetime64('2020-11-05 14:00Z')) == > "numpy.datetime64('2020-11-05T16:00+0200')" > > I'm sure the pros and cons of having an environment-dependent repr should > be discussed. But I will list some pros: > 1. It's very convenient - it's immediately obvious to me to which moment > 2020-11-05 16:00+0200 refers. > 2. It's well defined - I may collect timestamps from machines with > different time zones, and I will be able to know to which exact moment each > timestamp refers. > 3. It's very simple - I could compare any two timestamps, I don't have to > worry about time zones. > > I would be happy to hear your thoughts. > > Thanks, > Noam > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Thu Nov 5 18:35:41 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 05 Nov 2020 17:35:41 -0600 Subject: [Numpy-discussion] Add sliding_window_view method to numpy In-Reply-To: References: Message-ID: On Thu, 2020-11-05 at 12:51 -0800, Stephan Hoyer wrote: > On Thu, Nov 5, 2020 at 11:16 AM Ralf Gommers > wrote: > > > > > On Thu, Nov 5, 2020 at 4:56 PM Sebastian Berg < > > sebastian at sipsolutions.net> > > wrote: > > > > > Hi all, > > > > > > just a brief note that I merged this proposal: > > > > > > https://github.com/numpy/numpy/pull/17394 > > > > > > adding `np.sliding_window_view` into the 1.20 release of NumPy. > > > > > > There was only one public API change, and that is that the > > > `shape` > > > argument is now called `window_shape`. > > > > > > This is still a good time for feedback in case you have a better > > > idea > > > e.g. for the function or parameter names. > > > > > > > The old PR had this in the lib.stride_tricks namespace. Seeing it > > in the > > main namespace is unexpected and likely will lead to > > issues/questions, > > given that such an overlapping view is going to do behave in ways > > the > > average user will be surprised by. It may also lead to requests for > > other > > array/tensor libraries to implement this. 
I don't see any > > discussion on > > this in PR 17394, it looks like a decision by the PR author that no > > one > > commented on - reconsider that? > > > > Cheers, > > Ralf > > > > +1 let's keep this in the lib.stride_tricks namespace. > I have no reservations against having it in the main namespace and am happy either way (it can still be exposed later in any case). It is the conservative choice and maybe it is an uncommon enough function that it deserves being a bit hidden... But I am curious, it sounds like you both have very strong reservations, and I would like to understand them better. The behaviour can be surprising, but that is why the default is a read-only view. I do not think it is worse than `np.broadcast_to` in this regard. (It is nowhere near as dangerous as `as_strided`.) It is true that it is specific to NumPy (memory model). So that is maybe a good enough reason right now. But I am not sure that stuffing things into a pretty hidden `np.lib.*` namespace is a great long term solution either. There is very little useful functionality hidden away in `np.lib.*` currently. Cheers, Sebastian > > > > > > > > > > Cheers, > > > > > > Sebastian > > > > > > > > > > > > On Mon, 2020-10-12 at 08:39 +0000, Zimmermann Klaus wrote: > > > > Hello, > > > > > > > > I would like to draw the attention of this list to PR #17394 > > > > [1] that > > > > adds the implementation of a sliding window view to numpy. > > > > > > > > Having a sliding window view in numpy is a longstanding open > > > > issue > > > > (cf > > > > #7753 [2] from 2016). A brief summary of the discussions > > > > surrounding > > > > it > > > > can be found in the description of the PR. > > > > > > > > This PR implements a sliding window view based on stride > > > > tricks. > > > > Following the discussion in issue #7753, a first implementation > > > > was > > > > provided by Fanjin Zeng in PR #10771. After some discussion, > > > > that PR > > > > stalled and I picked up the issue in the present PR #17394. It > > > > is > > > > based > > > > on the first implementation, but follows the changed API as > > > > suggested > > > > by > > > > Eric Wieser. > > > > > > > > Code reviews have been provided by Bas van Beek, Stephen Hoyer, > > > > and > > > > Eric > > > > Wieser. Sebastian Berg added the "62 - Python API" label. > > > > > > > > > > > > Do you think this is suitable for inclusion in numpy? > > > > > > > > Do you consider the PR ready? > > > > > > > > Do you have suggestions or requests? > > > > > > > > > > > > Thanks for your time and consideration!
> > > > Klaus > > > > > > > > > > > > [1] https://github.com/numpy/numpy/pull/17394 > > > > [2] https://github.com/numpy/numpy/issues/7753 > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at python.org > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From shoyer at gmail.com Thu Nov 5 18:44:48 2020 From: shoyer at gmail.com (Stephan Hoyer) Date: Thu, 5 Nov 2020 15:44:48 -0800 Subject: [Numpy-discussion] datetime64: Remove deprecation warning when constructing with timezone In-Reply-To: References: Message-ID: I can try to dig up the old discussions, but datetime64 used to implement both (1) and (3), and this was updated in a very intentional way. Datetime64 now works like Python's own time-zone naive datetime.datetime objects. The documentation referencing "Z" should be updated -- datetime64 can be in any timezone you like. Timezone aware datetime objects are certainly useful, but NumPy's datetime64 was restricted to UTC. The consensus was that it was worse to have UTC-only rather than timezone-naive-only. NumPy's datetime64 is often used for data analysis purposes, for which automatic conversion to the local timezone of the computer running the analysis is often counter-productive. If you care about timezone conversions, I would highly recommend looking into pandas's Timestamp class for this purpose. In the future, this would be a good use-case for a new custom NumPy dtype. (The existing np.datetime64 code cannot easily handle multiple timezones.) On Thu, Nov 5, 2020 at 1:04 PM Eric Wieser wrote: > Without weighing in yet on how I feel about the deprecation, you can see > some discussion about why this was originally deprecated in the PR that > introduced the warning: > > https://github.com/numpy/numpy/pull/6453 > > Eric > > On Thu, Nov 5, 2020, 20:13 Noam Yorav-Raphael wrote: > >> Hi, >> >> I suggest removing the deprecation warning when constructing a datetime64 >> with a timezone. For example, this is the current behavior: >> >> >>> np.datetime64('2020-11-05 16:00+0200') >> :1: DeprecationWarning: parsing timezone aware datetimes is >> deprecated; this will raise an error in the future >> numpy.datetime64('2020-11-05T14:00') >> >> I suggest removing the deprecation warning because I find this to be a >> useful behavior, and because it is a correct behavior. The manual says: >> "The datetime object represents a single moment in time... Datetimes are >> always stored based on POSIX time, with an epoch of 1970-01-01T00:00Z." >> So 2020-11-05T16:00+0200 is indeed the moment in time represented by >> np.datetime64('2020-11-05T14:00'). >> >> I just used this to restrict my data set to records created after a >> certain moment. 
It was easier for me to write the moment in my local time >> and add "+0200" than to figure out the moment representation in UTC. >> >> So this is my simple suggestion: remove the deprecation warning. >> >> >> Beyond that, I have 3 ideas for changing the repr of datetime64 that I >> would like to discuss. >> >> 1. Add "Z" at the end, for example, >> numpy.datetime64('2020-11-05T14:00Z'). This will make it clear to which >> moment it refers. I think this is significant - I had to dig quite a bit to >> realize that datetime64('2020-11-05T14:00') means 14:00 UTC. >> >> 2. Replace the 'T' with a space. I just find it much easier to read >> '2020-11-05 14:00Z' than '2020-11-05T14:00Z'. The long sequence of >> characters makes it hard for my brain to parse. >> >> 3. This will require discussion, but will be very convenient: have the >> repr display the time using the environment time zone, including a time >> offset. So, in my specific time zone (+0200), I will have: >> >> repr(np.datetime64('2020-11-05 14:00Z')) == >> "numpy.datetime64('2020-11-05T16:00+0200')" >> >> I'm sure the pros and cons of having an environment-dependent repr should >> be discussed. But I will list some pros: >> 1. It's very convenient - it's immediately obvious to me to which moment >> 2020-11-05 16:00+0200 refers. >> 2. It's well defined - I may collect timestamps from machines with >> different time zones, and I will be able to know to which exact moment each >> timestamp refers. >> 3. It's very simple - I could compare any two timestamps, I don't have to >> worry about time zones. >> >> I would be happy to hear your thoughts. >> >> Thanks, >> Noam >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Thu Nov 5 19:39:06 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 05 Nov 2020 18:39:06 -0600 Subject: [Numpy-discussion] Add sliding_window_view method to numpy In-Reply-To: References: Message-ID: <1cdb0b720f09845d03ccfdc2e171f98d7e925ee3.camel@sipsolutions.net> On Thu, 2020-11-05 at 17:35 -0600, Sebastian Berg wrote: > On Thu, 2020-11-05 at 12:51 -0800, Stephan Hoyer wrote: > > On Thu, Nov 5, 2020 at 11:16 AM Ralf Gommers < > > ralf.gommers at gmail.com> > > wrote: > > > > > On Thu, Nov 5, 2020 at 4:56 PM Sebastian Berg < > > > sebastian at sipsolutions.net> > > > wrote: > > > > > > > Hi all, > > > > > > > > just a brief note that I merged this proposal: > > > > > > > > https://github.com/numpy/numpy/pull/17394 > > > > > > > > adding `np.sliding_window_view` into the 1.20 release of NumPy. > > > > > > > > There was only one public API change, and that is that the > > > > `shape` > > > > argument is now called `window_shape`. > > > > > > > > This is still a good time for feedback in case you have a > > > > better > > > > idea > > > > e.g. for the function or parameter names. > > > > > > > > > > The old PR had this in the lib.stride_tricks namespace. Seeing it > > > in the > > > main namespace is unexpected and likely will lead to > > > issues/questions, > > > given that such an overlapping view is going to do behave in ways > > > the > > > average user will be surprised by. 
It may also lead to requests > > > for > > > other > > > array/tensor libraries to implement this. I don't see any > > > discussion on > > > this in PR 17394, it looks like a decision by the PR author that > > > no > > > one > > > commented on - reconsider that? > > > > > > Cheers, > > > Ralf > > > > > > > +1 let's keep this in the lib.stride_tricks namespace. > > > > I have no reservations against having it in the main namespace and am > happy either way (it can still be exposed later in any case). It is > the > conservative choice and maybe it is an uncommon enough function that > it > deserves being a bit hidden... In any case, it's the safe bet for NumPy 1.20 at least so I opened a PR: https://github.com/numpy/numpy/pull/17720 Name changes, etc. are also possible of course. I still think it might be nice to find a better place for this type of function than `np.lib.stride_tricks` though, but dunno... - Sebastian > > But I am curious, it sounds like you have both very strong > reservations, and I would like to understand them better. > > The behaviour can be surprising, but that is why the default is a > read-only view. I do not think it is worse than `np.broadcast_to` in this > regard. (It is nowhere near as dangerous as `as_strided`.) > > It is true that it is specific to NumPy (memory model). So that is > maybe a good enough reason right now. But I am not sure that > stuffing > things into a pretty hidden `np.lib.*` namespaces is a great long > term > solution either. There is very little useful functionality hidden > away > in `np.lib.*` currently. > > Cheers, > > Sebastian > > > > > > > > > > Cheers, > > > > > Sebastian > > > > > > > > > > > > > > > On Mon, 2020-10-12 at 08:39 +0000, Zimmermann Klaus wrote: > > > > > Hello, > > > > > > > > > > I would like to draw the attention of this list to PR #17394 > > > > > [1] that > > > > > adds the implementation of a sliding window view to numpy. > > > > > > > > > > Having a sliding window view in numpy is a longstanding open > > > > > issue > > > > > (cf > > > > > #7753 [2] from 2016). A brief summary of the discussions > > > > > surrounding > > > > > it > > > > > can be found in the description of the PR. > > > > > > > > > > This PR implements a sliding window view based on stride > > > > > tricks. > > > > > Following the discussion in issue #7753, a first > > > > > implementation > > > > > was > > > > > provided by Fanjin Zeng in PR #10771. After some discussion, > > > > > that PR > > > > > stalled and I picked up the issue in the present PR #17394. > > > > > It > > > > > is > > > > > based > > > > > on the first implementation, but follows the changed API as > > > > > suggested > > > > > by > > > > > Eric Wieser. > > > > > > > > > > Code reviews have been provided by Bas van Beek, Stephen > > > > > Hoyer, > > > > > and > > > > > Eric > > > > > Wieser. Sebastian Berg added the "62 - Python API" label. > > > > > > > > > > > > > > > Do you think this is suitable for inclusion in numpy? > > > > > > > > > > Do you consider the PR ready? > > > > > > > > > > Do you have suggestions or requests? > > > > > > > > > > > > > > > Thanks for your time and consideration!
> > > > > Klaus > > > > > > > > > > > > > > > [1] https://github.com/numpy/numpy/pull/17394 > > > > > [2] https://github.com/numpy/numpy/issues/7753 > > > > > _______________________________________________ > > > > > NumPy-Discussion mailing list > > > > > NumPy-Discussion at python.org > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at python.org > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From klaus.zimmermann at smhi.se Fri Nov 6 04:45:42 2020 From: klaus.zimmermann at smhi.se (Zimmermann Klaus) Date: Fri, 6 Nov 2020 09:45:42 +0000 Subject: [Numpy-discussion] Add sliding_window_view method to numpy In-Reply-To: <1cdb0b720f09845d03ccfdc2e171f98d7e925ee3.camel@sipsolutions.net> References: <1cdb0b720f09845d03ccfdc2e171f98d7e925ee3.camel@sipsolutions.net> Message-ID: <32e8736e-55ed-1155-3da5-003d907c4e65@smhi.se> Hi all, I have absolutely no problem keeping this out of the main namespace. In fact I'd like to point out that it was not my idea. Rather, it was proposed by Bas van Beek in the comments [1,2] and received a little more scrutiny from Eric Wieser in [3]. The reason that it didn't receive the scrutiny it probably deserves is that it got a bit mangled up with the array dispatch discussion; sorry for that. On the subject matter, I am also curious about the potential for confusion. What other behavior could one expect from a sliding window view with this shape? As I said, I am completely fine with keeping this out of the main namespace, but I agree with Sebastian's comment, that `np.lib.stride_tricks` is perhaps not the best namespace. The reason from my point of view is that stride tricks is really a technical (and slightly ominous) name that might throw off more application-oriented programmers from finding and using this function. Thinking of my scientist colleagues, I think those are exactly the kind of users that could benefit from such a prototyping tool.
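To make the prototyping use case concrete, here is a rough sketch of what I have in mind - untested here, and assuming the function stays reachable as np.lib.stride_tricks.sliding_window_view, as in the follow-up PR - a 3-point moving average in two lines:

>>> import numpy as np
>>> windows = np.lib.stride_tricks.sliding_window_view(np.arange(6), window_shape=3)
>>> windows
array([[0, 1, 2],
       [1, 2, 3],
       [2, 3, 4],
       [3, 4, 5]])
>>> windows.mean(axis=-1)  # 3-point moving average over the original data
array([1., 2., 3., 4.])

The view itself copies no data; only the reduction at the end allocates the (much smaller) result.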
Cheers Klaus [1] https://github.com/numpy/numpy/pull/17394#issuecomment-700998618 [2] https://github.com/numpy/numpy/pull/17394#discussion_r498215468 [3] https://github.com/numpy/numpy/pull/17394#discussion_r498724340 On 06/11/2020 01:39, Sebastian Berg wrote: > On Thu, 2020-11-05 at 17:35 -0600, Sebastian Berg wrote: >> On Thu, 2020-11-05 at 12:51 -0800, Stephan Hoyer wrote: >>> On Thu, Nov 5, 2020 at 11:16 AM Ralf Gommers < >>> ralf.gommers at gmail.com> >>> wrote: >>> >>>> On Thu, Nov 5, 2020 at 4:56 PM Sebastian Berg < >>>> sebastian at sipsolutions.net> >>>> wrote: >>>> >>>>> Hi all, >>>>> >>>>> just a brief note that I merged this proposal: >>>>> >>>>> https://github.com/numpy/numpy/pull/17394 >>>>> >>>>> adding `np.sliding_window_view` into the 1.20 release of NumPy. >>>>> >>>>> There was only one public API change, and that is that the >>>>> `shape` >>>>> argument is now called `window_shape`. >>>>> >>>>> This is still a good time for feedback in case you have a >>>>> better >>>>> idea >>>>> e.g. for the function or parameter names. >>>>> >>>> >>>> The old PR had this in the lib.stride_tricks namespace. Seeing it >>>> in the >>>> main namespace is unexpected and likely will lead to >>>> issues/questions, >>>> given that such an overlapping view is going to do behave in ways >>>> the >>>> average user will be surprised by. It may also lead to requests >>>> for >>>> other >>>> array/tensor libraries to implement this. I don't see any >>>> discussion on >>>> this in PR 17394, it looks like a decision by the PR author that >>>> no >>>> one >>>> commented on - reconsider that? >>>> >>>> Cheers, >>>> Ralf >>>> >>> >>> +1 let's keep this in the lib.stride_tricks namespace. >>> >> >> I have no reservations against having it in the main namespace and am >> happy either way (it can still be exposed later in any case). It is >> the >> conservative choice and maybe it is an uncommon enough function that >> it >> deserves being a bit hidden... > > > In any case, its the safe bet for NumPy 1.20 at least so I opened a PR: > > https://github.com/numpy/numpy/pull/17720 > > Name changes, etc. are also possible of course. > > I still think it might be nice to find a better place for this type of > function that `np.lib.stride_tricks` though, but dunno... > > - Sebastian > > > >> >> But I am curious, it sounds like you have both very strong >> reservations, and I would like to understand them better. >> >> The behaviour can be surprising, but that is why the default is a >> read- >> only view. I do not think it is worse than `np.broadcast_to` in this >> regard. (It is nowhere near as dangerous as `as_strided`.) >> >> It is true that it is specific to NumPy (memory model). So that is >> maybe a good enough reason right now. But I am not sure that >> stuffing >> things into a pretty hidden `np.lib.*` namespaces is a great long >> term >> solution either. There is very little useful functionality hidden >> away >> in `np.lib.*` currently. >> >> Cheers, >> >> Sebastian >> >>>> >>>> >>>>> Cheers, >>>>> >>>>> Sebastian >>>>> >>>>> >>>>> >>>>> On Mon, 2020-10-12 at 08:39 +0000, Zimmermann Klaus wrote: >>>>>> Hello, >>>>>> >>>>>> I would like to draw the attention of this list to PR #17394 >>>>>> [1] that >>>>>> adds the implementation of a sliding window view to numpy. >>>>>> >>>>>> Having a sliding window view in numpy is a longstanding open >>>>>> issue >>>>>> (cf >>>>>> #7753 [2] from 2016). A brief summary of the discussions >>>>>> surrounding >>>>>> it >>>>>> can be found in the description of the PR. 
>>>>>> >>>>>> This PR implements a sliding window view based on stride >>>>>> tricks. >>>>>> Following the discussion in issue #7753, a first >>>>>> implementation >>>>>> was >>>>>> provided by Fanjin Zeng in PR #10771. After some discussion, >>>>>> that PR >>>>>> stalled and I picked up the issue in the present PR #17394. >>>>>> It >>>>>> is >>>>>> based >>>>>> on the first implementation, but follows the changed API as >>>>>> suggested >>>>>> by >>>>>> Eric Wieser. >>>>>> >>>>>> Code reviews have been provided by Bas van Beek, Stephen >>>>>> Hoyer, >>>>>> and >>>>>> Eric >>>>>> Wieser. Sebastian Berg added the "62 - Python API" label. >>>>>> >>>>>> >>>>>> Do you think this is suitable for inclusion in numpy? >>>>>> >>>>>> Do you consider the PR ready? >>>>>> >>>>>> Do you have suggestions or requests? >>>>>> >>>>>> >>>>>> Thanks for your time and consideration! >>>>>> Klaus >>>>>> >>>>>> >>>>>> [1] https://github.com/numpy/numpy/pull/17394 >>>>>> [2] https://github.com/numpy/numpy/issues/7753 >>>>>> _______________________________________________ >>>>>> NumPy-Discussion mailing list >>>>>> NumPy-Discussion at python.org >>>>>> https://mail.python.org/mailman/listinfo/numpy-discussion >>>>>> >>>>> >>>>> _______________________________________________ >>>>> NumPy-Discussion mailing list >>>>> NumPy-Discussion at python.org >>>>> https://mail.python.org/mailman/listinfo/numpy-discussion >>>>> >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at python.org >>>> https://mail.python.org/mailman/listinfo/numpy-discussion >>>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > From noamraph at gmail.com Fri Nov 6 05:47:46 2020 From: noamraph at gmail.com (Noam Yorav-Raphael) Date: Fri, 6 Nov 2020 12:47:46 +0200 Subject: [Numpy-discussion] datetime64: Remove deprecation warning when constructing with timezone In-Reply-To: References: Message-ID: Hi, I actually arrived at this by first trying to use pandas.Timestamp and getting very frustrated about it. With pandas, I get: >>> pd.Timestamp.now() Timestamp('2020-11-06 09:45:24.249851') I find the whole notion of a "timezone naive timestamp" to be nearly meaningless. A timestamp should mean a moment in time (as the current numpy documentation defines very well). A "naive timestamp" doesn't mean anything. It's exactly like a "unit naive length". I can have a Length type which just takes a number, and be very happy that it works whether my "unit zone" is inches or centimeters. So "Length(3)" will mean 3 cm in most of the world and 3 inches in the US. But then, if I get "Length(3)" from someone, I can't be sure what length it refers to. So currently, this happens with pandas timestamps: >>> os.environ['TZ'] = 'UTC'; time.tzset() ... t0 = pd.Timestamp.now() ... time.sleep(1) ... os.environ['TZ'] = 'EST-5'; time.tzset() ... t1 = pd.Timestamp.now() ... t1 - t0 Timedelta('0 days 05:00:01.001583') This is not just theoretical - I actually need to work with data from several devices, each in its own time zone.
And I need to know that I won't get such meaningless results. And you can even get something like this: >>> t0 = pd.Timestamp.now() ... time.sleep(10) ... t1 = pd.Timestamp.now() ... t1 - t0 Timedelta('0 days 01:00:10.001583') if the first measurement happened to be in winter time and the second measurement happened to be in daylight saving time. The solution is simple, and is what datetime64 used to do before the change - have a type that just represents a moment in time. It's not "in UTC" - it just stores the number of seconds that passed since an agreed moment in time (which in my time zone is 1970-01-01 02:00+0200, and is more commonly written as 1970-01-01 00:00Z - it's the exact same moment). I think it would make things clearer if I mention that there are operations that are not dealing with timestamps. For example, it's meaningless to ask what is the year of a timestamp - it may depend on the time zone. These are always *human*-related questions that depend on certain human conventions. We can call them "calendar questions". For these types of questions, a type that includes both a timestamp and a timezone offset (in minutes from UTC) can be useful. Some questions even require full timezone information, meaning a function that defines what's the timezone offset for each moment. However, I don't think numpy should deal with those calendar issues. As a very simple example, even for "timestamp+offset" types, it's not clear how to compare them - should values with the same timestamp and different offsets be considered equal or not? And in virtually all of my data analysis, this calendar aspect has nothing to do with the questions I'm trying to answer. I have a suggestion. Instead of changing datetime64 (which I consider to be ill-defined, but never mind), add a new type called "timestamp64". It will have the exact same behavior as datetime64 had before the change, except that its only allowed units will be seconds, milliseconds, microseconds and nanoseconds. Removing the longer units will make it clear that it doesn't deal with calendar and dates. Also, all the business day functionality will not be applicable to timestamp64. In order to get calendar information (such as the year) from timestamp64, you will have to manually convert it to python's datetime (or to np.datetime64) with an explicit timezone (utc, local, an offset, or a timezone object). What do you think? Thanks, Noam On Fri, Nov 6, 2020 at 1:45 AM Stephan Hoyer wrote: > I can try to dig up the old discussions, but datetime64 used to implement > both (1) and (3), and this was updated in a very intentional way. > Datetime64 now works like Python's own time-zone naive datetime.datetime > objects. The documentation referencing "Z" should be updated -- datetime64 > can be in any timezone you like. > > Timezone aware datetime objects are certainly useful, but NumPy's > datetime64 was restricted to UTC. The consensus was that it was worse to > have UTC-only rather than timezone-naive-only. NumPy's datetime64 is often > used for data analysis purposes, for which automatic conversion to the > local timezone of the computer running the analysis is often > counter-productive. > > If you care about timezone conversions, I would highly recommend looking > into pandas's Timestamp class for this purpose. In the future, this would > be a good use-case for a new custom NumPy dtype. (The existing > np.datetime64 code cannot easily handle multiple timezones.)
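(To spell out the manual conversion I mentioned above: it is only a couple of lines with the standard library - a rough, untested sketch, with numpy imported as np, and the result described for my +0200 zone:

>>> import numpy as np
>>> from datetime import timezone
>>> ts = np.datetime64('2020-11-05T14:00')
>>> ts.item().replace(tzinfo=timezone.utc).astimezone()

The last line attaches UTC to the stored moment and only then renders it in the local zone, which for me gives 2020-11-05 16:00+02:00 - the exact same moment.)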
> > On Thu, Nov 5, 2020 at 1:04 PM Eric Wieser > wrote: > >> Without weighing in yet on how I feel about the deprecation, you can see >> some discussion about why this was originally deprecated in the PR that >> introduced the warning: >> >> https://github.com/numpy/numpy/pull/6453 >> >> Eric >> >> On Thu, Nov 5, 2020, 20:13 Noam Yorav-Raphael wrote: >> >>> Hi, >>> >>> I suggest removing the deprecation warning when constructing a >>> datetime64 with a timezone. For example, this is the current behavior: >>> >>> >>> np.datetime64('2020-11-05 16:00+0200') >>> :1: DeprecationWarning: parsing timezone aware datetimes is >>> deprecated; this will raise an error in the future >>> numpy.datetime64('2020-11-05T14:00') >>> >>> I suggest removing the deprecation warning because I find this to be a >>> useful behavior, and because it is a correct behavior. The manual says: >>> "The datetime object represents a single moment in time... Datetimes are >>> always stored based on POSIX time, with an epoch of 1970-01-01T00:00Z." >>> So 2020-11-05T16:00+0200 is indeed the moment in time represented by >>> np.datetime64('2020-11-05T14:00'). >>> >>> I just used this to restrict my data set to records created after a >>> certain moment. It was easier for me to write the moment in my local time >>> and add "+0200" than to figure out the moment representation in UTC. >>> >>> So this is my simple suggestion: remove the deprecation warning. >>> >>> >>> Beyond that, I have 3 ideas for changing the repr of datetime64 that I >>> would like to discuss. >>> >>> 1. Add "Z" at the end, for example, >>> numpy.datetime64('2020-11-05T14:00Z'). This will make it clear to which >>> moment it refers. I think this is significant - I had to dig quite a bit to >>> realize that datetime64('2020-11-05T14:00') means 14:00 UTC. >>> >>> 2. Replace the 'T' with a space. I just find it much easier to read >>> '2020-11-05 14:00Z' than '2020-11-05T14:00Z'. The long sequence of >>> characters makes it hard for my brain to parse. >>> >>> 3. This will require discussion, but will be very convenient: have the >>> repr display the time using the environment time zone, including a time >>> offset. So, in my specific time zone (+0200), I will have: >>> >>> repr(np.datetime64('2020-11-05 14:00Z')) == >>> "numpy.datetime64('2020-11-05T16:00+0200')" >>> >>> I'm sure the pros and cons of having an environment-dependent repr >>> should be discussed. But I will list some pros: >>> 1. It's very convenient - it's immediately obvious to me to which moment >>> 2020-11-05 16:00+0200 refers. >>> 2. It's well defined - I may collect timestamps from machines with >>> different time zones, and I will be able to know to which exact moment each >>> timestamp refers. >>> 3. It's very simple - I could compare any two timestamps, I don't have >>> to worry about time zones. >>> >>> I would be happy to hear your thoughts. >>> >>> Thanks, >>> Noam >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >>> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ralf.gommers at gmail.com Fri Nov 6 09:58:23 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 6 Nov 2020 14:58:23 +0000 Subject: [Numpy-discussion] Add sliding_window_view method to numpy In-Reply-To: <32e8736e-55ed-1155-3da5-003d907c4e65@smhi.se> References: <1cdb0b720f09845d03ccfdc2e171f98d7e925ee3.camel@sipsolutions.net> <32e8736e-55ed-1155-3da5-003d907c4e65@smhi.se> Message-ID: On Fri, Nov 6, 2020 at 9:51 AM Zimmermann Klaus wrote: > Hi all, > > > I have absolutely no problem keeping this out of the main namespace. > > In fact I'd like to point out that it was not my idea. Rather, it was > proposed by Bas van Beek in the comments [1,2] and received a little > more scrutiny from Eric Wieser in [3]. > Thanks, between two PRs with that many comments, I couldn't figure that out - just saw the commit that made the change. > The reason that it didn't receive the scrutiny it probably deserves is > that it got a bit mangled up with the array dispatch discussion; sorry > for that. > No worries at all. This is why we announce new features on the mailing list. > On the subject matter, I am also curious about the potential for > confusion. What other behavior could one expect from a sliding window > view with this shape? > > As I said, I am completely fine with keeping this out of the main > namespace, but I agree with Sebastian's comment, that > `np.lib.stride_tricks` is perhaps not the best namespace. I agree that that's not a great namespace. There's multiple issues with namespaces, we basically have three good ones (fft, linalg, random) and a bunch of other ones that range from questionable to terrible. See https://github.com/numpy/numpy/blob/master/numpy/tests/test_public_api.py#L127 for details. This would be a good thing to work on - making the `numpy.lib` namespace not bleed into `numpy` via `import *` is one thing to do there, and there's many others. But given backwards compat constraints it's not easy. > The reason > from my point of view is that stride tricks is really a technical (and > slightly ominous) name that might throw of more application oriented > programmers from finding and using this function. Thinking of my > scientist colleagues, I think those are exactly the kind of users that > could benefit from such a prototyping tool. > That phrasing is one of a number of concerns. NumPy is normally not in the business of providing things that are okay as a prototyping tool, but are potentially extremely slow (as pointed out in the Notes section of the docstring). A function like that would basically not be the right tool for almost anything in, e.g., SciPy - it requires an iterative algorithm. In NumPy we don't prefer performance at all costs, but in general it's pretty decent rather than "Numba or Cython may gain you 100x here". Other issues include: 2) It is very specific to NumPy's memory model (as pointed out by you and Sebastian) - just like the rest of stride_tricks 3) It has "view" in the name, which doesn't quite make sense for the main namespace (also connected to point 2 above). 4) The cost of putting something in the main namespace for other array/tensor libraries is large. Many other libraries, e.g. CuPy, Dask, TensorFlow, PyTorch, JAX, MXNet, aim to reimplement part or all of the main NumPy namespace as well as possible. This would trigger discussions and likely many person-weeks of work for others. 5) It's a useful function, but it's very much on the margins of NumPy's scope. It could easily have gone into, for example, scipy.signal.
At this point the bar for functions going into the main namespace should be (and is) high. All this taken together means it's not even a toss-up for me. If it were just one or two of these points, maybe. But given all the above, I'm pretty confident saying "it does not belong in the main namespace". Cheers, Ralf > > Cheers > Klaus > > > > [1] https://github.com/numpy/numpy/pull/17394#issuecomment-700998618 > [2] https://github.com/numpy/numpy/pull/17394#discussion_r498215468 > [3] https://github.com/numpy/numpy/pull/17394#discussion_r498724340 > > On 06/11/2020 01:39, Sebastian Berg wrote: > > On Thu, 2020-11-05 at 17:35 -0600, Sebastian Berg wrote: > >> On Thu, 2020-11-05 at 12:51 -0800, Stephan Hoyer wrote: > >>> On Thu, Nov 5, 2020 at 11:16 AM Ralf Gommers < > >>> ralf.gommers at gmail.com> > >>> wrote: > >>> > >>>> On Thu, Nov 5, 2020 at 4:56 PM Sebastian Berg < > >>>> sebastian at sipsolutions.net> > >>>> wrote: > >>>> > >>>>> Hi all, > >>>>> > >>>>> just a brief note that I merged this proposal: > >>>>> > >>>>> https://github.com/numpy/numpy/pull/17394 > >>>>> > >>>>> adding `np.sliding_window_view` into the 1.20 release of NumPy. > >>>>> > >>>>> There was only one public API change, and that is that the > >>>>> `shape` > >>>>> argument is now called `window_shape`. > >>>>> > >>>>> This is still a good time for feedback in case you have a > >>>>> better > >>>>> idea > >>>>> e.g. for the function or parameter names. > >>>>> > >>>> > >>>> The old PR had this in the lib.stride_tricks namespace. Seeing it > >>>> in the > >>>> main namespace is unexpected and likely will lead to > >>>> issues/questions, > >>>> given that such an overlapping view is going to do behave in ways > >>>> the > >>>> average user will be surprised by. It may also lead to requests > >>>> for > >>>> other > >>>> array/tensor libraries to implement this. I don't see any > >>>> discussion on > >>>> this in PR 17394, it looks like a decision by the PR author that > >>>> no > >>>> one > >>>> commented on - reconsider that? > >>>> > >>>> Cheers, > >>>> Ralf > >>>> > >>> > >>> +1 let's keep this in the lib.stride_tricks namespace. > >>> > >> > >> I have no reservations against having it in the main namespace and am > >> happy either way (it can still be exposed later in any case). It is > >> the > >> conservative choice and maybe it is an uncommon enough function that > >> it > >> deserves being a bit hidden... > > > > > > In any case, its the safe bet for NumPy 1.20 at least so I opened a PR: > > > > https://github.com/numpy/numpy/pull/17720 > > > > Name changes, etc. are also possible of course. > > > > I still think it might be nice to find a better place for this type of > > function that `np.lib.stride_tricks` though, but dunno... > > > > - Sebastian > > > > > > > >> > >> But I am curious, it sounds like you have both very strong > >> reservations, and I would like to understand them better. > >> > >> The behaviour can be surprising, but that is why the default is a > >> read- > >> only view. I do not think it is worse than `np.broadcast_to` in this > >> regard. (It is nowhere near as dangerous as `as_strided`.) > >> > >> It is true that it is specific to NumPy (memory model). So that is > >> maybe a good enough reason right now. But I am not sure that > >> stuffing > >> things into a pretty hidden `np.lib.*` namespaces is a great long > >> term > >> solution either. There is very little useful functionality hidden > >> away > >> in `np.lib.*` currently. 
> >> > >> Cheers, > >> > >> Sebastian > >> > >>>> > >>>> > >>>>> Cheers, > >>>>> > >>>>> Sebastian > >>>>> > >>>>> > >>>>> > >>>>> On Mon, 2020-10-12 at 08:39 +0000, Zimmermann Klaus wrote: > >>>>>> Hello, > >>>>>> > >>>>>> I would like to draw the attention of this list to PR #17394 > >>>>>> [1] that > >>>>>> adds the implementation of a sliding window view to numpy. > >>>>>> > >>>>>> Having a sliding window view in numpy is a longstanding open > >>>>>> issue > >>>>>> (cf > >>>>>> #7753 [2] from 2016). A brief summary of the discussions > >>>>>> surrounding > >>>>>> it > >>>>>> can be found in the description of the PR. > >>>>>> > >>>>>> This PR implements a sliding window view based on stride > >>>>>> tricks. > >>>>>> Following the discussion in issue #7753, a first > >>>>>> implementation > >>>>>> was > >>>>>> provided by Fanjin Zeng in PR #10771. After some discussion, > >>>>>> that PR > >>>>>> stalled and I picked up the issue in the present PR #17394. > >>>>>> It > >>>>>> is > >>>>>> based > >>>>>> on the first implementation, but follows the changed API as > >>>>>> suggested > >>>>>> by > >>>>>> Eric Wieser. > >>>>>> > >>>>>> Code reviews have been provided by Bas van Beek, Stephen > >>>>>> Hoyer, > >>>>>> and > >>>>>> Eric > >>>>>> Wieser. Sebastian Berg added the "62 - Python API" label. > >>>>>> > >>>>>> > >>>>>> Do you think this is suitable for inclusion in numpy? > >>>>>> > >>>>>> Do you consider the PR ready? > >>>>>> > >>>>>> Do you have suggestions or requests? > >>>>>> > >>>>>> > >>>>>> Thanks for your time and consideration! > >>>>>> Klaus > >>>>>> > >>>>>> > >>>>>> [1] https://github.com/numpy/numpy/pull/17394 > >>>>>> [2] https://github.com/numpy/numpy/issues/7753 > >>>>>> _______________________________________________ > >>>>>> NumPy-Discussion mailing list > >>>>>> NumPy-Discussion at python.org > >>>>>> https://mail.python.org/mailman/listinfo/numpy-discussion > >>>>>> > >>>>> > >>>>> _______________________________________________ > >>>>> NumPy-Discussion mailing list > >>>>> NumPy-Discussion at python.org > >>>>> https://mail.python.org/mailman/listinfo/numpy-discussion > >>>>> > >>>> _______________________________________________ > >>>> NumPy-Discussion mailing list > >>>> NumPy-Discussion at python.org > >>>> https://mail.python.org/mailman/listinfo/numpy-discussion > >>>> > >>> > >>> _______________________________________________ > >>> NumPy-Discussion mailing list > >>> NumPy-Discussion at python.org > >>> https://mail.python.org/mailman/listinfo/numpy-discussion > >> > >> _______________________________________________ > >> NumPy-Discussion mailing list > >> NumPy-Discussion at python.org > >> https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jbrockmendel at gmail.com Fri Nov 6 10:57:41 2020 From: jbrockmendel at gmail.com (Brock Mendel) Date: Fri, 6 Nov 2020 07:57:41 -0800 Subject: [Numpy-discussion] datetime64: Remove deprecation warning when constructing with timezone In-Reply-To: References: Message-ID: > I find the whole notion of a "timezone naive timestamp" to be nearly meaningless >From the perspective of, say, the dateutil parser, what would you do with "2020-11-06 07:48"? If you assume it's UTC you'll be wrong in this case. If you assume it is in your local timezone, you'll be wrong in Europe. Timezone-naive datetimes are an abstraction for exactly this case. >>> t0 = pd.Timestamp.now() You can use `pd.Timestamp.now("UTC")`. See also https://mail.python.org/archives/list/datetime-sig at python.org/thread/PT4JWJLYBE5R2QASVBPZLHH37ULJQR43/ , https://github.com/pandas-dev/pandas/issues/22451 On Fri, Nov 6, 2020 at 2:48 AM Noam Yorav-Raphael wrote: > Hi, > > I actually arrived at this by first trying to use pandas.Timestamp and > getting very frustrated about it. With pandas, I get: > > >>> pd.Timestamp.now() > Timestamp('2020-11-06 09:45:24.249851') > > I find the whole notion of a "timezone naive timestamp" to be nearly > meaningless. A timestamp should mean a moment in time (as the current numpy > documentation defines very well). A "naive timestamp" doesn't mean > anything. It's exactly like a "unit naive length". I can have a Length type > which just takes a number, and be very happy that it works both if my "unit > zone" is inches or centimeters. So "Length(3)" will mean 3 cm in most of > the world and 3 inches in the US. But then, if I get "Length(3)" from > someone, I can't be sure what length it refers to. > > So currently, this happens with pandas timestamps: > > >>> os.environ['TZ'] = 'UTC'; time.tzset() > ... t0 = pd.Timestamp.now() > ... time.sleep(1) > ... os.environ['TZ'] = 'EST-5'; time.tzset() > ... t1 = pd.Timestamp.now() > ... t1 - t0 > Timedelta('0 days 05:00:01.001583') > > This is not just theoretical - I actually need to work with data from > several devices, each in its own time zone. And I need to know that I won't > get such meaningless results. > > And you can even get something like this: > > >>> t0 = pd.Timestamp.now() > ... time.sleep(10) > ... t1 = pd.Timestamp.now() > ... t1 - t0 > Timedelta('0 days 01:00:10.001583') > > if the first measurement happened to be in winter time and the second > measurement happened to be in daylight saving time. > > The solution is simple, and is what datetime64 used to do before the > change - have a type that just represents a moment in time. It's not "in > UTC" - it just stores the number of seconds that passed since an agreed > moment in time (which is usually 1970-01-01 02:00+0200, which is more > commonly referred to as 1970-01-01 00:00Z - it's the exact same moment). > > I think it would make things clearer if I'll mention that there are > operations that are not dealing with timestamps. For example, it's > meaningless to ask what is the year of a timestamp - it may depend on the > time zone. These are always *human* related questions, that depend on > certain human conventions. We can call them "calendar questions". For these > types of questions, a type that includes both a timestamp and a timezone > offset (in minutes from UTC) can be useful. Some questions even require > full timezone information, meaning a function that defines what's the > timezone offset for each moment. 
However, I don't think numpy should deal > with those calendar issues. As a very simple example, even for > "timestamp+offset" types, it's not clear how to compare them - should > values with the same timestamp and different offsets be considered equal or > not? And in virtually all of my data analysis, this calendar aspect has > nothing to do with the questions I'm trying to answer. > > I have a suggestion. Instead of changing datetime64 (which I consider to > be ill-defined, but never mind), add a new type called "timestamp64". It > will have the exact same behavior as datetime64 had before the change, > except that its only allowed units will be seconds, milliseconds, > microseconds and nanoseconds. Removing the longer units will make it clear > that it doesn't deal with calendar and dates. Also, all the business day > functionality will not be applicable to timestamp64. In order to get > calendar information (such as the year) from timestamp64, you will have to > manually convert it to python's datetime (or to np.datetime64) with an > explicit timezone (utc, local, an offset, or a timezone object). > > What do you think? > > Thanks, > Noam > > > > > > On Fri, Nov 6, 2020 at 1:45 AM Stephan Hoyer wrote: > >> I can try to dig up the old discussions, but datetime64 used to implement >> both (1) and (3), and this was updated in a very intentional way. >> Datetime64 now works like Python's own time-zone naive datetime.datetime >> objects. The documentation referencing "Z" should be updated -- datetime64 >> can be in any timezone you like. >> >> Timezone aware datetime objects are certainly useful, but NumPy's >> datetime64 was restricted to UTC. The consensus was that it was worse to >> have UTC-only rather than timezone-naive-only. NumPy's datetime64 is often >> used for data analysis purposes, for which automatic conversion to the >> local timezone of the computer running the analysis is often >> counter-productive. >> >> If you care about timezone conversions, I would highly recommend looking >> into pandas's Timestamp class for this purpose. In the future, this would >> be a good use-case for a new custom NumPy dtype. (The existing >> np.datetime64 code cannot easily handle multiple timezones.) >> >> On Thu, Nov 5, 2020 at 1:04 PM Eric Wieser >> wrote: >> >>> Without weighing in yet on how I feel about the deprecation, you can see >>> some discussion about why this was originally deprecated in the PR that >>> introduced the warning: >>> >>> https://github.com/numpy/numpy/pull/6453 >>> >>> Eric >>> >>> On Thu, Nov 5, 2020, 20:13 Noam Yorav-Raphael >>> wrote: >>> >>>> Hi, >>>> >>>> I suggest removing the deprecation warning when constructing a >>>> datetime64 with a timezone. For example, this is the current behavior: >>>> >>>> >>> np.datetime64('2020-11-05 16:00+0200') >>>> :1: DeprecationWarning: parsing timezone aware datetimes is >>>> deprecated; this will raise an error in the future >>>> numpy.datetime64('2020-11-05T14:00') >>>> >>>> I suggest removing the deprecation warning because I find this to be a >>>> useful behavior, and because it is a correct behavior. The manual says: >>>> "The datetime object represents a single moment in time... Datetimes are >>>> always stored based on POSIX time, with an epoch of 1970-01-01T00:00Z." >>>> So 2020-11-05T16:00+0200 is indeed the moment in time represented by >>>> np.datetime64('2020-11-05T14:00'). >>>> >>>> I just used this to restrict my data set to records created after a >>>> certain moment. 
It was easier for me to write the moment in my local time >>>> and add "+0200" than to figure out the moment representation in UTC. >>>> >>>> So this is my simple suggestion: remove the deprecation warning. >>>> >>>> >>>> Beyond that, I have 3 ideas for changing the repr of datetime64 that I >>>> would like to discuss. >>>> >>>> 1. Add "Z" at the end, for example, >>>> numpy.datetime64('2020-11-05T14:00Z'). This will make it clear to which >>>> moment it refers. I think this is significant - I had to dig quite a bit to >>>> realize that datetime64('2020-11-05T14:00') means 14:00 UTC. >>>> >>>> 2. Replace the 'T' with a space. I just find it much easier to read >>>> '2020-11-05 14:00Z' than '2020-11-05T14:00Z'. The long sequence of >>>> characters makes it hard for my brain to parse. >>>> >>>> 3. This will require discussion, but will be very convenient: have the >>>> repr display the time using the environment time zone, including a time >>>> offset. So, in my specific time zone (+0200), I will have: >>>> >>>> repr(np.datetime64('2020-11-05 14:00Z')) == >>>> "numpy.datetime64('2020-11-05T16:00+0200')" >>>> >>>> I'm sure the pros and cons of having an environment-dependent repr >>>> should be discussed. But I will list some pros: >>>> 1. It's very convenient - it's immediately obvious to me to which >>>> moment 2020-11-05 16:00+0200 refers. >>>> 2. It's well defined - I may collect timestamps from machines with >>>> different time zones, and I will be able to know to which exact moment each >>>> timestamp refers. >>>> 3. It's very simple - I could compare any two timestamps, I don't have >>>> to worry about time zones. >>>> >>>> I would be happy to hear your thoughts. >>>> >>>> Thanks, >>>> Noam >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at python.org >>>> https://mail.python.org/mailman/listinfo/numpy-discussion >>>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >>> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From klaus.zimmermann at smhi.se Fri Nov 6 11:03:00 2020 From: klaus.zimmermann at smhi.se (Zimmermann Klaus) Date: Fri, 6 Nov 2020 16:03:00 +0000 Subject: [Numpy-discussion] Add sliding_window_view method to numpy In-Reply-To: References: <1cdb0b720f09845d03ccfdc2e171f98d7e925ee3.camel@sipsolutions.net> <32e8736e-55ed-1155-3da5-003d907c4e65@smhi.se> Message-ID: Hi, On 06/11/2020 15:58, Ralf Gommers wrote: > On Fri, Nov 6, 2020 at 9:51 AM Zimmermann Klaus > > wrote: > I have absolutely no problem keeping this out of the main namespace. > > In fact I'd like to point out that it was not my idea. Rather, it was > proposed by Bas van Beek in the comments [1,2] and received a little > more scrutiny from Eric Wieser in [3]. > > Thanks, between two PRs with that many comments, I couldn't figure that > out - just saw the commit that make the change. Understandable, no worries. > On the subject matter, I am also curious about the potential for > confusion. 
What other behavior could one expect from a sliding window > view with this shape? > > As I said, I am completely fine with keeping this out of the main > namespace, but I agree with Sebastian's comment, that > `np.lib.stride_tricks` is perhaps not the best namespace. > > > I agree that that's not a great namespace. There's multiple issues with > namespaces, we basically have three good ones (fft, linalg, random) and > a bunch of other ones that range from questionable to terrible. See > https://github.com/numpy/numpy/blob/master/numpy/tests/test_public_api.py#L127 > > for details. > > This would be a good thing to work on - making the `numpy.lib` namespace > not bleed into `numpy` via `import *` is one thing to do there, and > there's many others. But given backwards compat constraints it's not easy. I understand cleaning up all the namespaces is a giant task, so far, far out of scope here. As said before, I also completely agree to keep it out of the main namespace (though I will still argue below :P). I was just wondering if, off the top of your head, an existing, better fit comes to mind? > The reason > from my point of view is that stride tricks is really a technical (and > slightly ominous) name that might throw of more application oriented > programmers from finding and using this function. Thinking of my > scientist colleagues, I think those are exactly the kind of users that > could benefit from such a prototyping tool. > > > That phrasing is one of a number of concerns. NumPy is normally not in > the business of providing things that are okay as a prototyping tool, > but are potentially extremely slow (as pointed out in the Notes section > of the docstring). A function like that would basically not be the right > tool for almost anything in, e.g., SciPy - it requires an iterative > algorithm. In NumPy we don't prefer performance at all costs, but in > general it's pretty decent rather than "Numba or Cython may gain you > 100x here". I still think that the performance concern is a bit overblown. Yes, applications with large windows can need more FLOPs by an equally large factor. But most such applications will use small to moderate windows. Furthermore, this view focuses only on FLOPs. In my current field of climate science (and many others), that is almost never the limiting factor. Memory demands are far more problematic and, incidentally, those are more likely to increase in other methods that require the storage of ancillary, temporary data. > Other issues include: > 2) It is very specific to NumPy's memory model (as pointed out by you > and Sebastian) - just like the rest of stride_tricks Not wrong, but on the other hand, that memory model is not exotic. C, Fortran, and any number of other languages play very nicely with this, as do important downstream libraries like dask. > 3) It has "view" in the name, which doesn't quite make sense for the > main namespace (also connected to point 2 above). Ok. > 4) The cost of putting something in the main namespace for other > array/tensor libraries is large. Maybe other libraries, e.g. CuPy, Dask, > TensorFlow, PyTorch, JAX, MXNet, aim to reimplement part or all of the > main NumPy namespace as well as possible. This would trigger discussions > and likely many person-weeks of work for others. Agreed. Though I have to say that my whole motivation comes from corresponding issues in dask that were specifically waiting for (the older version of) this PR (see [1, 2,...]).
But I understand that dask is effectively much closer to the numpy memory model than, say, CuPy, so don't take this to mean it should be in the main namespace. > 5) It's a useful function, but it's very much on the margins of NumPy's > scope. It could easily have gone into, for example, scipy.signal. At > this point the bar for functions going into the main namespace should be> (and is) high. I agree that the bar for the main namespace should be high! > All this taken together means it's not even a toss-up for me. If it were > just one or two of these points, maybe. But given all the above, I'm > pretty confident saying "it does not belong in the main namespace". Again, I am happy with that. Thanks for your thoughts and work! I really appreciate it! Cheers Klaus [1] https://github.com/dask/dask/issues/4659 [2] https://github.com/pydata/xarray/issues/3608 [3] https://github.com/pandas-dev/pandas/issues/26959 > > > Cheers > Klaus > > > > [1] https://github.com/numpy/numpy/pull/17394#issuecomment-700998618 > > [2] https://github.com/numpy/numpy/pull/17394#discussion_r498215468 > > [3] https://github.com/numpy/numpy/pull/17394#discussion_r498724340 > > > On 06/11/2020 01:39, Sebastian Berg wrote: > > On Thu, 2020-11-05 at 17:35 -0600, Sebastian Berg wrote: > >> On Thu, 2020-11-05 at 12:51 -0800, Stephan Hoyer wrote: > >>> On Thu, Nov 5, 2020 at 11:16 AM Ralf Gommers < > >>> ralf.gommers at gmail.com > > >>> wrote: > >>> > >>>> On Thu, Nov 5, 2020 at 4:56 PM Sebastian Berg < > >>>> sebastian at sipsolutions.net > > >>>> wrote: > >>>> > >>>>> Hi all, > >>>>> > >>>>> just a brief note that I merged this proposal: > >>>>> > >>>>>? ? ?https://github.com/numpy/numpy/pull/17394 > > >>>>> > >>>>> adding `np.sliding_window_view` into the 1.20 release of NumPy. > >>>>> > >>>>> There was only one public API change, and that is that the > >>>>> `shape` > >>>>> argument is now called `window_shape`. > >>>>> > >>>>> This is still a good time for feedback in case you have a > >>>>> better > >>>>> idea > >>>>> e.g. for the function or parameter names. > >>>>> > >>>> > >>>> The old PR had this in the lib.stride_tricks namespace. Seeing it > >>>> in the > >>>> main namespace is unexpected and likely will lead to > >>>> issues/questions, > >>>> given that such an overlapping view is going to do behave in ways > >>>> the > >>>> average user will be surprised by. It may also lead to requests > >>>> for > >>>> other > >>>> array/tensor libraries to implement this. I don't see any > >>>> discussion on > >>>> this in PR 17394, it looks like a decision by the PR author that > >>>> no > >>>> one > >>>> commented on - reconsider that? > >>>> > >>>> Cheers, > >>>> Ralf > >>>> > >>> > >>> +1 let's keep this in the lib.stride_tricks namespace. > >>> > >> > >> I have no reservations against having it in the main namespace and am > >> happy either way (it can still be exposed later in any case). It is > >> the > >> conservative choice and maybe it is an uncommon enough function that > >> it > >> deserves being a bit hidden... > > > > > > In any case, its the safe bet for NumPy 1.20 at least so I opened > a PR: > > > >? ? ?https://github.com/numpy/numpy/pull/17720 > > > > > Name changes, etc. are also possible of course. > > > > I still think it might be nice to find a better place for this type of > > function that `np.lib.stride_tricks` though, but dunno... > > > > - Sebastian > > > > > > > >> > >> But I am curious, it sounds like you have both very strong > >> reservations, and I would like to understand them better. 
> >> The behaviour can be surprising, but that is why the default is a
> >> read-only view. I do not think it is worse than `np.broadcast_to` in this
> >> regard. (It is nowhere near as dangerous as `as_strided`.)
> >>
> >> It is true that it is specific to NumPy (memory model). So that is
> >> maybe a good enough reason right now. But I am not sure that stuffing
> >> things into a pretty hidden `np.lib.*` namespace is a great long term
> >> solution either. There is very little useful functionality hidden away
> >> in `np.lib.*` currently.
> >>
> >> Cheers,
> >>
> >> Sebastian
> >>
> >>>>> Cheers,
> >>>>>
> >>>>> Sebastian
> >>>>>
> >>>>> On Mon, 2020-10-12 at 08:39 +0000, Zimmermann Klaus wrote:
> >>>>>> Hello,
> >>>>>>
> >>>>>> I would like to draw the attention of this list to PR #17394 [1] that
> >>>>>> adds the implementation of a sliding window view to numpy.
> >>>>>>
> >>>>>> Having a sliding window view in numpy is a longstanding open issue (cf
> >>>>>> #7753 [2] from 2016). A brief summary of the discussions surrounding it
> >>>>>> can be found in the description of the PR.
> >>>>>>
> >>>>>> This PR implements a sliding window view based on stride tricks.
> >>>>>> Following the discussion in issue #7753, a first implementation was
> >>>>>> provided by Fanjin Zeng in PR #10771. After some discussion, that PR
> >>>>>> stalled and I picked up the issue in the present PR #17394. It is based
> >>>>>> on the first implementation, but follows the changed API as suggested by
> >>>>>> Eric Wieser.
> >>>>>>
> >>>>>> Code reviews have been provided by Bas van Beek, Stephen Hoyer, and Eric
> >>>>>> Wieser. Sebastian Berg added the "62 - Python API" label.
> >>>>>>
> >>>>>> Do you think this is suitable for inclusion in numpy?
> >>>>>>
> >>>>>> Do you consider the PR ready?
> >>>>>>
> >>>>>> Do you have suggestions or requests?
> >>>>>>
> >>>>>> Thanks for your time and consideration!
>>>>>> Klaus
>>>>>>
>>>>>> [1] https://github.com/numpy/numpy/pull/17394
>>>>>> [2] https://github.com/numpy/numpy/issues/7753

From melissawm at gmail.com Sat Nov 7 07:48:39 2020
From: melissawm at gmail.com (Melissa Mendonça)
Date: Sat, 7 Nov 2020 09:48:39 -0300
Subject: [Numpy-discussion] Documentation Team meeting - Monday November 9
In-Reply-To: References: Message-ID:

Hi all!

This is a reminder that our next Documentation Team meeting will be on *Monday, November 9* at 3PM UTC** (PLEASE MIND THE RECENT TIME CHANGES AND SEE IF THEY APPLY TO YOUR AREA)

If you wish to join on Zoom, **you need to use this NEW link**

https://zoom.us/j/96219574921?pwd=VTRNeGwwOUlrYVNYSENpVVBRRjlkZz09

Here's the permanent hackmd document with the meeting notes (still being updated in the next few days!):

https://hackmd.io/oB_boakvRqKR-_2jRV-Qjg

Hope to see you around!

** You can click this link to get the correct time in your timezone: https://www.timeanddate.com/worldclock/fixedtime.html?msg=NumPy+Documentation+Team+Meeting&iso=20201109T15&p1=1440&ah=1

*** You can add the NumPy community calendar to your google calendar by clicking this link: https://calendar.google.com/calendar/r?cid=YmVya2VsZXkuZWR1X2lla2dwaWdtMjMyamJobGRzZmIyYzJqODFjQGdyb3VwLmNhbGVuZGFyLmdvb2dsZS5jb20

- Melissa
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From noamraph at gmail.com Sat Nov 7 15:22:20 2020
From: noamraph at gmail.com (Noam Yorav-Raphael)
Date: Sat, 7 Nov 2020 22:22:20 +0200
Subject: [Numpy-discussion] datetime64: Remove deprecation warning when constructing with timezone
In-Reply-To: References: Message-ID:

On Fri, Nov 6, 2020 at 5:58 PM Brock Mendel wrote:
>
> > I find the whole notion of a "timezone naive timestamp" to be nearly meaningless
>
> From the perspective of, say, the dateutil parser, what would you do with
> "2020-11-06 07:48"? If you assume it's UTC you'll be wrong in this case.
> If you assume it is in your local timezone, you'll be wrong in Europe.
> Timezone-naive datetimes are an abstraction for exactly this case.

I'm not sure what you mean by "the perspective of the dateutil parser". Indeed, "2020-11-06 07:48" is not a well-defined timestamp, since it doesn't define a specific moment in time. If you ask what a timestamp type should do when constructed from such a string, I can think of two reasonable alternatives. One is to just not allow it, and perhaps provide a .from_local() method which makes it explicit. The other is to allow it, and make it clear that when an offset is not defined, the environment's timezone is used to convert the string to a timestamp. I wouldn't use the third alternative, which is to parse it as UTC, since it adds little convenience - it's easy to append a "Z" to the string.

> >>> t0 = pd.Timestamp.now()
>
> You can use `pd.Timestamp.now("UTC")`. See also
> https://mail.python.org/archives/list/datetime-sig at python.org/thread/PT4JWJLYBE5R2QASVBPZLHH37ULJQR43/ ,
> https://github.com/pandas-dev/pandas/issues/22451

Thanks for pointing this out. However, this doesn't work:

>>> pd.Timestamp.fromtimestamp(time.time(), 'UTC')
Traceback (most recent call last):
...
TypeError: fromtimestamp() takes exactly 2 positional arguments (3 given)

Also, this doesn't work:

>>> t0 = pd.Timestamp.now('UTC')
... t1 = pd.Timestamp.now('Asia/Jerusalem')
... t1 - t0
Traceback (most recent call last):
...
TypeError: Timestamp subtraction must have the same timezones or no timezones

Also, this doesn't do what it probably should:

>>> pd.Timestamp.now('UTC'), pd.Timestamp.now().tz_localize('UTC')
(Timestamp('2020-11-07 20:18:38.719603+0000', tz='UTC'), Timestamp('2020-11-08 01:18:38.719701+0000', tz='UTC'))

(I have no idea how the second result was calculated, but it's wrong. It should have been equal to the first.)

So, pd.Timestamp is crap. I think that adding np.timestamp64 may finally bring a sane timestamp type to Python.

Thanks,
Noam

> On Fri, Nov 6, 2020 at 2:48 AM Noam Yorav-Raphael wrote:
>>
>> Hi,
>>
>> I actually arrived at this by first trying to use pandas.Timestamp and getting very frustrated by it. With pandas, I get:
>>
>> >>> pd.Timestamp.now()
>> Timestamp('2020-11-06 09:45:24.249851')
>>
>> I find the whole notion of a "timezone naive timestamp" to be nearly meaningless. A timestamp should mean a moment in time (as the current numpy documentation defines very well). A "naive timestamp" doesn't mean anything. It's exactly like a "unit naive length". I can have a Length type which just takes a number, and be very happy that it works both if my "unit zone" is inches or centimeters. So "Length(3)" will mean 3 cm in most of the world and 3 inches in the US. But then, if I get "Length(3)" from someone, I can't be sure what length it refers to.
>>
>> So currently, this happens with pandas timestamps:
>>
>> >>> os.environ['TZ'] = 'UTC'; time.tzset()
>> ... t0 = pd.Timestamp.now()
>> ... time.sleep(1)
>> ... os.environ['TZ'] = 'EST-5'; time.tzset()
>> ... t1 = pd.Timestamp.now()
>> ... t1 - t0
>> Timedelta('0 days 05:00:01.001583')
>>
>> This is not just theoretical - I actually need to work with data from
>> several devices, each in its own time zone. And I need to know that I won't
>> get such meaningless results.
>>
>> And you can even get something like this:
>>
>> >>> t0 = pd.Timestamp.now()
>> ... time.sleep(10)
>> ... t1 = pd.Timestamp.now()
>> ...
t1 - t0 >> Timedelta('0 days 01:00:10.001583') >> >> if the first measurement happened to be in winter time and the second >> measurement happened to be in daylight saving time. >> >> The solution is simple, and is what datetime64 used to do before the >> change - have a type that just represents a moment in time. It's not "in >> UTC" - it just stores the number of seconds that passed since an agreed >> moment in time (which is usually 1970-01-01 02:00+0200, which is more >> commonly referred to as 1970-01-01 00:00Z - it's the exact same moment). >> >> I think it would make things clearer if I'll mention that there are >> operations that are not dealing with timestamps. For example, it's >> meaningless to ask what is the year of a timestamp - it may depend on the >> time zone. These are always *human* related questions, that depend on >> certain human conventions. We can call them "calendar questions". For these >> types of questions, a type that includes both a timestamp and a timezone >> offset (in minutes from UTC) can be useful. Some questions even require >> full timezone information, meaning a function that defines what's the >> timezone offset for each moment. However, I don't think numpy should deal >> with those calendar issues. As a very simple example, even for >> "timestamp+offset" types, it's not clear how to compare them - should >> values with the same timestamp and different offsets be considered equal or >> not? And in virtually all of my data analysis, this calendar aspect has >> nothing to do with the questions I'm trying to answer. >> >> I have a suggestion. Instead of changing datetime64 (which I consider to >> be ill-defined, but never mind), add a new type called "timestamp64". It >> will have the exact same behavior as datetime64 had before the change, >> except that its only allowed units will be seconds, milliseconds, >> microseconds and nanoseconds. Removing the longer units will make it clear >> that it doesn't deal with calendar and dates. Also, all the business day >> functionality will not be applicable to timestamp64. In order to get >> calendar information (such as the year) from timestamp64, you will have to >> manually convert it to python's datetime (or to np.datetime64) with an >> explicit timezone (utc, local, an offset, or a timezone object). >> >> What do you think? >> >> Thanks, >> Noam >> >> >> >> >> >> On Fri, Nov 6, 2020 at 1:45 AM Stephan Hoyer wrote: >> >>> I can try to dig up the old discussions, but datetime64 used to >>> implement both (1) and (3), and this was updated in a very intentional way. >>> Datetime64 now works like Python's own time-zone naive datetime.datetime >>> objects. The documentation referencing "Z" should be updated -- datetime64 >>> can be in any timezone you like. >>> >>> Timezone aware datetime objects are certainly useful, but NumPy's >>> datetime64 was restricted to UTC. The consensus was that it was worse to >>> have UTC-only rather than timezone-naive-only. NumPy's datetime64 is often >>> used for data analysis purposes, for which automatic conversion to the >>> local timezone of the computer running the analysis is often >>> counter-productive. >>> >>> If you care about timezone conversions, I would highly recommend looking >>> into pandas's Timestamp class for this purpose. In the future, this would >>> be a good use-case for a new custom NumPy dtype. (The existing >>> np.datetime64 code cannot easily handle multiple timezones.) 
>>> >>> On Thu, Nov 5, 2020 at 1:04 PM Eric Wieser >>> wrote: >>> >>>> Without weighing in yet on how I feel about the deprecation, you can >>>> see some discussion about why this was originally deprecated in the PR that >>>> introduced the warning: >>>> >>>> https://github.com/numpy/numpy/pull/6453 >>>> >>>> Eric >>>> >>>> On Thu, Nov 5, 2020, 20:13 Noam Yorav-Raphael >>>> wrote: >>>> >>>>> Hi, >>>>> >>>>> I suggest removing the deprecation warning when constructing a >>>>> datetime64 with a timezone. For example, this is the current behavior: >>>>> >>>>> >>> np.datetime64('2020-11-05 16:00+0200') >>>>> :1: DeprecationWarning: parsing timezone aware datetimes is >>>>> deprecated; this will raise an error in the future >>>>> numpy.datetime64('2020-11-05T14:00') >>>>> >>>>> I suggest removing the deprecation warning because I find this to be a >>>>> useful behavior, and because it is a correct behavior. The manual says: >>>>> "The datetime object represents a single moment in time... Datetimes are >>>>> always stored based on POSIX time, with an epoch of 1970-01-01T00:00Z." >>>>> So 2020-11-05T16:00+0200 is indeed the moment in time represented by >>>>> np.datetime64('2020-11-05T14:00'). >>>>> >>>>> I just used this to restrict my data set to records created after a >>>>> certain moment. It was easier for me to write the moment in my local time >>>>> and add "+0200" than to figure out the moment representation in UTC. >>>>> >>>>> So this is my simple suggestion: remove the deprecation warning. >>>>> >>>>> >>>>> Beyond that, I have 3 ideas for changing the repr of datetime64 that I >>>>> would like to discuss. >>>>> >>>>> 1. Add "Z" at the end, for example, >>>>> numpy.datetime64('2020-11-05T14:00Z'). This will make it clear to which >>>>> moment it refers. I think this is significant - I had to dig quite a bit to >>>>> realize that datetime64('2020-11-05T14:00') means 14:00 UTC. >>>>> >>>>> 2. Replace the 'T' with a space. I just find it much easier to read >>>>> '2020-11-05 14:00Z' than '2020-11-05T14:00Z'. The long sequence of >>>>> characters makes it hard for my brain to parse. >>>>> >>>>> 3. This will require discussion, but will be very convenient: have the >>>>> repr display the time using the environment time zone, including a time >>>>> offset. So, in my specific time zone (+0200), I will have: >>>>> >>>>> repr(np.datetime64('2020-11-05 14:00Z')) == >>>>> "numpy.datetime64('2020-11-05T16:00+0200')" >>>>> >>>>> I'm sure the pros and cons of having an environment-dependent repr >>>>> should be discussed. But I will list some pros: >>>>> 1. It's very convenient - it's immediately obvious to me to which >>>>> moment 2020-11-05 16:00+0200 refers. >>>>> 2. It's well defined - I may collect timestamps from machines with >>>>> different time zones, and I will be able to know to which exact moment each >>>>> timestamp refers. >>>>> 3. It's very simple - I could compare any two timestamps, I don't have >>>>> to worry about time zones. >>>>> >>>>> I would be happy to hear your thoughts. 
>>>>> >>>>> Thanks, >>>>> Noam >>>>> _______________________________________________ >>>>> NumPy-Discussion mailing list >>>>> NumPy-Discussion at python.org >>>>> https://mail.python.org/mailman/listinfo/numpy-discussion >>>>> >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at python.org >>>> https://mail.python.org/mailman/listinfo/numpy-discussion >>>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >>> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From noamraph at gmail.com Sat Nov 7 15:57:20 2020 From: noamraph at gmail.com (Noam Yorav-Raphael) Date: Sat, 7 Nov 2020 22:57:20 +0200 Subject: [Numpy-discussion] Proposal: add the timestamp64 type Message-ID: Hi, (I'm repeating things I wrote under the "datetime64: Remove deprecation warning..." thread, since I'm now proposing a new solution.) I propose to add a new type called "timestamp64". It will be a pure timestamp, meaning that it represents a moment in time (as seconds/ms/us/ns since the epoch), without any timezone information. It will have the exact same behavior as datetime64 had before version 1.11, except that its only allowed units will be seconds, milliseconds, microseconds and nanoseconds. Removing the longer units will make it clear that it doesn't deal with calendar and dates. Also, all the business day functionality will not be applicable to timestamp64. In order to get calendar information (such as the year) from timestamp64, you will have to manually convert it to python's datetime (or perhaps to np.datetime64) with an explicit timezone (utc, local, an offset, or a timezone object). This is needed because since the change introduced in 1.11, datetime64 no longer represents a timestamp, but rather a date and time of an abstract calendar. So given a datetime64, it is not possible to get an actual timestamp without knowing the timezone to which the datetime64 refers. If the datetime64 is in a timezone with daylight saving time, it can even be ambiguous, since the same written hour will occur twice on the transition from DST to winter time. I would like it to work like this: >>> np.timestamp64.now() numpy.timestamp64('2020-11-07 22:42:52.871159+0200') >>> np.timestamp64.now('s') numpy.timestamp64('2020-11-07 22:42:52+0200') >>> np.timestamp64(1604781916, 's') numpy.timestamp64('2020-11-07 22:42:52+0200') >>> np.timestamp64('2020-11-07 20:42:52Z') numpy.timestamp64('2020-11-07 22:42:52+0200') * timestamp64.now() will get an optional string argument with the base units. If not given, I think 'us' is a good default. * The repr will format the timestamp using the environment's timezone. * I like the repr to not include a 'T' between the date and the time. I find it much easier to read. * I tend to think that it should be allowed to construct a timestamp64 from an ISO8601 string without a timezone offset, in which case the environment's timezone will be used to convert it to a timestamp. 
So in the Asia/Jerusalem timezone it will look like: >>> np.timestamp64('2020-11-07 22:42:52') numpy.timestamp64('2020-11-07 22:42:52+0200') >>> np.timestamp64('2020-08-01 22:00:00') numpy.timestamp64('2020-08-01 22:00:00+0300') If I implement this, could it be added to numpy? Thanks, Noam -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Nov 9 20:17:44 2020 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 9 Nov 2020 18:17:44 -0700 Subject: [Numpy-discussion] Python 3.6 has been dropped from NumPy 1.20 Message-ID: Hi All, The subject says it all: Python 3.6 has been dropped for the NumPy 1.20 release. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Tue Nov 10 05:46:44 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 10 Nov 2020 10:46:44 +0000 Subject: [Numpy-discussion] Python 3.6 has been dropped from NumPy 1.20 In-Reply-To: References: Message-ID: On Tue, Nov 10, 2020 at 1:18 AM Charles R Harris wrote: > Hi All, > > The subject says it all: Python 3.6 has been dropped for the NumPy 1.20 > release. > That's great, thanks! Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Tue Nov 10 13:19:53 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 10 Nov 2020 18:19:53 +0000 Subject: [Numpy-discussion] start of an array (tensor) and dataframe API standardization initiative In-Reply-To: References: Message-ID: Hi all, I'd like to share an update on this topic. The draft array API standard is now ready for wider review: - Blog post: https://data-apis.org/blog/array_api_standard_release - Array API standard document: https://data-apis.github.io/array-api/latest/ - Repo: https://github.com/data-apis/array-api/ It would be great if people - and in particular, NumPy maintainers - could have a look at it and see if that looks sensible from a NumPy perspective and whether the goals and benefits of adopting it are described clearly enough and are compelling. I'm sure a NEP will be needed for proposing adoption of the standard once it is closer to completion, and work out what that means for interaction with the array protocol NEPs and/or NEP 37, and how an implementation would look. It's a bit early for that now, I'm thinking maybe by the end of the year. Some initial discussion now would be useful though, since it's easier to make changes now rather than when that API standard is already further along. Cheers, Ralf On Mon, Aug 17, 2020 at 9:34 PM Ralf Gommers wrote: > Hi all, > > I'd like to share this announcement blog post about the creation of a > consortium for array and dataframe API standardization here: > https://data-apis.org/blog/announcing_the_consortium/. It's still in the > beginning stages, but starting to take shape. We have participation from > one or more maintainers of most array and tensor libraries - NumPy, > TensorFlow, PyTorch, MXNet, Dask, JAX, Xarray. Stephan Hoyer, Travis > Oliphant and myself have been providing input from a NumPy perspective. > > The effort is very much related to some of the interoperability work we've > been doing in NumPy (e.g. it could provide an answer to what's described in > https://numpy.org/neps/nep-0037-array-module.html#requesting-restricted-subsets-of-numpy-s-api > ). > > At this point we're looking for feedback from maintainers at a high level > (see the blog post for details). 
> > Also important: the python-record-api tooling and data in its repo has > very granular API usage data, of the kind we could really use when making > decisions that impact backwards compatibility. > > Cheers, > Ralf > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Tue Nov 10 16:20:52 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 10 Nov 2020 15:20:52 -0600 Subject: [Numpy-discussion] NumPy Community Meeting Wednesday Message-ID: Hi all, There will be a NumPy Community meeting Wednesday November 11th at 1pm Pacific Time (20:00 UTC). Everyone is invited and encouraged to join in and edit the work-in-progress meeting topics and notes at: https://hackmd.io/76o-IxCjQX2mOXO_wwkcpg?both Best wishes Sebastian -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From ilhanpolat at gmail.com Wed Nov 11 05:55:39 2020 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Wed, 11 Nov 2020 11:55:39 +0100 Subject: [Numpy-discussion] start of an array (tensor) and dataframe API standardization initiative In-Reply-To: References: Message-ID: This is great work. Thanks to everyone who contributed. Very clean user-interface too. One question: Can we propose feature requests already or is that discussion closed? On Tue, Nov 10, 2020 at 7:21 PM Ralf Gommers wrote: > Hi all, > > I'd like to share an update on this topic. The draft array API standard is > now ready for wider review: > > - Blog post: https://data-apis.org/blog/array_api_standard_release > - Array API standard document: > https://data-apis.github.io/array-api/latest/ > - Repo: https://github.com/data-apis/array-api/ > > It would be great if people - and in particular, NumPy maintainers - could > have a look at it and see if that looks sensible from a NumPy perspective > and whether the goals and benefits of adopting it are described clearly > enough and are compelling. > > I'm sure a NEP will be needed for proposing adoption of the standard once > it is closer to completion, and work out what that means for interaction > with the array protocol NEPs and/or NEP 37, and how an implementation would > look. It's a bit early for that now, I'm thinking maybe by the end of the > year. Some initial discussion now would be useful though, since it's easier > to make changes now rather than when that API standard is already further > along. > > Cheers, > Ralf > > > On Mon, Aug 17, 2020 at 9:34 PM Ralf Gommers > wrote: > >> Hi all, >> >> I'd like to share this announcement blog post about the creation of a >> consortium for array and dataframe API standardization here: >> https://data-apis.org/blog/announcing_the_consortium/. It's still in the >> beginning stages, but starting to take shape. We have participation from >> one or more maintainers of most array and tensor libraries - NumPy, >> TensorFlow, PyTorch, MXNet, Dask, JAX, Xarray. Stephan Hoyer, Travis >> Oliphant and myself have been providing input from a NumPy perspective. >> >> The effort is very much related to some of the interoperability work >> we've been doing in NumPy (e.g. it could provide an answer to what's >> described in >> https://numpy.org/neps/nep-0037-array-module.html#requesting-restricted-subsets-of-numpy-s-api >> ). >> >> At this point we're looking for feedback from maintainers at a high level >> (see the blog post for details). 
>>
>> Also important: the python-record-api tooling and data in its repo has
>> very granular API usage data, of the kind we could really use when making
>> decisions that impact backwards compatibility.
>>
>> Cheers,
>> Ralf
>>
>> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From compl.yue at icloud.com Wed Nov 11 07:14:35 2020
From: compl.yue at icloud.com (YueCompl)
Date: Wed, 11 Nov 2020 20:14:35 +0800
Subject: [Numpy-discussion] start of an array (tensor) and dataframe API standardization initiative
In-Reply-To: References: Message-ID: <38D4DE93-1F6D-4F09-820F-2E335FDF0674@icloud.com>

This is great!

I'm working on a Haskell-based mmap shared-array lib, with a Python-like surface-language API. I would adhere to such a standard very willingly.

From a quick skim I can't find dataframe-related info - is that scheduled for the future? Will it take Pandas as the primary reference?

Thanks with best regards,
Compl

> On 2020-11-11, at 02:19, Ralf Gommers wrote:
>
> Hi all,
>
> I'd like to share an update on this topic. The draft array API standard is now ready for wider review:
>
> - Blog post: https://data-apis.org/blog/array_api_standard_release
> - Array API standard document: https://data-apis.github.io/array-api/latest/
> - Repo: https://github.com/data-apis/array-api/
>
> It would be great if people - and in particular, NumPy maintainers - could have a look at it and see if that looks sensible from a NumPy perspective and whether the goals and benefits of adopting it are described clearly enough and are compelling.
>
> I'm sure a NEP will be needed for proposing adoption of the standard once it is closer to completion, and work out what that means for interaction with the array protocol NEPs and/or NEP 37, and how an implementation would look. It's a bit early for that now, I'm thinking maybe by the end of the year. Some initial discussion now would be useful though, since it's easier to make changes now rather than when that API standard is already further along.
>
> Cheers,
> Ralf
>
>
> On Mon, Aug 17, 2020 at 9:34 PM Ralf Gommers wrote:
> Hi all,
>
> I'd like to share this announcement blog post about the creation of a consortium for array and dataframe API standardization here: https://data-apis.org/blog/announcing_the_consortium/ . It's still in the beginning stages, but starting to take shape. We have participation from one or more maintainers of most array and tensor libraries - NumPy, TensorFlow, PyTorch, MXNet, Dask, JAX, Xarray. Stephan Hoyer, Travis Oliphant and myself have been providing input from a NumPy perspective.
>
> The effort is very much related to some of the interoperability work we've been doing in NumPy (e.g. it could provide an answer to what's described in https://numpy.org/neps/nep-0037-array-module.html#requesting-restricted-subsets-of-numpy-s-api ).
>
> At this point we're looking for feedback from maintainers at a high level (see the blog post for details).
> > Cheers, > Ralf > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Nov 11 07:57:03 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 11 Nov 2020 12:57:03 +0000 Subject: [Numpy-discussion] start of an array (tensor) and dataframe API standardization initiative In-Reply-To: References: Message-ID: On Wed, Nov 11, 2020 at 10:56 AM Ilhan Polat wrote: > This is great work. Thanks to everyone who contributed. Very clean > user-interface too. > > One question: Can we propose feature requests already or is that > discussion closed? > It's not closed, this is the start of community review so if things are missing or need changing, now is a good time to bring them up - please have a look at CONTRIBUTING.md in the array-api repo. What I would personally expect is that most discussion will be about the bigger picture topics and about the clarity of the document. There may be some individual functions that are important to add, if that's what you have in mind I would recommend looking at some merged PRs to see how the analysis is done (e.g. usage data, comparison between existing libraries). https://github.com/data-apis/array-api/pull/42 is a good example. Cheers, Ralf > On Tue, Nov 10, 2020 at 7:21 PM Ralf Gommers > wrote: > >> Hi all, >> >> I'd like to share an update on this topic. The draft array API standard >> is now ready for wider review: >> >> - Blog post: https://data-apis.org/blog/array_api_standard_release >> - Array API standard document: >> https://data-apis.github.io/array-api/latest/ >> - Repo: https://github.com/data-apis/array-api/ >> >> It would be great if people - and in particular, NumPy maintainers - >> could have a look at it and see if that looks sensible from a NumPy >> perspective and whether the goals and benefits of adopting it are described >> clearly enough and are compelling. >> >> I'm sure a NEP will be needed for proposing adoption of the standard once >> it is closer to completion, and work out what that means for interaction >> with the array protocol NEPs and/or NEP 37, and how an implementation would >> look. It's a bit early for that now, I'm thinking maybe by the end of the >> year. Some initial discussion now would be useful though, since it's easier >> to make changes now rather than when that API standard is already further >> along. >> >> Cheers, >> Ralf >> >> >> On Mon, Aug 17, 2020 at 9:34 PM Ralf Gommers >> wrote: >> >>> Hi all, >>> >>> I'd like to share this announcement blog post about the creation of a >>> consortium for array and dataframe API standardization here: >>> https://data-apis.org/blog/announcing_the_consortium/. It's still in >>> the beginning stages, but starting to take shape. We have participation >>> from one or more maintainers of most array and tensor libraries - NumPy, >>> TensorFlow, PyTorch, MXNet, Dask, JAX, Xarray. Stephan Hoyer, Travis >>> Oliphant and myself have been providing input from a NumPy perspective. >>> >>> The effort is very much related to some of the interoperability work >>> we've been doing in NumPy (e.g. it could provide an answer to what's >>> described in >>> https://numpy.org/neps/nep-0037-array-module.html#requesting-restricted-subsets-of-numpy-s-api >>> ). 
>>>
>>> At this point we're looking for feedback from maintainers at a high
>>> level (see the blog post for details).
>>>
>>> Also important: the python-record-api tooling and data in its repo has
>>> very granular API usage data, of the kind we could really use when making
>>> decisions that impact backwards compatibility.
>>>
>>> Cheers,
>>> Ralf
>>>
>>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at python.org
>> https://mail.python.org/mailman/listinfo/numpy-discussion
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ralf.gommers at gmail.com Wed Nov 11 08:00:31 2020
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Wed, 11 Nov 2020 13:00:31 +0000
Subject: [Numpy-discussion] start of an array (tensor) and dataframe API standardization initiative
In-Reply-To: <38D4DE93-1F6D-4F09-820F-2E335FDF0674@icloud.com>
References: <38D4DE93-1F6D-4F09-820F-2E335FDF0674@icloud.com>
Message-ID:

On Wed, Nov 11, 2020 at 12:15 PM YueCompl wrote:
> This is great!
>
> I'm working on a Haskell-based mmap shared-array lib, with a Python-like
> surface-language API. I would adhere to such a standard very willingly.

Awesome. Library authors from other languages are definitely another audience we had in mind, so glad to hear it's helpful.

> From a quick skim I can't find dataframe-related info - is that scheduled for
> the future? Will it take Pandas as the primary reference?

Yes, that is planned but will take a while longer. Dataframes are less mature, and Pandas itself is still very much in flux (the first proposal after the 1.0 release was "let's deprecate for 2.0"), so it's a more complex puzzle. Pandas is an important reference, but I'd expect the end result to deviate more from Pandas than the array API differs from NumPy.

Cheers,
Ralf

> Thanks with best regards,
> Compl
>
>
> On 2020-11-11, at 02:19, Ralf Gommers wrote:
>
> Hi all,
>
> I'd like to share an update on this topic. The draft array API standard is now ready for wider review:
>
> - Blog post: https://data-apis.org/blog/array_api_standard_release
> - Array API standard document: https://data-apis.github.io/array-api/latest/
> - Repo: https://github.com/data-apis/array-api/
>
> It would be great if people - and in particular, NumPy maintainers - could have a look at it and see if that looks sensible from a NumPy perspective and whether the goals and benefits of adopting it are described clearly enough and are compelling.
>
> I'm sure a NEP will be needed for proposing adoption of the standard once it is closer to completion, and work out what that means for interaction with the array protocol NEPs and/or NEP 37, and how an implementation would look. It's a bit early for that now, I'm thinking maybe by the end of the year. Some initial discussion now would be useful though, since it's easier to make changes now rather than when that API standard is already further along.
>
> Cheers,
> Ralf
>
>
> On Mon, Aug 17, 2020 at 9:34 PM Ralf Gommers wrote:
> Hi all,
>
> I'd like to share this announcement blog post about the creation of a consortium for array and dataframe API standardization here: https://data-apis.org/blog/announcing_the_consortium/ . It's still in the beginning stages, but starting to take shape.
We have participation from >> one or more maintainers of most array and tensor libraries - NumPy, >> TensorFlow, PyTorch, MXNet, Dask, JAX, Xarray. Stephan Hoyer, Travis >> Oliphant and myself have been providing input from a NumPy perspective. >> >> The effort is very much related to some of the interoperability work >> we've been doing in NumPy (e.g. it could provide an answer to what's >> described in >> https://numpy.org/neps/nep-0037-array-module.html#requesting-restricted-subsets-of-numpy-s-api >> ). >> >> At this point we're looking for feedback from maintainers at a high level >> (see the blog post for details). >> >> Also important: the python-record-api tooling and data in its repo has >> very granular API usage data, of the kind we could really use when making >> decisions that impact backwards compatibility. >> >> Cheers, >> Ralf >> >> _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From noamraph at gmail.com Wed Nov 11 08:07:12 2020 From: noamraph at gmail.com (Noam Yorav-Raphael) Date: Wed, 11 Nov 2020 15:07:12 +0200 Subject: [Numpy-discussion] Proposal: add the timestamp64 type In-Reply-To: References: Message-ID: I added discussing my proposal to the upcoming meeting agenda. I thought of a refinement. Since numpy data types don't have static methods, instead of using "timestamp64.now()" it could be another function of the constructor. So timestamp64() will return the current timestamp in microseconds, and timestamp64('s'), timestamp64('ms'), timestamp64('us') and timestamp64('ns') will return the current timestamp in the given unit. This makes the interface even simpler! Cheers, Noam On Sat, Nov 7, 2020 at 10:57 PM Noam Yorav-Raphael wrote: > Hi, > > (I'm repeating things I wrote under the "datetime64: Remove deprecation > warning..." thread, since I'm now proposing a new solution.) > > I propose to add a new type called "timestamp64". It will be a pure > timestamp, meaning that it represents a moment in time (as seconds/ms/us/ns > since the epoch), without any timezone information. It will have the exact > same behavior as datetime64 had before version 1.11, except that its only > allowed units will be seconds, milliseconds, microseconds and nanoseconds. > Removing the longer units will make it clear that it doesn't deal with > calendar and dates. Also, all the business day functionality will not be > applicable to timestamp64. In order to get calendar information (such as > the year) from timestamp64, you will have to manually convert it to > python's datetime (or perhaps to np.datetime64) with an explicit timezone > (utc, local, an offset, or a timezone object). > > This is needed because since the change introduced in 1.11, datetime64 no > longer represents a timestamp, but rather a date and time of an abstract > calendar. So given a datetime64, it is not possible to get an actual > timestamp without knowing the timezone to which the datetime64 refers. If > the datetime64 is in a timezone with daylight saving time, it can even be > ambiguous, since the same written hour will occur twice on the transition > from DST to winter time. 
>
> I would like it to work like this:
>
> >>> np.timestamp64.now()
> numpy.timestamp64('2020-11-07 22:42:52.871159+0200')
>
> >>> np.timestamp64.now('s')
> numpy.timestamp64('2020-11-07 22:42:52+0200')
>
> >>> np.timestamp64(1604781916, 's')
> numpy.timestamp64('2020-11-07 22:42:52+0200')
>
> >>> np.timestamp64('2020-11-07 20:42:52Z')
> numpy.timestamp64('2020-11-07 22:42:52+0200')
>
> * timestamp64.now() will accept an optional string argument with the base
> units. If not given, I think 'us' is a good default.
> * The repr will format the timestamp using the environment's timezone.
> * I like the repr to not include a 'T' between the date and the time. I
> find it much easier to read.
> * I tend to think that it should be allowed to construct a timestamp64
> from an ISO 8601 string without a timezone offset, in which case the
> environment's timezone will be used to convert it to a timestamp. So in the
> Asia/Jerusalem timezone it will look like:
>
> >>> np.timestamp64('2020-11-07 22:42:52')
> numpy.timestamp64('2020-11-07 22:42:52+0200')
>
> >>> np.timestamp64('2020-08-01 22:00:00')
> numpy.timestamp64('2020-08-01 22:00:00+0300')
>
> If I implement this, could it be added to numpy?
>
> Thanks,
> Noam
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From matti.picus at gmail.com Thu Nov 12 08:53:46 2020
From: matti.picus at gmail.com (Matti Picus)
Date: Thu, 12 Nov 2020 15:53:46 +0200
Subject: [Numpy-discussion] start of an array (tensor) and dataframe API standardization initiative
In-Reply-To: References: Message-ID:

On 11/10/20 8:19 PM, Ralf Gommers wrote:
> Hi all,
>
> I'd like to share an update on this topic. The draft array API
> standard is now ready for wider review:
>
> - Blog post: https://data-apis.org/blog/array_api_standard_release
> - Array API standard document: https://data-apis.github.io/array-api/latest/
> - Repo: https://github.com/data-apis/array-api/
>
> It would be great if people - and in particular, NumPy maintainers -
> could have a look at it and see if that looks sensible from a NumPy
> perspective and whether the goals and benefits of adopting it are
> described clearly enough and are compelling.

I think it is compelling for a first version. The test suite and benchmark suite will be valuable tools. I hope future versions standardize complex numbers as a dtype.

I realize there is a limit to the breadth of the scope of functions to be covered. Is there a page that lists them in one place? For instance, I tried to look up what the standard has to say on issue https://github.com/numpy/numpy/issues/17760 about using bincount on uint64 arrays. It took me a while to figure out that bincount was not in the API (although unique(..., return_counts) is).

Matti

From stefano.miccoli at polimi.it Thu Nov 12 11:04:18 2020
From: stefano.miccoli at polimi.it (Stefano Miccoli)
Date: Thu, 12 Nov 2020 16:04:18 +0000
Subject: [Numpy-discussion] Proposal: add the timestamp64 type (Noam Yorav-Raphael)
In-Reply-To: References: Message-ID: <9DE45866-E937-48A0-ADB4-FCED4FB30790@polimi.it>

On 11 Nov 2020, at 18:00, numpy-discussion-request at python.org wrote:

I propose to add a new type called "timestamp64". It will be a pure timestamp, meaning that it represents a moment in time (as seconds/ms/us/ns since the epoch), without any timezone information.

Sorry, but I really don't see the usefulness of another time-stamping format based on POSIX time. Indeed POSIX time is based on a naive approximation of UTC and is ambiguous across leap seconds. Quoting from Wikipedia:

"The Unix time number 1483228800 is thus ambiguous: it can refer either to the start of the leap second (2016-12-31 23:59:60) or to the end of it, one second later (2017-01-01 00:00:00). In the theoretical case when a negative leap second occurs, no ambiguity is caused, but instead there is a range of Unix time numbers that do not refer to any point in UTC time at all."
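To make this concrete, here is a quick sketch with the standard library (numpy's datetime64 behaves the same way in this respect):

>>> from datetime import datetime, timezone
>>> datetime.fromtimestamp(1483228800, tz=timezone.utc)
datetime.datetime(2017, 1, 1, 0, 0, tzinfo=datetime.timezone.utc)
>>> datetime(2016, 12, 31, 23, 59, 60)  # the leap second itself
Traceback (most recent call last):
  ...
ValueError: second must be in 0..59

The leap second simply has no representation: the timestamp 1483228800 maps only to the moment after it.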
Precision time-stamping is quite a complex task: you can use UTC, TAI, GPS, just to mention the most used timescales. And how do you deal with timestamps in the past, when timekeeping was based on the Earth's rotation and not on atomic clocks ticking at (approximately) 1 SI-second frequency?

In my opinion time-stamping should be application dependent, and I doubt that the new "timestamp64" could be beneficial to the numpy community.

Best regards,

Stefano
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From matti.picus at gmail.com Thu Nov 12 11:40:22 2020
From: matti.picus at gmail.com (Matti Picus)
Date: Thu, 12 Nov 2020 18:40:22 +0200
Subject: [Numpy-discussion] Proposal: add the timestamp64 type (Noam Yorav-Raphael)
In-Reply-To: <9DE45866-E937-48A0-ADB4-FCED4FB30790@polimi.it>
References: <9DE45866-E937-48A0-ADB4-FCED4FB30790@polimi.it>
Message-ID: <54968e8c-b9b3-83c0-e651-28f3518ce7de@gmail.com>

On 11/12/20 6:04 PM, Stefano Miccoli wrote:
>
>> On 11 Nov 2020, at 18:00, numpy-discussion-request at python.org wrote:
>>
>> I propose to add a new type called "timestamp64". It will be a pure
>> timestamp, meaning that it represents a moment in time (as
>> seconds/ms/us/ns since the epoch), without any timezone information.
>
> Sorry, but I really don't see the usefulness of another time-stamping
> format based on POSIX time. Indeed POSIX time is based on a naive
> approximation of UTC and is ambiguous across leap seconds. Quoting
> from Wikipedia
>
> ...

In a one-on-one discussion with Noam in a pre-community call (which, ironically, we had time for since we both messed up the meeting time-zone change) we reached the conclusion that the request is to clarify whether NumPy's datetime64 represents TAI time [0] or POSIX time, with a preference for TAI time. The documentation mentions POSIX time [1]. As Stefano points out, there is a difference of some tens of seconds (currently 37 s) between POSIX (or Unix) time and TAI time. In practice numpy simply stores an int64 value to represent the datetime64, and relies on others to convert it. The leap second might be getting lost in the conversions. So it might make sense to clarify exactly how those conversions deal with leap seconds, and to choose which one we mean when we use datetime64. Noam, please correct me if I am mistaken.

Matti

[0] https://en.wikipedia.org/wiki/International_Atomic_Time
[1] https://numpy.org/doc/stable/reference/arrays.datetime.html#datetime-units

From noamraph at gmail.com Thu Nov 12 16:13:53 2020
From: noamraph at gmail.com (Noam Yorav-Raphael)
Date: Thu, 12 Nov 2020 23:13:53 +0200
Subject: [Numpy-discussion] Proposal: add the timestamp64 type (Noam Yorav-Raphael)
In-Reply-To: <54968e8c-b9b3-83c0-e651-28f3518ce7de@gmail.com>
References: <9DE45866-E937-48A0-ADB4-FCED4FB30790@polimi.it> <54968e8c-b9b3-83c0-e651-28f3518ce7de@gmail.com>
Message-ID:

Hi Matti and Stefano,

My understanding is that datetime64 was decided to be neither TAI nor posix time, but rather to represent an abstract calendar point, like datetime.datetime without a specified timezone. This can usually be converted into posix time given a timezone (although in the "repeated" hour between DST and winter time there will be ambiguity!)
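For example, if we assume a value is meant as UTC, the conversion is just a cast (a sketch in plain numpy):

>>> import numpy as np
>>> d = np.datetime64('2020-11-05 14:00')  # no timezone attached
>>> d.astype('datetime64[s]').astype('int64')  # posix time, *if* we agree this meant UTC
1604584800

The integer is only meaningful once everybody agrees on the timezone the value was written in - with any other assumed zone it denotes a different moment.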
If it is agreed by all users that a datetime64 represents the time in UTC, it is the same as posix time. I would like to have a type that is defined to be equivalent to posix time.

I don't agree with Stefano - I think that posix time is very useful (as its ubiquity shows), and I think that a type that is defined to be posix time would also be very useful. I think that posix time is well suited for the vast majority of use cases. Indeed, there are use cases where you should take leap seconds into account, but those are rare. In practice, a leap second would be presented by the OS as a second that actually takes more than a second. This actually happens all the time even without leap seconds - when your computer automatically syncs with NTP, it adjusts the time continuously, so applications will not experience "time bumps". If you want to make sure that the intervals you measure are correct, you should use something like time.monotonic().

So, most users are not interested in very precise time measurements, but rather in knowing what happened before what, and roughly when. For this, posix time is great - it's very simple, and it does the job. In some cases you need to take leap seconds into account, but in those cases just using the computer clock will not give you the precision you need no matter what, so you'll need specialized software anyway.

I think that posix time is great, and since it's very easy to make wrong decisions that seem to work until you discover they don't (such as discovering too late that local time won't work when you are not sure of the time zone, or when you switch from DST to winter time), a sane and simple default is important.

Cheers,
Noam

On Thu, Nov 12, 2020 at 6:41 PM Matti Picus wrote:
>
> On 11/12/20 6:04 PM, Stefano Miccoli wrote:
> >
> >> On 11 Nov 2020, at 18:00, numpy-discussion-request at python.org wrote:
> >>
> >> I propose to add a new type called "timestamp64". It will be a pure
> >> timestamp, meaning that it represents a moment in time (as
> >> seconds/ms/us/ns since the epoch), without any timezone information.
> >
> > Sorry, but I really don't see the usefulness of another time-stamping
> > format based on POSIX time. Indeed POSIX time is based on a naive
> > approximation of UTC and is ambiguous across leap seconds. Quoting
> > from Wikipedia
> >
> > ...
>
> In a one-on-one discussion with Noam in a pre-community call (which,
> ironically, we had time for since we both messed up the meeting
> time-zone change) we reached the conclusion that the request is to
> clarify whether NumPy's datetime64 represents TAI time [0] or POSIX
> time, with a preference for TAI time. The documentation mentions POSIX
> time [1]. As Stefano points out, there is a difference of some tens of
> seconds (currently 37 s) between POSIX (or Unix) time and TAI time. In
> practice numpy simply stores an int64 value to represent the datetime64,
> and relies on others to convert it. The leap second might be getting lost
> in the conversions. So it might make sense to clarify exactly how those
> conversions deal with leap seconds, and to choose which one we mean when
> we use datetime64. Noam, please correct me if I am mistaken.
>
> Matti
>
> [0] https://en.wikipedia.org/wiki/International_Atomic_Time
> [1] https://numpy.org/doc/stable/reference/arrays.datetime.html#datetime-units
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From daniele at grinta.net Thu Nov 12 17:45:34 2020
From: daniele at grinta.net (Daniele Nicolodi)
Date: Thu, 12 Nov 2020 23:45:34 +0100
Subject: [Numpy-discussion] Proposal: add the timestamp64 type (Noam Yorav-Raphael)
In-Reply-To: <54968e8c-b9b3-83c0-e651-28f3518ce7de@gmail.com>
References: <9DE45866-E937-48A0-ADB4-FCED4FB30790@polimi.it> <54968e8c-b9b3-83c0-e651-28f3518ce7de@gmail.com>
Message-ID:

On 12/11/2020 17:40, Matti Picus wrote:
> In a one-on-one discussion with Noam in a pre-community call (which,
> ironically, we had time for since we both messed up the meeting
> time-zone change) we reached the conclusion that the request is to
> clarify whether NumPy's datetime64 represents TAI time [0] or POSIX
> time, with a preference for TAI time. The documentation mentions POSIX
> time [1]. As Stefano points out, there is a difference of some tens of
> seconds (currently 37 s) between POSIX (or Unix) time and TAI time. In
> practice numpy simply stores an int64 value to represent the datetime64,
> and relies on others to convert it. The leap second might be getting lost
> in the conversions. So it might make sense to clarify exactly how those
> conversions deal with leap seconds, and to choose which one we mean when
> we use datetime64. Noam, please correct me if I am mistaken.

Unix time is a representation of the UTC timescale that counts one-second intervals starting from a defined epoch. It deals with leap seconds either by skipping one interval (this has never happened so far) or by repeating an interval, so that two moments in time that on the UTC timescale are separated by one second (for example 2016-12-31 23:59:59 and 2016-12-31 23:59:60) are represented in the same way, and thus the conversion from Unix time to UTC is ambiguous during this one second. This has happened 37 times since 1972.

This comes with the nice property that minutes, hours and days always have the same duration (in Unix time), so converting from the Unix time representation to a date and hour and vice versa is fairly easy. The drawbacks are, as seen above, an ambiguity on leap seconds and the fact that the trivial computation of time intervals does not take leap seconds into account, and thus may come up short by a few seconds (any time interval across 2016-12-31 23:59:59 is off by at least one second if computed by simply subtracting Unix times).

I don't think these two drawbacks are important for Numpy (or any other general purpose library). As things stand, it is not even possible, in Python, with or without Numpy, to create a datetime or datetime64 object from the time "2016-12-31 23:59:60" (neither accepts the existence of a minute with 61 seconds), thus the ambiguity issue is not an issue in practice. The time interval issue may matter for some applications, but the ones affected are aware of the issue and have means to deal with it (the most common one being taking a day off on the days leap seconds are introduced).
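For instance, here is the interval issue in two lines of plain datetime64 arithmetic (which, like Unix time, knows nothing about the leap second inserted at the end of 2016):

>>> import numpy as np
>>> np.datetime64('2017-01-01T00:00:00') - np.datetime64('2016-12-31T23:59:59')
numpy.timedelta64(1,'s')

Two SI seconds actually elapsed on the UTC timescale between these two instants, but the subtraction reports one.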
I think documenting that datetime64 is a representation of fixed time intervals since a conventional epoch, neglecting leap seconds, is easy to explain and implement, and allows for easy interoperability with the rest of the world. What advantage would making datetime64 explicitly a representation of TAI bring?

One disadvantage would be that `np.datetime64(datetime.now())` would be harder to support, as we would be trying to match a point in time on the UTC time scale to a point in time on the TAI time scale. This is trivial for past times (we just need to adjust for the right offset), but it is impossible to do correctly for dates in the future, because we cannot predict future leap second insertions. This would, for example, make timestamp conversions not reproducible across announcements of leap second insertions.

Cheers,
Dan

From sebastian at sipsolutions.net Thu Nov 12 20:48:45 2020
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Thu, 12 Nov 2020 19:48:45 -0600
Subject: [Numpy-discussion] API, NEP: Inclusion of the experimental `like=` argument in NumPy 1.20 (we currently lean to yes)
Message-ID: <2303a2659afbd366a24f5c321a36d6baec4703de.camel@sipsolutions.net>

Hi all,

TL;DR: Should NumPy add a `like=` argument to array creation functions? This is an extension of the `__array_function__` protocol, useful when working with array-like objects other than NumPy arrays. Including it effectively means we preliminarily accept NEP 35. Note that without any feedback here, the current default is to include it in the upcoming NumPy 1.20 release.

Long version:

Users who only work with NumPy arrays and no alternative array objects are not affected by this (but will see a "useless" keyword argument). However, Dask and CuPy asked for the addition of a `like=` keyword argument to array creation functions (list below at [1]) in the proposed NEP 35:

https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html

This is an extension of the `__array_function__` protocol. I will refer to the well-written NEP for details.
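To give a feel for the API, a minimal sketch (with a plain ndarray as the reference the call behaves as usual; passing e.g. a Dask or CuPy array instead dispatches the creation to that library, assuming it implements the protocol):

>>> import numpy as np
>>> ref = np.arange(3)     # stand-in for e.g. a cupy/dask array
>>> np.zeros(3, like=ref)  # dispatches on the type of `ref`
array([0., 0., 0.])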
From stefanv at berkeley.edu  Thu Nov 12 23:22:10 2020
From: stefanv at berkeley.edu (Stefan van der Walt)
Date: Thu, 12 Nov 2020 20:22:10 -0800
Subject: [Numpy-discussion] API, NEP: Inclusion of the experimental `like=` argument in NumPy 1.20 (we currently lean to yes)
In-Reply-To: <2303a2659afbd366a24f5c321a36d6baec4703de.camel@sipsolutions.net>
References: <2303a2659afbd366a24f5c321a36d6baec4703de.camel@sipsolutions.net>
Message-ID: <4256d0d9-b2f8-416d-97de-2f40516ed638@www.fastmail.com>

On Thu, Nov 12, 2020, at 17:48, Sebastian Berg wrote:
> [3] There is the "middle ground": We could require an environment
> variable to activate it. But we discussed it briefly at the community
> meeting as well, and I think the consensus was there is probably no
> good argument for that (e.g. it would mean the argument doesn't show
> up in the online documentation).

As long as we don't follow this option (3), I think the changes are innocuous. If it helps some libraries out there, I see no reason not to include it.

Stéfan

From stefano.miccoli at polimi.it  Fri Nov 13 05:06:45 2020
From: stefano.miccoli at polimi.it (Stefano Miccoli)
Date: Fri, 13 Nov 2020 10:06:45 +0000
Subject: [Numpy-discussion] Proposal: add the timestamp64 type
In-Reply-To: References: Message-ID:

Discussion on time is endless! (Sorry for the extra noise on the mailing list, but I would like to clarify some points.)

If I got it right, np.datetime64 is defined by these points.

1) Internal representation: a 64-bit signed integer *plus* a time unit. The time unit can be expressed as
- a valid SI unit (the SI second and all decimal subunits down to the attosecond)
- a non-SI unit accepted for use with the SI (minute, hour, day)
- a date unit (week, month, year)

2) Conversion routines: a bijective map from the internal representation to a proleptic Gregorian calendar [0], assuming a fixed epoch of 1970-01-01T00:00Z. The mapping neglects leap seconds and is not time-zone aware.

I think that the current choice of 2) is a sensible one: I agree with Dan that it is useful to a wide audience, easy to compute, and not ambiguous. I would discourage any attempt to implement in numpy more complex mappings which are aware of time-zones and leap seconds and, why not, of the wide array of other time scales and time representations in use: this is a very complex task, and a nightmare from the point of view of maintenance. Other specialised libraries exist, like astropy.time [1] or dateutil [2], for this purpose.

However, the docs of numpy.datetime64 should be updated to explicitly mention the use of the proleptic Gregorian calendar, and to better clarify how the date units (month, year) are handled when cast to other, shorter units like seconds, etc.

Stefano

[0] https://en.wikipedia.org/wiki/Proleptic_Gregorian_calendar
[1] https://docs.astropy.org/en/stable/time/
[2] https://dateutil.readthedocs.io/en/stable/
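
A tiny check of point 1) with NumPy (the stored integer is just a count of the attached time unit since the epoch):

    import numpy as np

    t = np.datetime64('2017-01-01T00:00:00', 's')
    print(t.astype('int64'))                       # 1483228800, seconds since the epoch
    print(np.datetime64(t, 'ms').astype('int64'))  # 1483228800000, same instant in ms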
From matti.picus at gmail.com  Fri Nov 13 05:59:03 2020
From: matti.picus at gmail.com (Matti Picus)
Date: Fri, 13 Nov 2020 12:59:03 +0200
Subject: [Numpy-discussion] shipping manylinux1 wheels - when do we stop?
Message-ID: <0c8996ca-6674-b340-b6c7-92ea1eec2e1d@gmail.com>

The question of manylinux1 wheels came up enough that I wrote a blog post about it. In short: for 1.21 I would like to ship only manylinux2014 and up. Here is the blog post
https://labs.quansight.org/blog/2020/11/manylinux1-is-obsolete-manylinux2010-is-almost-eol-what-is-next/

Matti

From charlesr.harris at gmail.com  Fri Nov 13 10:57:37 2020
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 13 Nov 2020 08:57:37 -0700
Subject: [Numpy-discussion] shipping manylinux1 wheels - when do we stop?
In-Reply-To: <0c8996ca-6674-b340-b6c7-92ea1eec2e1d@gmail.com>
References: <0c8996ca-6674-b340-b6c7-92ea1eec2e1d@gmail.com>
Message-ID:

On Fri, Nov 13, 2020 at 3:59 AM Matti Picus wrote:
> The question of manylinux1 wheels came up enough that I wrote a blog
> post about it. In short: for 1.21 I would like to ship only
> manylinux2014 and up. Here is the blog post
> https://labs.quansight.org/blog/2020/11/manylinux1-is-obsolete-manylinux2010-is-almost-eol-what-is-next/

Good job summarizing the information. I looked at the code for how Python supports pip and it seems build-dependent; it isn't part of the Python library, so I'm not sure that pip and Python versions are strongly associated. I didn't find any such list when looking for it.

Chuck

From efrem.braun at gmail.com  Fri Nov 13 11:35:53 2020
From: efrem.braun at gmail.com (efremdan1)
Date: Fri, 13 Nov 2020 09:35:53 -0700 (MST)
Subject: [Numpy-discussion] How did Numpy get its latest version of the documentation to appear at the top of Google search results?
Message-ID: <1605285353468-0.post@n7.nabble.com>

I'm working with Bokeh (https://docs.bokeh.org/en/latest/), another open-source Python package. The developers would like to have the latest version of their documentation appear at the top of Google search results when users search for information, but knowledge of how to do this is lacking.

I've noticed that Numpy seems to have gotten this problem figured out, e.g., googling "numpy interpolate" results in the first hit being https://numpy.org/doc/stable/reference/generated/numpy.interp.html. This is unlike Python itself, where googling "python string formatting" results in the first hit being https://docs.python.org/3.4/library/string.html.

So apparently someone in the Numpy developer world knows how to set up the doc pages in a manner that allows for this. Would that person be willing to post to the Bokeh message board on the topic (https://discourse.bokeh.org/t/some-unsolicited-feedback/6643/17) with some advice?

Thank you!

--
Sent from: http://numpy-discussion.10968.n7.nabble.com/

From ilhanpolat at gmail.com  Fri Nov 13 11:43:15 2020
From: ilhanpolat at gmail.com (Ilhan Polat)
Date: Fri, 13 Nov 2020 17:43:15 +0100
Subject: [Numpy-discussion] How did Numpy get its latest version of the documentation to appear at the top of Google search results?
In-Reply-To: <1605285353468-0.post@n7.nabble.com>
References: <1605285353468-0.post@n7.nabble.com>
Message-ID:

Have a look here for "some" background: https://github.com/scipy/docs.scipy.org/issues/39

On Fri, Nov 13, 2020 at 5:37 PM efremdan1 wrote:
> I'm working with Bokeh (https://docs.bokeh.org/en/latest/), another
> open-source Python package.
> The developers would like to have the latest version of their
> documentation appear at the top of Google search results when users
> search for information, but knowledge of how to do this is lacking.
>
> I've noticed that Numpy seems to have gotten this problem figured out,
> e.g., googling "numpy interpolate" results in the first hit being
> https://numpy.org/doc/stable/reference/generated/numpy.interp.html.
> This is unlike Python itself, where googling "python string formatting"
> results in the first hit being https://docs.python.org/3.4/library/string.html.
>
> So apparently someone in the Numpy developer world knows how to set up
> the doc pages in a manner that allows for this. Would that person be
> willing to post to the Bokeh message board on the topic
> (https://discourse.bokeh.org/t/some-unsolicited-feedback/6643/17) with
> some advice?
>
> Thank you!
>
> --
> Sent from: http://numpy-discussion.10968.n7.nabble.com/
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From kevin.k.sheppard at gmail.com  Fri Nov 13 11:45:38 2020
From: kevin.k.sheppard at gmail.com (Kevin Sheppard)
Date: Fri, 13 Nov 2020 16:45:38 +0000
Subject: [Numpy-discussion] How did Numpy get its latest version of the documentation to appear at the top of Google search results?
In-Reply-To: <1605285353468-0.post@n7.nabble.com>
References: <1605285353468-0.post@n7.nabble.com>
Message-ID: <3B1E5BFF-36C3-4CC7-9D36-AE45C8EB92D5@hxcore.ol>

An HTML attachment was scrubbed...

From asmeurer at gmail.com  Fri Nov 13 14:20:06 2020
From: asmeurer at gmail.com (Aaron Meurer)
Date: Fri, 13 Nov 2020 12:20:06 -0700
Subject: [Numpy-discussion] How did Numpy get its latest version of the documentation to appear at the top of Google search results?
In-Reply-To: References: Message-ID:

I'm unclear from that issue what exactly was done that ended up working. Was it the "moved permanently" redirect, or something else? Did you use the webmaster tools?

"Moved permanently" redirects aren't an option if you want to host old version docs but still have Google default to "latest". For SymPy we got so tired of people ending up at old docs versions that we just removed them (so now we only have "latest" and "dev"). We don't support old versions anyway.

But another problem I noticed is that I had a fork of our docs repo on GitHub with the gh-pages branch, and people were ending up at the version of the docs on my fork (I discovered this from looking at the webmaster tools for my domain and seeing that those pages were being clicked on from search results).

But two things I can recommend:

- Make sure the latest version of your docs uses "latest" in the URL, instead of a version number. That way when people copy the URL to create a link, it will always point to the latest version (it looks like Bokeh already does this).

- Poke around at the Google webmaster tools. There's a lot of good stuff there, including a lot of good data on how people end up on your site via Google searches.
Aaron Meurer

On Fri, Nov 13, 2020 at 9:43 AM Ilhan Polat wrote:
> Have a look here for "some" background: https://github.com/scipy/docs.scipy.org/issues/39
>
> On Fri, Nov 13, 2020 at 5:37 PM efremdan1 wrote:
>> I'm working with Bokeh (https://docs.bokeh.org/en/latest/), another
>> open-source Python package. The developers would like to have the latest
>> version of their documentation appear at the top of Google search results
>> when users search for information, but knowledge of how to do this is
>> lacking.
>>
>> I've noticed that Numpy seems to have gotten this problem figured out,
>> e.g., googling "numpy interpolate" results in the first hit being
>> https://numpy.org/doc/stable/reference/generated/numpy.interp.html. This
>> is unlike Python itself, where googling "python string formatting"
>> results in the first hit being https://docs.python.org/3.4/library/string.html.
>>
>> So apparently someone in the Numpy developer world knows how to set up
>> the doc pages in a manner that allows for this. Would that person be
>> willing to post to the Bokeh message board on the topic
>> (https://discourse.bokeh.org/t/some-unsolicited-feedback/6643/17) with
>> some advice?
>>
>> Thank you!
>>
>> --
>> Sent from: http://numpy-discussion.10968.n7.nabble.com/
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at python.org
>> https://mail.python.org/mailman/listinfo/numpy-discussion
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From ralf.gommers at gmail.com  Fri Nov 13 18:09:27 2020
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Fri, 13 Nov 2020 23:09:27 +0000
Subject: [Numpy-discussion] start of an array (tensor) and dataframe API standardization initiative
In-Reply-To: References: Message-ID:

On Thu, Nov 12, 2020 at 1:54 PM Matti Picus wrote:
>
> On 11/10/20 8:19 PM, Ralf Gommers wrote:
> > Hi all,
> >
> > I'd like to share an update on this topic. The draft array API
> > standard is now ready for wider review:
> >
> > - Blog post: https://data-apis.org/blog/array_api_standard_release
> > - Array API standard document: https://data-apis.github.io/array-api/latest/
> > - Repo: https://github.com/data-apis/array-api/
> >
> > It would be great if people - and in particular, NumPy maintainers -
> > could have a look at it and see if that looks sensible from a NumPy
> > perspective and whether the goals and benefits of adopting it are
> > described clearly enough and are compelling.
>
> I think it is compelling for a first version. The test suite and
> benchmark suite will be valuable tools. I hope future versions
> standardize complex numbers as a dtype.

Yes, that's definitely a desire - when implementations are there/ready. At the moment most libraries have very incomplete support for complex dtypes, largely because they're not very important for deep learning. Also NumPy's implementations/choices are shaky in places, and that's being turned up by the PyTorch effort that's ongoing now to implement complex dtype support in a NumPy-compatible way.

> I realize there is a limit to the breadth of the scope of functions to
> be covered. Is there a page that lists them in one place? For instance,
> I tried to look up what the standard has to say on issue
> https://github.com/numpy/numpy/issues/17760 about using bincount on
> uint64 arrays. It took me a while to figure out that bincount was not
> in the API (although unique(..., return_counts) is).

That's a good idea and still missing, thanks for asking. The test suite that's in development has a complete list [1]. In the document itself Sphinx search works, but it should be easier to get a complete overview perhaps (although it requires some thought - the NumPy docs don't have everything on one page either).
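
For reference, the counting idiom that is in the standard -- a quick check with NumPy as one implementation:

    import numpy as np

    # bincount is not in the draft standard, but unique(..., return_counts=True) is
    values, counts = np.unique(np.array([1, 2, 2, 7], dtype=np.uint64),
                               return_counts=True)
    print(values)  # [1 2 7]
    print(counts)  # [1 2 1]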
[1] https://github.com/data-apis/array-api-tests/tree/master/array_api_tests/function_stubs

Cheers,
Ralf

> Matti
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From matti.picus at gmail.com  Sat Nov 14 12:42:04 2020
From: matti.picus at gmail.com (Matti Picus)
Date: Sat, 14 Nov 2020 19:42:04 +0200
Subject: [Numpy-discussion] shipping manylinux1 wheels - when do we stop?
In-Reply-To: References: <0c8996ca-6674-b340-b6c7-92ea1eec2e1d@gmail.com>
Message-ID:

On 11/13/20 5:57 PM, Charles R Harris wrote:
> Good job summarizing the information. I looked at the code for how
> Python supports pip and it seems build-dependent; it isn't part of the
> Python library, so I'm not sure that pip and Python versions are
> strongly associated. I didn't find any such list when looking for it.
>
> Chuck

In order to ascertain what version of pip is shipped with what version of CPython, I checked the package vendored into the CPython repo at "Lib/ensurepip/_bundled". In the table below, pip 19.0 was the first to support manylinux2010, 19.3 was the first to support manylinux2014. The release dates came from https://github.com/python/cpython/releases

    version   release date    pip
    3.7.2     Dec 23, 2018    18.1
    3.7.3     Mar 25, 2019    19.0
    3.7.8     June 27, 2020   20.1
    3.8.0-2   -               19.2
    3.8.4     July 13, 2020   20.1
    3.9.0     -               20.2
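
(The bundled version can also be queried directly on any given interpreter -- a small sketch using only the stdlib:)

    import ensurepip

    # reports the pip version vendored in Lib/ensurepip/_bundled,
    # i.e. what `python -m ensurepip` would install
    print(ensurepip.version())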
Matti

From robbmcleod at gmail.com  Mon Nov 16 12:26:17 2020
From: robbmcleod at gmail.com (Robert McLeod)
Date: Mon, 16 Nov 2020 09:26:17 -0800
Subject: [Numpy-discussion] manylinux wheels with Github Actions
Message-ID:

Everyone,

I'm looking to move `numexpr` into GitHub Actions for the combination of testing but also building wheels for deployment to PyPI. I've never really been satisfied with the Appveyor/Travis combo, and the documentation around Actions seems to be a lot stronger. I thought I might ask for advice on the mailing list before I jump in.

Does anyone here know of good, working examples of such for scientific packages that I could crib as a guide? I would want/need to make use of manylinux1 Docker images for building Linux wheels.

https://github.com/pypa/manylinux

This is an example recipe that builds Cython using a custom Docker image. It's a nice starting point, but it would be preferable to use the official pypa images.

https://github.com/marketplace/actions/python-wheels-manylinux-build

Kivy is a working example with a pretty complete test and build setup (although it's a bit overcomplicated for my purposes):

https://github.com/kivy/kivy/tree/master/.github/workflows

Anyone have any experiences to share with test and deploy via Actions?

Robert

--
Robert McLeod
robbmcleod at gmail.com
robert.mcleod at hitachi-hhtc.ca

From larson.eric.d at gmail.com  Mon Nov 16 12:34:43 2020
From: larson.eric.d at gmail.com (Eric Larson)
Date: Mon, 16 Nov 2020 12:34:43 -0500
Subject: [Numpy-discussion] manylinux wheels with Github Actions
In-Reply-To: References: Message-ID:

I have had good experiences for about a year now using cibuildwheel on Azure for VisPy and on python-rtmixer to deploy Linux, Windows, and macOS wheels to PyPI automatically when a release is tagged. It wasn't difficult to set up rtmixer after David Hoese did the heavy lifting sorting out everything with VisPy.

I haven't used other methods so I can't really offer any comparison, but the overhead for those two fairly simple (from a build-and-deploy perspective) projects at least has seemed pretty low.

My 2c,
Eric

On Mon, Nov 16, 2020 at 12:27 PM Robert McLeod wrote:
> Everyone,
>
> I'm looking to move `numexpr` into GitHub Actions for the combination of
> testing but also building wheels for deployment to PyPI. I've never really
> been satisfied with the Appveyor/Travis combo, and the documentation around
> Actions seems to be a lot stronger. I thought I might ask for advice on the
> mailing list before I jump in.
>
> Does anyone here know of good, working examples of such for scientific
> packages that I could crib as a guide? I would want/need to make use of
> manylinux1 Docker images for building Linux wheels.
>
> https://github.com/pypa/manylinux
>
> This is an example recipe that builds Cython using a custom Docker image.
> It's a nice starting point, but it would be preferable to use the official
> pypa images.
>
> https://github.com/marketplace/actions/python-wheels-manylinux-build
>
> Kivy is a working example with a pretty complete test and build setup
> (although it's a bit overcomplicated for my purposes):
>
> https://github.com/kivy/kivy/tree/master/.github/workflows
>
> Anyone have any experiences to share with test and deploy via Actions?
>
> Robert
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From andyfaff at gmail.com  Mon Nov 16 17:47:50 2020
From: andyfaff at gmail.com (Andrew Nelson)
Date: Tue, 17 Nov 2020 09:47:50 +1100
Subject: [Numpy-discussion] manylinux wheels with Github Actions
In-Reply-To: References: Message-ID:

For my project (refnx) I solely use GH Actions to test and make wheels. In my workflow (https://github.com/refnx/refnx/blob/master/.github/workflows/pythonpackage.yml) I make a 3.7/3.8/3.9 matrix across Linux/macOS/Windows. First of all I make the wheels for all those combos, then install the wheels, then run tests on the installed wheel (thereby checking that the libraries are all delocated nicely). The final step in the workflow is uploading the wheel artefacts (stored somewhere on GH).

To make the Linux 2010/2014 wheels I use PyPA docker images (https://github.com/refnx/refnx/blob/master/.github/workflows/pythonpackage.yml#L70). I specify what platform I want to make wheels for as an environment variable that is used to select the correct docker image ["manylinux2010_x86_64", "manylinux2014_x86_64"]. There are other images available.
The docker image runs a script to make all the wheels (https://github.com/refnx/refnx/blob/master/tools/build_manylinux_wheels.sh) and puts them in a wheelhouse, which is then uploaded as an artefact along with the macOS and Windows wheels.

I decided to make the deployment step to PyPI a manual process, just so I can check that everything went OK. When I've just made a release tag, I download the artefacts corresponding to the release commit, then upload them from my personal PC.

It all works really well.

From sebastian at sipsolutions.net  Tue Nov 17 23:58:54 2020
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Tue, 17 Nov 2020 22:58:54 -0600
Subject: [Numpy-discussion] NumPy Development Meeting Wednesday - Triage Focus
Message-ID:

Hi all,

Our bi-weekly triage-focused NumPy development meeting is tomorrow (Wednesday, November 18th) at 11 am Pacific Time (18:00 UTC). Everyone is invited to join in and edit the work-in-progress meeting topics and notes: https://hackmd.io/68i_JvOYQfy9ERiHgXMPvg

I encourage everyone to notify us of issues or PRs that you feel should be prioritized, discussed, or reviewed.

Best regards,
Sebastian

From melissawm at gmail.com  Thu Nov 19 15:33:39 2020
From: melissawm at gmail.com (Melissa Mendonça)
Date: Thu, 19 Nov 2020 17:33:39 -0300
Subject: [Numpy-discussion] A second CZI grant for NumPy and OpenBLAS
Message-ID:

Hi all,

I'm happy to announce that NumPy and OpenBLAS have (again) received a joint grant from the Chan Zuckerberg Initiative.

Here is the official press release [1], medium article [2], and the full list of grantees [3]. I also wrote a blog post about it to explain a bit more about the details of the proposal [4].

For NumPy, this funding is meant to support activities related to the Documentation Team (maintenance, onboarding of new team members), but also maintenance and development work for f2py. For this, we are glad to be able to support Pearu Peterson and hope we can improve the state of Fortran integration [5] in Python.

The grant is for a year of work, starting on January 1st, 2021. I hope this can have a positive impact on the community and I am grateful to be able to be a part of it.

All comments, questions and considerations are welcome!

Cheers,

Melissa

[1] https://chanzuckerberg.com/newsroom/czi-awards-4-7-million-for-open-source-software-and-organizations-advancing-open-science/
[2] https://cziscience.medium.com/scaling-open-infrastructure-and-reproducibility-in-biomedicine-69546a399747
[3] https://chanzuckerberg.com/eoss/proposals/?cycle=3
[4] https://labs.quansight.org/blog/2020/11/a-second-czi-grant-for-numpy-and-openblas/
[5] https://github.com/numpy/numpy/issues/14938

From stefanv at berkeley.edu  Thu Nov 19 16:09:25 2020
From: stefanv at berkeley.edu (Stefan van der Walt)
Date: Thu, 19 Nov 2020 13:09:25 -0800
Subject: [Numpy-discussion] A second CZI grant for NumPy and OpenBLAS
In-Reply-To: References: Message-ID:

On Thu, Nov 19, 2020, at 12:33, Melissa Mendonça wrote:
> I'm happy to announce that NumPy and OpenBLAS have (again) received a
> joint grant from the Chan Zuckerberg Initiative.

That's fantastic news -- congratulations, Melissa!
Stéfan

From dillon.niederhut at gmail.com  Fri Nov 20 12:15:37 2020
From: dillon.niederhut at gmail.com (Dillon Niederhut)
Date: Fri, 20 Nov 2020 11:15:37 -0600
Subject: [Numpy-discussion] A second CZI grant for NumPy and OpenBLAS
In-Reply-To: References: Message-ID:

Congratulations Melissa!

On Thu, Nov 19, 2020 at 2:34 PM Melissa Mendonça wrote:
> Hi all,
>
> I'm happy to announce that NumPy and OpenBLAS have (again) received a
> joint grant from the Chan Zuckerberg Initiative.
>
> Here is the official press release [1], medium article [2], and the full
> list of grantees [3]. I also wrote a blog post about it to explain a bit
> more about the details of the proposal [4].
>
> For NumPy, this funding is meant to support activities related to the
> Documentation Team (maintenance, onboarding of new team members), but also
> maintenance and development work for f2py. For this, we are glad to be able
> to support Pearu Peterson and hope we can improve the state of Fortran
> integration [5] in Python.
>
> The grant is for a year of work, starting on January 1st, 2021. I hope
> this can have a positive impact on the community and I am grateful to be
> able to be a part of it.
>
> All comments, questions and considerations are welcome!
>
> Cheers,
>
> Melissa
>
> [1] https://chanzuckerberg.com/newsroom/czi-awards-4-7-million-for-open-source-software-and-organizations-advancing-open-science/
> [2] https://cziscience.medium.com/scaling-open-infrastructure-and-reproducibility-in-biomedicine-69546a399747
> [3] https://chanzuckerberg.com/eoss/proposals/?cycle=3
> [4] https://labs.quansight.org/blog/2020/11/a-second-czi-grant-for-numpy-and-openblas/
> [5] https://github.com/numpy/numpy/issues/14938
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From melissawm at gmail.com  Fri Nov 20 17:10:11 2020
From: melissawm at gmail.com (Melissa Mendonça)
Date: Fri, 20 Nov 2020 19:10:11 -0300
Subject: [Numpy-discussion] Documentation Team meeting - Monday November 23
In-Reply-To: References: Message-ID:

Hi all!

First, let me apologize for the reminder that I sent last time: because of the time change I ended up making a mistake with the timezones.

Our next Documentation Team meeting will be on *Monday, November 23* at ***4PM UTC***. All are welcome - you don't need to already be a contributor to join. If you have questions or are curious about what we're doing, we'll be happy to meet you!

If you wish to join on Zoom, **you need to use this NEW link**
https://zoom.us/j/96219574921?pwd=VTRNeGwwOUlrYVNYSENpVVBRRjlkZz09

Here's the permanent hackmd document with the meeting notes (still being updated in the next few days!):
https://hackmd.io/oB_boakvRqKR-_2jRV-Qjg

Hope to see you around!

** You can click this link to get the correct time at your timezone:
https://www.timeanddate.com/worldclock/fixedtime.html?msg=NumPy+Documentation+Team+Meeting&iso=20201123T16&p1=1440&ah=1

*** You can add the NumPy community calendar to your google calendar by clicking this link:
https://calendar.google.com/calendar/r?cid=YmVya2VsZXkuZWR1X2lla2dwaWdtMjMyamJobGRzZmIyYzJqODFjQGdyb3VwLmNhbGVuZGFyLmdvb2dsZS5jb20

- Melissa
From tcaswell at gmail.com  Fri Nov 20 22:33:47 2020
From: tcaswell at gmail.com (Thomas Caswell)
Date: Fri, 20 Nov 2020 22:33:47 -0500
Subject: [Numpy-discussion] manylinux wheels with Github Actions
In-Reply-To: References: Message-ID:

Matplotlib has also migrated to building wheels via GitHub Actions and it has been working well.

Tom

On Mon, Nov 16, 2020, 17:48 Andrew Nelson wrote:
> For my project (refnx) I solely use GH Actions to test and make wheels. In
> my workflow (https://github.com/refnx/refnx/blob/master/.github/workflows/pythonpackage.yml)
> I make a 3.7/3.8/3.9 matrix across Linux/macOS/Windows. First of all I make
> the wheels for all those combos, then install the wheels, then run tests on
> the installed wheel (thereby checking that the libraries are all delocated
> nicely). The final step in the workflow is uploading the wheel artefacts
> (stored somewhere on GH).
>
> To make the Linux 2010/2014 wheels I use PyPA docker images
> (https://github.com/refnx/refnx/blob/master/.github/workflows/pythonpackage.yml#L70).
> I specify what platform I want to make wheels for as an environment
> variable that is used to select the correct docker image
> ["manylinux2010_x86_64", "manylinux2014_x86_64"]. There are other images
> available. The docker image runs a script to make all the wheels
> (https://github.com/refnx/refnx/blob/master/tools/build_manylinux_wheels.sh)
> and puts them in a wheelhouse, which is then uploaded as an artefact along
> with the macOS and Windows wheels.
>
> I decided to make the deployment step to PyPI a manual process, just so I
> can check that everything went OK. When I've just made a release tag I
> download the artefacts corresponding to the release commit, then upload
> them from my personal PC.
>
> It all works really well.
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From matti.picus at gmail.com  Sat Nov 21 14:28:27 2020
From: matti.picus at gmail.com (Matti Picus)
Date: Sat, 21 Nov 2020 21:28:27 +0200
Subject: [Numpy-discussion] Changing the size of PyArrayObject_fields (the ndarray c-struct)
Message-ID: <35a5a1b7-d3bd-0cf5-1698-c0d7be586fac@gmail.com>

PyArrayObject_fields is the c-struct that underlies ndarray. It is defined in ndarraytypes.h [0]. Since version 1.7, we have been trying to hide it from the public C-API so that we can freely modify it; the structure has the comment:

 * It has been recommended to use the inline functions defined below
 * (PyArray_DATA and friends) to access fields here for a number of
 * releases. Direct access to the members themselves is deprecated.
 * To ensure that your code does not use deprecated access,
 * #define NPY_NO_DEPRECATED_API NPY_1_7_API_VERSION
 * (or NPY_1_8_API_VERSION or higher as required).

In order to clean up buffer exports, Sebastian suggested (and I pushed for supporting) PR 16938 [1], which would add a new field to the struct. As Eric pointed out on the pull request, this would change the size of the struct, meaning users of the struct (i.e., those subclassing it in C) would have to be very careful interacting with NumPy-generated objects which may have changed sizes.

Or should we give up and declare that we cannot change the size of the struct until we release a NumPy 2.0?

Are there real-world cases that changing the size of the struct would break?
I admit I have an agenda to further modify the struct in upcoming versions to better support things like alternative data memory allocator strategies, so in my opinion it would be a shame if we are stuck forever with the current struct.

Matti

[0] https://github.com/numpy/numpy/blob/v1.19.4/numpy/core/include/numpy/ndarraytypes.h#L659
[1] https://github.com/numpy/numpy/pull/16938

From charlesr.harris at gmail.com  Sat Nov 21 16:42:41 2020
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 21 Nov 2020 14:42:41 -0700
Subject: [Numpy-discussion] Changing the size of PyArrayObject_fields (the ndarray c-struct)
In-Reply-To: <35a5a1b7-d3bd-0cf5-1698-c0d7be586fac@gmail.com>
References: <35a5a1b7-d3bd-0cf5-1698-c0d7be586fac@gmail.com>
Message-ID:

On Sat, Nov 21, 2020 at 12:28 PM Matti Picus wrote:
> PyArrayObject_fields is the c-struct that underlies ndarray. It is
> defined in ndarraytypes.h [0]. Since version 1.7, we have been trying to
> hide it from the public C-API so that we can freely modify it; the
> structure has the comment:
>
>  * It has been recommended to use the inline functions defined below
>  * (PyArray_DATA and friends) to access fields here for a number of
>  * releases. Direct access to the members themselves is deprecated.
>  * To ensure that your code does not use deprecated access,
>  * #define NPY_NO_DEPRECATED_API NPY_1_7_API_VERSION
>  * (or NPY_1_8_API_VERSION or higher as required).
>
> In order to clean up buffer exports, Sebastian suggested (and I pushed
> for supporting) PR 16938 [1] which would add a new field to the struct.
> As Eric pointed out on the pull request, this would change the size of
> the struct, meaning users of the struct (i.e., subclassing it in C)
> would have to be very careful interacting with NumPy-generated objects
> which may have changed sizes.
>
> Or should we give up and declare that we cannot change the size of the
> struct until we release a NumPy 2.0?
>
> Are there real-world cases that changing the size of the struct would
> break? I admit I have an agenda to further modify the struct in upcoming
> versions to better support things like alternative data memory allocator
> strategies, so in my opinion it would be a shame if we are stuck forever
> with the current struct.
>
> Matti

I think the risk is small and this will probably work; the potential problem is if people have extended the essentially private structure in C code. That said, it might be better to put it in after 1.20.x is branched so that it is out there for others to test against, and perhaps implement fixes if needed. We do need to make this move at some point and can't stay fixed on an old structure forever, so the proposed change is something of a warning shot.

Chuck

From sebastian at sipsolutions.net  Sat Nov 21 20:50:41 2020
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Sat, 21 Nov 2020 19:50:41 -0600
Subject: [Numpy-discussion] Changing the size of PyArrayObject_fields (the ndarray c-struct)
In-Reply-To: References: <35a5a1b7-d3bd-0cf5-1698-c0d7be586fac@gmail.com>
Message-ID: <241a1d0b64ea923966154a30cdeedb21f22739de.camel@sipsolutions.net>

On Sat, 2020-11-21 at 14:42 -0700, Charles R Harris wrote:
> On Sat, Nov 21, 2020 at 12:28 PM Matti Picus wrote:
> > PyArrayObject_fields is the c-struct that underlies ndarray. It is
> > defined in ndarraytypes.h [0].
> > Since version 1.7, we have been trying to
> > hide it from the public C-API so that we can freely modify it; the
> > structure has the comment:

TL;DR: Unless we find real examples of affected users, I currently think we should just do it. I don't see much gain in pushing it off (but I don't care). I simply expect exceedingly few affected users compared to small gains for everyone else (including our flexibility for future improvements).

> > As Eric pointed out on the pull request, this would change the size of
> > the struct, meaning users of the struct (i.e., subclassing it in C)
> > would have to be very careful interacting with NumPy-generated objects
> > which may have changed sizes.

Access to the struct remains ABI compatible. Only those relying on the size will have issues. Using the struct size is mainly relevant when subclassing in C. Just to note: Cython appears not relevant [0].

Adapting to this change may not be pretty, but it should not be very tricky to work around [1]. Without fixes/recompilation you hopefully get an error. But arbitrarily weird things could happen. (Most likely a crash.) I currently hope that few enough users will run into this that we can just help everyone who reports an issue...

> I think the risk is small and this will probably work, the potential
> problem is if people have extended the essentially private structure
> in C code. That said, it might be better to put it in after 1.20.x is
> branched so that it is out there for others to test against, and
> perhaps implement

I don't mind waiting. Although, I also don't really see us learning much, since only the bigger projects tend to test against master. Currently, I am aware of only one tiny project that will run into this, and the author of it seemed fine with having to adapt [2]. If we find more projects that are broken by this, I may change my mind. Until then, I think we should go ahead.

For those who got this far... Aside from code cleanup, the PR also reduces some overheads. Some quick, approximate timings:

* `memoryview(arr)` is 20% faster. This also helps typed memoryviews in cython, e.g.:

    cdef myfunc(double[::1] data):
        pass

has the same speed-up (about 15% faster without any function body).

* `arr[...]` is about 20+% faster. This is because the PR speeds up deletion of every NumPy array object by a small bit. I do not know whether this actually matters for real world code, likely not significantly... (It would be cool to have a pool of real world benchmarks to test this type of thing.)

Cheers,
Sebastian

[0] My attempt at this using:

    cdef class myarr(np.ndarray):
        pass

failed. It achieved nothing but crashes with current NumPy.

[1] There are three approaches:

1.
Just recompile (using the "deprecated api"):

    struct MyArraySubclass {
        /* reserve space for the parent ndarray struct */
        char base[sizeof(PyArrayObject_fields)];
        int my_field;
    };

And in the module init function, you should add:

    if (sizeof(PyArrayObject_fields) < PyArrayObject_Type.tp_basicsize) {
        PyErr_SetString(PyExc_RuntimeError,
                "Module not binary compatible with NumPy, please recompile.");
        return -1;
    }

2. Do the same as 1., but add a small constant:

    sizeof(PyArrayObject_fields) + constant

That way you can compile with an old NumPy version, but ensure compatibility with newer versions. (You still need the check while loading the module.)

3. You go to lengths to achieve 100% binary compatibility no matter what and avoid `sizeof(PyArrayObject_fields)` entirely:

    size_t offset_of_myfields = PyArrayObject_Type.tp_basicsize + alignment_padding;
    MyArraySubclass->tp_basicsize = offset_of_myfields + sizeof(my_fields);

And to access `my_fields` you have to use a small macro such as:

    #define MY_FIELDS(obj) ((my_fields *)((char *)obj + offset_of_myfields))

My personal guess is that solution 2. is sufficient for most small projects, as it allows you to compile in a forward-compatible way, but the last is the perfectly clean solution of course.

[2] https://github.com/patmiller/bignumpy

Also note that it may be that the project can be easily replaced with a simpler solution that does not require C-subclassing at all.
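
(For the curious, the size the import-time check compares against is also visible from Python -- a small sketch; the real check in approach 1 has to happen in C before the subclass type is used:)

    import numpy as np

    # tp_basicsize of the ndarray type, i.e. the number of bytes a C
    # subclass must reserve for the parent struct; this is what grows
    # when a field is added
    print(np.ndarray.__basicsize__)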
> fixes if needed. We do need to make this move at some point and can't
> stay fixed on an old structure forever, so the proposed change is
> something of a warning shot.
>
> Chuck

From ralf.gommers at gmail.com  Mon Nov 23 08:03:26 2020
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Mon, 23 Nov 2020 13:03:26 +0000
Subject: [Numpy-discussion] Fwd: [NumFOCUS Projects] Please Read: NumFOCUS Summit 2020
In-Reply-To: References: Message-ID:

FYI, this may be interesting to some maintainers. I can forward invites if you let me know you want one.

Cheers,
Ralf

---------- Forwarded message ---------
From: Nicole Foster
Date: Sat, Nov 21, 2020 at 4:46 PM
Subject: [NumFOCUS Projects] Please Read: NumFOCUS Summit 2020
To: Fiscally Sponsored Project Representatives, Affiliated Projects

Hello NumFOCUS Projects!

As you know, things have been a little different for us this year thanks to COVID-19. We won't be able to hold our usual in-person Summit, but this has prompted us to present some content that will reach those who wouldn't be able to attend otherwise. In place of this year's event we have chosen to hold a few online sessions that *ALL NumFOCUS project maintainers* *are invited to attend*. The sessions will take place via live Zoom meeting. Sessions will be recorded for those that still can't make the time slots, but we encourage everyone to attend the live sessions and interact with the presenters. The time slots were chosen in an attempt to accommodate as many folks as possible. Tentative schedule is below:

    Session Title                                             Presenter               Day & Time                               Duration
    Sponsoring Open Source Software with Government R&D
      Funding - The PALISADE Case Study                       Kurt Rohloff            Thursday, December 3rd - 5:00 p.m. UTC   1 hr
    Best Practices for Survey Design and Evaluation           Abdoul Karim Coulibaly  Wednesday, December 2nd - 5:00 p.m. UTC  1 1/2 hr
    Legal Q & A                                               Pam Chestek             Friday, December 4th - 5:00 p.m. UTC     1 1/2 hr
    Optimizing Engagement with the Community--Optuna's Story  Crissman Loomis         Friday, December 4th - 9:00 a.m. UTC     1 hr
    Social Media and Communications Best Practices            TBD                     TBD                                      1 hr
    Social Session                                            NumFOCUS                TBD                                      1 hr

*For the Legal Q & A we ask that you please send us your questions before the session by replying to this email. Pam can answer most legal questions in these areas: Copyright, Trademark, Open source licensing, IT contracts and Marketing*

*I will be sending out invitations to the session meetings as soon as they are available so please keep an eye out for those. We hope you all can attend!*

Best,
Nicole

--
Nicole Foster
Executive Operations Administrator, NumFOCUS
nicole at numfocus.org
512-831-2870 x102

--
You received this message because you are subscribed to the Google Groups "Fiscally Sponsored Project Representatives" group. To unsubscribe from this group and stop receiving emails from it, send an email to projects+unsubscribe at numfocus.org. To view this discussion on the web visit https://groups.google.com/a/numfocus.org/d/msgid/projects/CAJLwxPH0Qsr85iBVQ13Ay4oCS89YXp4FvPPFBpMLAbWo5m5wBQ%40mail.gmail.com .

From thomasbbrunner at gmail.com  Mon Nov 23 19:12:36 2020
From: thomasbbrunner at gmail.com (Thomas)
Date: Tue, 24 Nov 2020 01:12:36 +0100
Subject: [Numpy-discussion] NumPy Feature Request: Function to wrap angles to range [ 0, 2*pi] or [ -pi, pi ]
Message-ID:

Hi,

I have a proposal for a feature and I hope this is the right place to post this.

The idea is to have a function to map any input angle to the range of [ 0, 2*pi ] or [ -pi, pi ].

There already is a function called 'unwrap' that does the opposite, so I'd suggest calling this function 'wrap'.
Example usage:

    # wrap to range [ 0, 2*pi ]
    >>> np.wrap([ -2*pi, -pi, 0, 4*pi ])
    [0, pi, 0, 2*pi]

There is some ambiguity regarding what the solution should be for the extremes. An example would be an input of 4*pi, as both 0 and 2*pi would be valid mappings.

There has been interest in this topic in the community (see https://stackoverflow.com/questions/15927755/opposite-of-numpy-unwrap).

Similar functions exist for Matlab (see https://de.mathworks.com/help/map/ref/wrapto2pi.html). They solved the ambiguity by mapping "positive multiples of 2*pi map to 2*pi and negative multiples of 2*pi map to 0" for the 0 to 2*pi case.

From njs at pobox.com  Mon Nov 23 20:49:07 2020
From: njs at pobox.com (Nathaniel Smith)
Date: Mon, 23 Nov 2020 17:49:07 -0800
Subject: [Numpy-discussion] NumPy Feature Request: Function to wrap angles to range [ 0, 2*pi] or [ -pi, pi ]
In-Reply-To: References: Message-ID:

How would this proposed function compare to using the modulo operator, like 'arr % (2*pi)'?

On Mon, Nov 23, 2020, 16:13 Thomas wrote:
> Hi,
>
> I have a proposal for a feature and I hope this is the right place to
> post this.
>
> The idea is to have a function to map any input angle to the range of
> [ 0, 2*pi ] or [ -pi, pi ].
>
> There already is a function called 'unwrap' that does the opposite, so
> I'd suggest calling this function 'wrap'.
> _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > From thomasbbrunner at gmail.com Tue Nov 24 04:25:57 2020 From: thomasbbrunner at gmail.com (Thomas) Date: Tue, 24 Nov 2020 10:25:57 +0100 Subject: [Numpy-discussion] NumPy Feature Request: Function to wrap angles to range [ 0, 2*pi] or [ -pi, pi ] In-Reply-To: <097a376b-d153-d6b7-8e6f-d305d444c099@grinta.net> References: <097a376b-d153-d6b7-8e6f-d305d444c099@grinta.net> Message-ID: Like Nathaniel said, it would not improve much when compared to the modulo operator. It could handle the edge cases better, but really the biggest benefit would be that it is more convenient. And as the "unwrap" function already exists, people would expect that and look for a function for the inverse operation (at least I did). On Tue, 24 Nov 2020 at 09:22, Daniele Nicolodi wrote: > On 24/11/2020 02:49, Nathaniel Smith wrote: > > How would this proposed function compare to using the modulo operator, > > like 'arr % (2*pi)'? > > I wrote almost the same word bu word reply, before realizing that taking > the modulo looses the sign. The correct operation is slightly more > complex (untested): > > def wrap(alpha): > return (alpha + np.pi) % 2.0 * np.pi - np.pi > > However, I don't think there is much value in adding something so > trivial as a function to numpy: I cannot think to any commonly used > algorithm that requires wrapping the phase, and it is going to be an > infinite source of bikesheeding whether the wrapped range should be > [-pi, pi) or (-pi, pi] or (0, 2*pi] or [0, 2*pi) > > Cheers, > Dan > > > > On Mon, Nov 23, 2020, 16:13 Thomas > > wrote: > > > > Hi, > > > > I have a proposal for a feature and I hope this is the right place > > to post this. > > > > The idea is to have a function to map any input angle to the range > > of [ 0, 2*pi ] or [ - pi, pi ]. > > > > There already is a function called 'unwrap' that does the opposite, > > so I'd suggest calling this function 'wrap'. > > > > Example usage: > > # wrap to range [ 0, 2*pi ] > > >>> np.wrap([ -2*pi, -pi, 0, 4*pi ]) > > [0, pi, 0, 2*pi] > > > > There is some ambiguity regarding what the solution should be for > > the extremes. An example would be an input of 4*pi, as both 0 and > > 2*pi would be valid mappings. > > > > There has been interest for this topic in the community > > (see > https://stackoverflow.com/questions/15927755/opposite-of-numpy-unwrap > > < > https://stackoverflow.com/questions/15927755/opposite-of-numpy-unwrap>). > > > > Similar functions exist for Matlab > > (see https://de.mathworks.com/help/map/ref/wrapto2pi.html > > ). They solved > > the ambiguity by mapping "positive multiples of 2*pi map to 2*pi and > > negative multiples of 2*pi map to 0." for the 0 to 2*pi case. 
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at python.org
https://mail.python.org/mailman/listinfo/numpy-discussion

From thomasbbrunner at gmail.com  Tue Nov 24 04:25:57 2020
From: thomasbbrunner at gmail.com (Thomas)
Date: Tue, 24 Nov 2020 10:25:57 +0100
Subject: [Numpy-discussion] NumPy Feature Request: Function to wrap angles to range [ 0, 2*pi] or [ -pi, pi ]
In-Reply-To: <097a376b-d153-d6b7-8e6f-d305d444c099@grinta.net>
References: <097a376b-d153-d6b7-8e6f-d305d444c099@grinta.net>
Message-ID:

Like Nathaniel said, it would not improve much when compared to the modulo operator.

It could handle the edge cases better, but really the biggest benefit would be that it is more convenient. And as the "unwrap" function already exists, people would expect that and look for a function for the inverse operation (at least I did).

On Tue, 24 Nov 2020 at 09:22, Daniele Nicolodi wrote:
> On 24/11/2020 02:49, Nathaniel Smith wrote:
> > How would this proposed function compare to using the modulo operator,
> > like 'arr % (2*pi)'?
>
> I wrote almost the same word by word reply, before realizing that taking
> the modulo loses the sign. The correct operation is slightly more
> complex (untested):
>
>     def wrap(alpha):
>         return (alpha + np.pi) % (2.0 * np.pi) - np.pi
>
> However, I don't think there is much value in adding something so
> trivial as a function to numpy: I cannot think of any commonly used
> algorithm that requires wrapping the phase, and it is going to be an
> infinite source of bikeshedding whether the wrapped range should be
> [-pi, pi) or (-pi, pi] or (0, 2*pi] or [0, 2*pi).
>
> Cheers,
> Dan
>
> > On Mon, Nov 23, 2020, 16:13 Thomas wrote:
> > > Hi,
> > >
> > > I have a proposal for a feature and I hope this is the right place
> > > to post this.
> > >
> > > The idea is to have a function to map any input angle to the range
> > > of [ 0, 2*pi ] or [ -pi, pi ].
> > >
> > > There already is a function called 'unwrap' that does the opposite,
> > > so I'd suggest calling this function 'wrap'.
> > >
> > > Example usage:
> > >
> > >     # wrap to range [ 0, 2*pi ]
> > >     >>> np.wrap([ -2*pi, -pi, 0, 4*pi ])
> > >     [0, pi, 0, 2*pi]
> > >
> > > There is some ambiguity regarding what the solution should be for
> > > the extremes. An example would be an input of 4*pi, as both 0 and
> > > 2*pi would be valid mappings.
> > >
> > > There has been interest in this topic in the community (see
> > > https://stackoverflow.com/questions/15927755/opposite-of-numpy-unwrap).
> > >
> > > Similar functions exist for Matlab (see
> > > https://de.mathworks.com/help/map/ref/wrapto2pi.html). They solved
> > > the ambiguity by mapping "positive multiples of 2*pi map to 2*pi and
> > > negative multiples of 2*pi map to 0" for the 0 to 2*pi case.

From daniele at grinta.net  Tue Nov 24 06:37:15 2020
From: daniele at grinta.net (Daniele Nicolodi)
Date: Tue, 24 Nov 2020 12:37:15 +0100
Subject: [Numpy-discussion] NumPy Feature Request: Function to wrap angles to range [ 0, 2*pi] or [ -pi, pi ]
In-Reply-To: References: <097a376b-d153-d6b7-8e6f-d305d444c099@grinta.net>
Message-ID:

On 24/11/2020 10:25, Thomas wrote:
> Like Nathaniel said, it would not improve much when compared to the
> modulo operator.
>
> It could handle the edge cases better, but really the biggest benefit
> would be that it is more convenient.

Which edge cases? Better how?

> And as the "unwrap" function already exists,

The unwrap() function exists because it is not as trivial.

> people would expect that and look for a function for the inverse
> operation (at least I did).

What is your use of a wrap() function? I cannot think of any.

Cheers,
Dan

From malyasova.viktoriya at yandex.ru  Tue Nov 24 07:49:18 2020
From: malyasova.viktoriya at yandex.ru (Viktoriya Malyasova)
Date: Tue, 24 Nov 2020 15:49:18 +0300
Subject: [Numpy-discussion] Added Rivest-Floyd selection algorithm as an option to numpy.partition
Message-ID: <18811251606219790@mail.yandex.ru>

An HTML attachment was scrubbed...

From ralf.gommers at gmail.com  Tue Nov 24 09:46:45 2020
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Tue, 24 Nov 2020 14:46:45 +0000
Subject: [Numpy-discussion] NumPy Feature Request: Function to wrap angles to range [ 0, 2*pi] or [ -pi, pi ]
In-Reply-To: References: <097a376b-d153-d6b7-8e6f-d305d444c099@grinta.net>
Message-ID:

On Tue, Nov 24, 2020 at 11:37 AM Daniele Nicolodi wrote:
> On 24/11/2020 10:25, Thomas wrote:
> > Like Nathaniel said, it would not improve much when compared to the
> > modulo operator.
> >
> > It could handle the edge cases better, but really the biggest benefit
> > would be that it is more convenient.
>
> Which edge cases? Better how?
>
> > And as the "unwrap" function already exists,
>
> The unwrap() function exists because it is not as trivial.

I agree, we prefer not to add trivial functions like this. To help those few people that may need this, maybe just add the one-liner Daniele gave to the Notes section of unwrap()?
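
(A quick check of Daniele's one-liner, with the 2*pi grouping made explicit; this maps to the half-open interval [-pi, pi):)

    import numpy as np

    def wrap(alpha):
        # map angles to [-pi, pi)
        return (alpha + np.pi) % (2.0 * np.pi) - np.pi

    print(wrap(np.array([-2 * np.pi, -np.pi, 0.0, 4 * np.pi])))
    # [ 0.         -3.14159265  0.          0.        ]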
Cheers,
Ralf

> > people would expect that and look for a function for the inverse
> > operation (at least I did).
>
> What is your use of a wrap() function? I cannot think of any.
>
> Cheers,
> Dan
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From pierre.augier at univ-grenoble-alpes.fr  Tue Nov 24 10:47:05 2020
From: pierre.augier at univ-grenoble-alpes.fr (PIERRE AUGIER)
Date: Tue, 24 Nov 2020 16:47:05 +0100 (CET)
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
Message-ID: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>

Hi,

I recently took a bit of time to study the comment "The ecological impact of high-performance computing in astrophysics" published in Nature Astronomy (Zwart, 2020, https://www.nature.com/articles/s41550-020-1208-y, https://arxiv.org/pdf/2009.11295.pdf), where it is stated that "Best however, for the environment is to abandon Python for a more environmentally friendly (compiled) programming language.".

I wrote a simple Python-Numpy implementation of the problem used for this study (https://www.nbabel.org) and, accelerated by Transonic-Pythran, it's very efficient. Here are some numbers (elapsed times in s, smaller is better):

| # particles | Py  | C++ | Fortran | Julia |
|-------------|-----|-----|---------|-------|
| 1024        | 29  | 55  | 41      | 45    |
| 2048        | 123 | 231 | 166     | 173   |

The code and a modified figure are here: https://github.com/paugier/nbabel (There is no check on the results for https://www.nbabel.org, so one still has to be very careful.)

I think that the Numpy community should spend a bit of energy to show what can be done with the existing tools to get very high performance (and low CO2 production) with Python. This work could be the basis of a serious reply to the comment by Zwart (2020).

Unfortunately the Python solution in https://www.nbabel.org is very bad in terms of performance (and therefore CO2 production). It is also true for most of the Python solutions for the Computer Language Benchmarks Game in https://benchmarksgame-team.pages.debian.net/benchmarksgame/ (codes here https://salsa.debian.org/benchmarksgame-team/benchmarksgame#what-else).

We could try to fix this so that people see that in many cases, it is not necessary to "abandon Python for a more environmentally friendly (compiled) programming language". One of the longest and hardest tasks would be to implement the different cases of the Computer Language Benchmarks Game in standard and modern Python-Numpy. Then, optimizing and accelerating such code should be doable and we should be able to get very good performance at least for some cases. Good news for this project: (i) the first point can be done by anyone with good knowledge in Python-Numpy (many potential workers), (ii) for some cases, there are already good Python implementations, and (iii) the work can easily be parallelized.
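
To give a flavor of what "moving the compiled-interpreted boundary outside the hot loops" looks like, here is a minimal sketch (assuming Transonic's `boost` decorator with Pythran as the backend; not the actual nbabel code):

    import numpy as np
    from transonic import boost

    @boost
    def norm2(x: "float[:]"):
        # explicit hot loop, compiled ahead of time by Pythran;
        # falls back to plain Python-NumPy when no backend is used
        total = 0.0
        for i in range(x.shape[0]):
            total += x[i] * x[i]
        return total

    print(norm2(np.ones(10)))  # 10.0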
From ilhanpolat at gmail.com Tue Nov 24 12:11:13 2020
From: ilhanpolat at gmail.com (Ilhan Polat)
Date: Tue, 24 Nov 2020 18:11:13 +0100
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>

Do we have to take it seriously to start with? Because, with absolutely no
offense meant, I am having significant difficulty doing so.

On Tue, Nov 24, 2020 at 4:58 PM PIERRE AUGIER <pierre.augier at univ-grenoble-alpes.fr> wrote:
> [...]
From sebastian at sipsolutions.net Tue Nov 24 12:25:02 2020
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Tue, 24 Nov 2020 11:25:02 -0600
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
Message-ID: <11dd8b14a9b25d8d1125e62079025ab629cb4648.camel@sipsolutions.net>

On Tue, 2020-11-24 at 16:47 +0100, PIERRE AUGIER wrote:
> [...]
> Is there already something planned to answer Zwart (2020)?

I don't think there is any need for rebuttal. The author is right: you
should not write the core of an N-body simulation in Python :). I
completely disagree with the focus on programming languages/tooling, quite
honestly. A PhD who writes performance-critical code must get the
education necessary to do it well. That may mean learning something beyond
Python, but not replacing Python entirely.

In one point the opinion notes:

    NumPy, for example, is mostly used for its advanced array handling and
    support functions. Using these will reduce runtime and, therefore,
    also carbon emission, but optimization is generally stopped as soon as
    the calculation runs within an unconsciously determined reasonable
    amount of time, such as the coffee-refill timescale or a holiday
    weekend.

IMO, this applies to any other programming language just as much. If your
correlation is fast enough, you will not invest time in implementing an
FFT-based algorithm.
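To make that concrete, the algorithmic gap looks like this (a minimal
sketch; np.correlate uses the direct O(n^2) method):

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.standard_normal(4096)
    y = rng.standard_normal(4096)

    # Direct cross-correlation: O(n**2) operations, all in compiled C.
    c_direct = np.correlate(x, y, mode="full")

    # FFT-based equivalent: O(n log n) operations.
    n = x.size + y.size - 1
    nfft = 1 << (n - 1).bit_length()   # next power of two >= n
    c_fft = np.fft.irfft(
        np.fft.rfft(x, nfft) * np.fft.rfft(y[::-1], nfft), nfft
    )[:n]

    assert np.allclose(c_direct, c_fft)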
If you iterate your array in Fortran order instead of C order in your C++
program (which new users may just do randomly), you are likely to waste
more(!) CPU cycles than if you were using NumPy :). Personally, I am
always curious how much of that "GPUs are faster" factor is actually due
to the effort spent on making it faster...

My angle is that in the end, it is far more about technical knowledge than
about using the "right" language. An example: at an old workplace we had
had some simulations running five times slower, because years earlier
someone forgot to set `RELEASE=True` in the default config, always
compiling in debug mode! But honestly, if it was 5 times faster, we would
probably have done at least 3 times as many simulations :). Aside from
that, most complex C/C++ programs can probably be sped up significantly
just as well.

In the end, my main reading is that code running on power-hungry machines
(clusters, workstations) should maybe be audited for performance. Yes!
(Although even then, resources tend to get used, no matter how much you
have!)

As for actually doing something to reduce the carbon footprint, I think
the vast majority of our users would have more impact if they throttle
their CPUs a bit rather than worry about what tool they use to do their
job :).

Cheers,

Sebastian

From andy.terrel at gmail.com Tue Nov 24 12:27:52 2020
From: andy.terrel at gmail.com (Andy Ray Terrel)
Date: Tue, 24 Nov 2020 11:27:52 -0600
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>

I think we, the community, do have to take it seriously. NumPy and the
rest of the ecosystem are trying to raise money to hire developers. This
sentiment, which is much wider than a single paper, is a prevalent
roadblock.

-- Andy

On Tue, Nov 24, 2020 at 11:12 AM Ilhan Polat <ilhanpolat at gmail.com> wrote:
> Do we have to take it seriously to start with? Because, with absolutely
> no offense meant, I am having significant difficulty doing so.
> > [...]

From Jerome.Kieffer at esrf.fr Tue Nov 24 12:41:45 2020
From: Jerome.Kieffer at esrf.fr (Jerome Kieffer)
Date: Tue, 24 Nov 2020 18:41:45 +0100
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
Message-ID: <20201124184145.49580b75@patagonia>

Hi Pierre,

I agree with your point of view: the author wants to demonstrate that C++
and Fortran are better than Python... and environmentally speaking he has
some evidence.

We develop with Python, Cython, Numpy, and OpenCL, and what annoys me most
is the compilation time needed during the development of those statically
typed, ahead-of-time-compiled extensions (C++, C, Fortran).

Clearly the author wants to get his article viral, and in a sense he
managed :). But he did not mention Julia / Numba and other JIT-compiled
languages (including Matlab?) that probably outperform C++ / Fortran once
development and test time are taken into account. Besides this, the OpenMP
parallelism (implicitly advertised) is far from scaling well on
multi-socket systems, and other programming paradigms are needed to
extract the best performance from supercomputers.

Cheers,

Jerome

From compl.yue at icloud.com Tue Nov 24 13:21:53 2020
From: compl.yue at icloud.com (YueCompl)
Date: Wed, 25 Nov 2020 02:21:53 +0800
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
Message-ID: <488160C0-E5A0-40DA-81EB-55266B81495D@icloud.com>

Is there some community interest to develop fusion-based high-performance
array programming? Something like
https://github.com/AccelerateHS/accelerate#an-embedded-language-for-accelerated-array-computations,
but that embedded DSL is far less pleasing than Python as the surface
language for optimized Numpy code in C.

I imagine that we might be able to transpile a Numpy program into fused
LLVM IR, then deploy part as host code on CPUs and part as CUDA code on
GPUs?
I know Numba is already doing the array part, but it is too limited in
addressing more complex non-array data structures. I had been approaching
~20K separate data series with some intermediate variables for each; it
took up to 30+ GB of RAM and kept compiling, yet gave no result after 10+
hours.

Compl

> On 2020-11-24, at 23:47, PIERRE AUGIER <pierre.augier at univ-grenoble-alpes.fr> wrote:
> [...]
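For readers who have not used it, the array part that Numba does handle
can be sketched like this (a minimal sketch, assuming Numba is installed;
the function is made up for illustration):

    import numpy as np
    from numba import njit

    @njit
    def fused_norm(xs, ys, zs):
        # The products and the sum are fused into a single compiled
        # loop: no temporary arrays are allocated along the way.
        total = 0.0
        for i in range(xs.size):
            total += xs[i] * xs[i] + ys[i] * ys[i] + zs[i] * zs[i]
        return np.sqrt(total)

    # The first call triggers compilation; later calls run at native speed.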
From sebastian at sipsolutions.net Tue Nov 24 13:22:50 2020
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Tue, 24 Nov 2020 12:22:50 -0600
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: <20201124184145.49580b75@patagonia>
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr> <20201124184145.49580b75@patagonia>

On Tue, 2020-11-24 at 18:41 +0100, Jerome Kieffer wrote:
> [...]

As an interesting aside: algorithms may have actually improved *more* than
computational speed when it comes to performance [1]. That shows the
impressive scale and complexity of efficient code.

So, I could possibly argue that the most important thing may well be
accessibility of algorithms. And I think that is what a large chunk of
Scientific Python packages are all about.

Whether or not that has an impact on the environment...

Cheers,

Sebastian

[1] This was the first resource I found, I am sure there are plenty:
https://www.lanl.gov/conferences/salishan/salishan2004/womble.pdf
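One everyday NumPy illustration of that accessibility is selection instead
of sorting, already built in (a minimal sketch):

    import numpy as np

    rng = np.random.default_rng(0)
    a = rng.standard_normal(10_000_000)
    k = a.size // 2

    # Median element via a full sort: O(n log n).
    m_sort = np.sort(a)[k]

    # The same element via selection (introselect): O(n) on average.
    m_part = np.partition(a, k)[k]

    assert m_sort == m_part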
From einstein.edison at gmail.com Tue Nov 24 13:27:59 2020
From: einstein.edison at gmail.com (Hameer Abbasi)
Date: Tue, 24 Nov 2020 19:27:59 +0100
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: <488160C0-E5A0-40DA-81EB-55266B81495D@icloud.com>
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr> <488160C0-E5A0-40DA-81EB-55266B81495D@icloud.com>

Hello,

We're trying to do a part of this in the TACO team, and with a Python
wrapper in the form of PyData/Sparse. It will allow abstract
array handling/scheduling to take place, but there are a bunch of
constraints, the most important one being that a C compiler cannot be
required at runtime.

However, this may take a while to materialize, as we need an LLVM backend,
a Python wrapper (matching the NumPy API), and support for arbitrary
functions (like universal functions).

https://github.com/tensor-compiler/taco
http://fredrikbk.com/publications/kjolstad-thesis.pdf

--
Sent from Canary (https://canarymail.io)

> On Dienstag, Nov. 24, 2020 at 7:22 PM, YueCompl <compl.yue at icloud.com> wrote:
> [...]
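On the PyData/Sparse side, the NumPy-matching interface already exists
today; roughly (a minimal sketch, assuming the `sparse` package is
installed; check the current docs, since API details may differ between
versions):

    import numpy as np
    import sparse

    # A 2-D COO array with ~1% nonzeros that behaves much like an ndarray.
    s = sparse.random((1000, 1000), density=0.01)

    col_sums = s.sum(axis=0)     # reductions follow the NumPy API
    d = s.todense()              # convert back when needed
    assert np.allclose(col_sums.todense(), d.sum(axis=0))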
From ilhanpolat at gmail.com Tue Nov 24 13:50:41 2020
From: ilhanpolat at gmail.com (Ilhan Polat)
Date: Tue, 24 Nov 2020 19:50:41 +0100
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr> <488160C0-E5A0-40DA-81EB-55266B81495D@icloud.com>

Measuring the running time of a program in an arbitrary programming
language is not an objective metric.
Otherwise we would force everyone to code in Assembler and be done as
quickly as possible. Hire 5 people to come to the workplace for 6 months
to optimize it, and their transportation alone will eat into the savings.
There is a reason for not doing so. Alternatively, any time shaved off
from this will be spent on the extremely inefficient i9 laptops that
developers use while debugging the type issues. As the author themselves
admits, the development speed would justify the loss encountered from the
actual code running. So this study is suggestive at the very best and,
like my rebuttal, very difficult to verify.

I do Industrial IoT for a living, and while I wholeheartedly agree with
the intentions, I would seriously question the power metrics given here,
because similarly I can easily show a steel factory to be very efficient
if I am not careful. Especially tying the code quality to the programming
language is a very slippery slope that I have been hearing from Fortran
people for the last 20 years.

> I think we, the community, do have to take it seriously. NumPy and the
> rest of the ecosystem are trying to raise money to hire developers. This
> sentiment, which is much wider than a single paper, is a prevalent
> roadblock.

I don't get this sentence.

On Tue, Nov 24, 2020 at 7:29 PM Hameer Abbasi <einstein.edison at gmail.com> wrote:
> [...]
From ben.v.root at gmail.com Tue Nov 24 13:52:51 2020
From: ben.v.root at gmail.com (Benjamin Root)
Date: Tue, 24 Nov 2020 13:52:51 -0500
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr> <20201124184145.49580b75@patagonia>

Given that AWS and Azure have both made commitments to have their data
centers be carbon neutral, and given that electricity and heat production
make up ~25% of GHG pollution, I find these sorts of
power-usage-analysis-for-the-sake-of-the-environment to be a bit
disingenuous. Especially since GHG pollution from power generation is
forecast to shrink as more power is generated by alternative means. I am
fine with improving Python performance, but let's not fool ourselves into
thinking that it is going to have any meaningful impact on the environment.

Ben Root

https://sustainability.aboutamazon.com/environment/the-cloud?energyType=true
https://azure.microsoft.com/en-au/global-infrastructure/sustainability/#energy-innovations
https://www.epa.gov/ghgemissions/global-greenhouse-gas-emissions-data

On Tue, Nov 24, 2020 at 1:25 PM Sebastian Berg <sebastian at sipsolutions.net> wrote:
> [...]
From charlesr.harris at gmail.com Tue Nov 24 14:06:40 2020
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 24 Nov 2020 12:06:40 -0700
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr> <20201124184145.49580b75@patagonia>

On Tue, Nov 24, 2020 at 11:54 AM Benjamin Root <ben.v.root at gmail.com> wrote:
> [...] I am fine with improving Python performance, but let's not fool
> ourselves into thinking that it is going to have any meaningful impact
> on the environment.

Bingo. I lived through the Freon ozone panic that lasted for 20 years,
even after the key reaction rate was remeasured and found to be 75-100
times slower than that used in the research that started the panic. The
models never recovered, but the panic persisted until it magically
disappeared in 1994. There are still ozone holes over the Antarctic; last
time I looked they were explained as due to an influx of cold air.

If you want to deal with GHG, push nuclear power.

Chuck

From ben.v.root at gmail.com Tue Nov 24 14:27:17 2020
From: ben.v.root at gmail.com (Benjamin Root)
Date: Tue, 24 Nov 2020 14:27:17 -0500
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr> <20201124184145.49580b75@patagonia>

Digressing here, but the ozone hole over the Antarctic was always going to
take time to recover because of the approximately 50-year residence time
of the CFCs in the upper atmosphere. Cold temperatures can actually speed
up depletion because of certain ice crystal formations that give a boost
to the CFC+sunlight+O3 reaction rate. Note that it doesn't mean that 50
years are needed to get rid of all CFCs in the atmosphere; it is just a
measure of the amount of time it is expected to take for half of the gas
that is already there to be removed. That doesn't account for the amount
of time it has taken for CFC usage to drop in the first place, and the
fact that there is still CFC pollution occurring (albeit far less than in
the 80's).
Ben Root

https://ozone.unep.org/nasa-provides-first-direct-evidence-ozone-hole-recovery
https://csl.noaa.gov/assessments/ozone/1998/faq11.html

On Tue, Nov 24, 2020 at 2:07 PM Charles R Harris <charlesr.harris at gmail.com> wrote:
> [...]

From charlesr.harris at gmail.com Tue Nov 24 14:43:01 2020
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 24 Nov 2020 12:43:01 -0700
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr> <20201124184145.49580b75@patagonia>

On Tue, Nov 24, 2020 at 12:28 PM Benjamin Root <ben.v.root at gmail.com> wrote:
> [...]

Out of curiosity, has the ice crystal acceleration been established in the
lab? I recall it being proposed to help save the models, but that was a
long time ago. IIRC, another reaction rate was remeasured in 2005 and
found to be 10X lower than thought, but I don't recall which one. I've
been looking for a good recent review article to see what the current
status is. The funding mostly disappeared after 1994, along with several
careers. Freon is still used -- off the books -- in several countries, a
phenomenon now seen with increasing coal generation.

Chuck
From alan.isaac at gmail.com Tue Nov 24 14:50:07 2020
From: alan.isaac at gmail.com (Alan G. Isaac)
Date: Tue, 24 Nov 2020 14:50:07 -0500
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr> <20201124184145.49580b75@patagonia>

On 11/24/2020 2:06 PM, Charles R Harris wrote:
> There are still ozone holes over the Antarctic, last time I looked they
> were explained as due to an influx of cold air.

I believe industrial CFC usage, which has fallen since the Montreal
Protocol, is still considered the primary culprit in ozone layer thinning.
Is there a particular model you have in mind? (Ideally one with publicly
available source code and some data.)

On 11/24/2020 2:06 PM, Charles R Harris wrote:
> If you want to deal with GHG, push nuclear power.

Yes. However, solar is becoming competitive in some regions for cost per
watt, and avoids the worst waste disposal issues.

fwiw,
Alan Isaac

From asmeurer at gmail.com Tue Nov 24 16:10:37 2020
From: asmeurer at gmail.com (Aaron Meurer)
Date: Tue, 24 Nov 2020 14:10:37 -0700
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>

This always seems like such a ridiculous argument. If CO2 emissions are
directly proportional to the time it takes for a program to run, then
there's no real need to concern ourselves with it. People already have a
direct reason to avoid programs that take a long time to run, namely, that
they take a long time to run. If I have two codes that compute the same
thing and one takes a week and the other takes a few minutes, then
obviously I will choose the one that takes a few minutes, and my decision
will have nothing to do with ecological impact. The real issue with CO2
emissions lies in instances where the agency is completely removed and the
people damaging the environment don't suffer any ill effects from it.

It would be more intellectually honest to try to determine why it is that
people choose Python, an apparently very slow language, to do
high-performance computing. If one spends even a moment thinking about
this, and actually looking at what the real scientific Python community
does, one would realize that simply having a fast core in Python is enough
for the majority of performance. NumPy array expressions are fast because
the core loops are fast, and those dominate the runtime for the majority
of uses. And for instances where that isn't fast enough, e.g., when
writing a looping algorithm directly, there are multiple tools that allow
writing fast Python or Python-like code, such as Numba, Cython, Pythran,
PyPy, and so on.

Aaron Meurer

On Tue, Nov 24, 2020 at 8:57 AM PIERRE AUGIER <pierre.augier at univ-grenoble-alpes.fr> wrote:
> [...]
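The fast-core point is easy to see in a quick measurement (a minimal
sketch; absolute numbers are machine-dependent):

    import numpy as np
    from timeit import timeit

    a = np.random.default_rng(0).standard_normal(1_000_000)

    def py_loop(arr):
        total = 0.0
        for x in arr:                   # one interpreter step per element
            total += x * x
        return total

    def np_expr(arr):
        return float(np.dot(arr, arr))  # the loop runs in compiled C

    t_py = timeit(lambda: py_loop(a), number=10)
    t_np = timeit(lambda: np_expr(a), number=10)
    # t_py / t_np is typically in the hundreds.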
> > I wrote a simple Python-Numpy implementation of the problem used for this study (https://www.nbabel.org) and, accelerated by Transonic-Pythran, it's very efficient. Here are some numbers (elapsed times in s, smaller is better): > > | # particles | Py | C++ | Fortran | Julia | > |-------------|-----|-----|---------|-------| > | 1024 | 29 | 55 | 41 | 45 | > | 2048 | 123 | 231 | 166 | 173 | > > The code and a modified figure are here: https://github.com/paugier/nbabel (There is no check on the results for https://www.nbabel.org, so one still has to be very careful.) > > I think that the Numpy community should spend a bit of energy to show what can be done with the existing tools to get very high performance (and low CO2 production) with Python. This work could be the basis of a serious reply to the comment by Zwart (2020). > > Unfortunately the Python solution in https://www.nbabel.org is very bad in terms of performance (and therefore CO2 production). It is also true for most of the Python solutions for the Computer Language Benchmarks Game in https://benchmarksgame-team.pages.debian.net/benchmarksgame/ (codes here https://salsa.debian.org/benchmarksgame-team/benchmarksgame#what-else). > > We could try to fix this so that people see that in many cases, it is not necessary to "abandon Python for a more environmentally friendly (compiled) programming language". One of the longest and hardest task would be to implement the different cases of the Computer Language Benchmarks Game in standard and modern Python-Numpy. Then, optimizing and accelerating such code should be doable and we should be able to get very good performance at least for some cases. Good news for this project, (i) the first point can be done by anyone with good knowledge in Python-Numpy (many potential workers), (ii) for some cases, there are already good Python implementations and (iii) the work can easily be parallelized. > > It is not a criticism, but the (beautiful and very nice) new Numpy website https://numpy.org/ is not very convincing in terms of performance. It's written "Performant The core of NumPy is well-optimized C code. Enjoy the flexibility of Python with the speed of compiled code." It's true that the core of Numpy is well-optimized C code but to seriously compete with C++, Fortran or Julia in terms of numerical performance, one needs to use other tools to move the compiled-interpreted boundary outside the hot loops. So it could be reasonable to mention such tools (in particular Numba, Pythran, Cython and Transonic). > > Is there already something planned to answer to Zwart (2020)? > > Any opinions or suggestions on this potential project? > > Pierre > > PS: Of course, alternative Python interpreters (PyPy, GraalPython, Pyjion, Pyston, etc.) could also be used, especially if HPy (https://github.com/hpyproject/hpy) is successful (C core of Numpy written in HPy, Cython able to produce HPy code, etc.). However, I tend to be a bit skeptical in the ability of such technologies to reach very high performance for low-level Numpy code (performance that can be reached by replacing whole Python functions with optimized compiled code). Of course, I hope I'm wrong! IMHO, it does not remove the need for a successful HPy! 
> --
> Pierre Augier - CR CNRS           http://www.legi.grenoble-inp.fr
> LEGI (UMR 5519) Laboratoire des Ecoulements Geophysiques et Industriels
> BP53, 38041 Grenoble Cedex, France          tel:+33.4.56.52.86.16
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion

From thomasbbrunner at gmail.com  Tue Nov 24 17:54:09 2020
From: thomasbbrunner at gmail.com (Thomas)
Date: Tue, 24 Nov 2020 23:54:09 +0100
Subject: [Numpy-discussion] NumPy Feature Request: Function to wrap angles to range [ 0, 2*pi] or [ -pi, pi ]
In-Reply-To: 
References: <097a376b-d153-d6b7-8e6f-d305d444c099@grinta.net>
Message-ID: 

I use my own implementation of the wrap function in kinematics and
kinetics (robotics). Solutions beyond [0, 2pi] or [-pi, pi] can cause
some problems when combined with learning algorithms, so we wrap them.

Interestingly, today I reviewed code for a teammate. He had the exact
same problem, but did not think much about it and solved it with if-else
statements.

But yes, maybe this is too specific and trivial for a Numpy function.
Thanks for taking the time to look into it!

On Tue, 24 Nov 2020 at 15:47, Ralf Gommers wrote:
> On Tue, Nov 24, 2020 at 11:37 AM Daniele Nicolodi wrote:
>> On 24/11/2020 10:25, Thomas wrote:
>>> Like Nathaniel said, it would not improve much when compared to the
>>> modulo operator.
>>>
>>> It could handle the edge cases better, but really the biggest benefit
>>> would be that it is more convenient.
>>
>> Which edge cases? Better how?
>>
>>> And as the "unwrap" function already exists,
>>
>> The unwrap() function exists because it is not as trivial.
>
> I agree, we prefer not to add trivial functions like this. To help those
> few people that may need this, maybe just add the one-liner Daniele gave
> to the Notes section of unwrap()?
>
> Cheers,
> Ralf
>
>>> people would expect that and look for a function for the inverse
>>> operation (at least I did).
>>
>> What is your use of a wrap() function? I cannot think of any.
>>
>> Cheers,
>> Dan
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From sebastian at sipsolutions.net  Tue Nov 24 20:04:26 2020
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Tue, 24 Nov 2020 19:04:26 -0600
Subject: [Numpy-discussion] NumPy Community Meeting Wednesday -- One hour earlier from now on!
Message-ID: <0f62311e1dd4f21833c6d6694fb1803316fc9a2f.camel@sipsolutions.net>

Hi all,

There will be a NumPy Community meeting Wednesday November 25th at 12pm
Pacific Time (19:00 UTC). Everyone is invited and encouraged to join in
and edit the work-in-progress meeting topics and notes at:

https://hackmd.io/76o-IxCjQX2mOXO_wwkcpg?both

Best wishes

Sebastian
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: This is a digitally signed message part
URL: 

From ewm at redtetrahedron.org  Wed Nov 25 00:33:47 2020
From: ewm at redtetrahedron.org (Eric Moore)
Date: Wed, 25 Nov 2020 00:33:47 -0500
Subject: [Numpy-discussion] NumPy Feature Request: Function to wrap angles to range [ 0, 2*pi] or [ -pi, pi ]
In-Reply-To: 
References: <097a376b-d153-d6b7-8e6f-d305d444c099@grinta.net>
Message-ID: 

On Tue, Nov 24, 2020 at 6:38 AM Daniele Nicolodi wrote:
> [...]
>
> What is your use of a wrap() function? I cannot think of any.
>
> Cheers,
> Dan

For what it's worth, this kind of range reduction can be extremely
nontrivial depending on the needs of your application. Look at the
efforts needed to ensure that the trigonometric functions give good
results. This is discussed in this paper:
https://www.csee.umbc.edu/~phatak/645/supl/Ng-ArgReduction.pdf.
I don't think that this belongs in Numpy, but it certainly isn't a
one-liner.

Best,

Eric
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
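For reference, the one-liners under discussion are simple for well-scaled
inputs (a sketch in plain NumPy; the precision caveats Eric raises apply
for very large angles):

    import numpy as np

    angles = np.array([-3.5 * np.pi, 0.5, 7.0])

    # Wrap to [0, 2*pi):
    wrapped = np.mod(angles, 2 * np.pi)

    # Wrap to [-pi, pi):
    centered = np.mod(angles + np.pi, 2 * np.pi) - np.pi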
From compl.yue at icloud.com  Wed Nov 25 02:55:09 2020
From: compl.yue at icloud.com (YueCompl)
Date: Wed, 25 Nov 2020 15:55:09 +0800
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: 
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
 <20201124184145.49580b75@patagonia>
Message-ID: <0489E90B-F8AD-41F3-8007-FA96261A5F37@icloud.com>

I'm imagining a study of programmer and maintainer time spent on a given
problem, tackled in different programming languages; maybe Python can be
shown, on the contrary, to reduce GHG. It goes like this: many
programmers/administrators/managers eat beef or the like as they grow up,
cattle produce a great amount of GHG, and optimization experts need more
years to graduate... So Numpy may actually use less emission to gain
greater yields in work demanding optimization, i.e. the ecosystem scales
up to more people who are not optimization experts themselves, yet get
serious work done.

> On 2020-11-25, at 02:52, Benjamin Root wrote:
>
> Given that AWS and Azure have both made commitments to have their data
> centers be carbon neutral, and given that electricity and heat production
> make up ~25% of GHG pollution, I find these sorts of
> power-usage-analysis-for-the-sake-of-the-environment to be a bit
> disingenuous. Especially since GHG pollution from power generation is
> forecast to shrink as more power is generated by alternative means. I am
> fine with improving python performance, but let's not fool ourselves into
> thinking that it is going to have any meaningful impact on the
> environment.
>
> Ben Root
>
> https://sustainability.aboutamazon.com/environment/the-cloud?energyType=true
> https://azure.microsoft.com/en-au/global-infrastructure/sustainability/#energy-innovations
> https://www.epa.gov/ghgemissions/global-greenhouse-gas-emissions-data
>
> On Tue, Nov 24, 2020 at 1:25 PM Sebastian Berg wrote:
>> On Tue, 2020-11-24 at 18:41 +0100, Jerome Kieffer wrote:
>>> Hi Pierre,
>>>
>>> I agree with your point of view: the author wants to demonstrate that
>>> C++ and Fortran are better than Python... and environmentally speaking
>>> he has some evidence.
>>>
>>> We develop with Python, Cython, Numpy, and OpenCL and what annoys me
>>> most is the compilation time needed for the development of those
>>> statically typed, ahead-of-time compiled extensions (C++, C, Fortran).
>>>
>>> Clearly the author wants his article to go viral and in a sense he
>>> managed :). But he did not mention Julia / Numba and other JIT-compiled
>>> languages (including Matlab?) that are probably outperforming the
>>> C++ / Fortran when considering the development time and test time.
>>> Besides this, the OpenMP parallelism (implicitly advertised) is far
>>> from scaling well on multi-socket systems and other programming
>>> paradigms are needed to extract the best performance from
>>> supercomputers.
>>
>> As an interesting aside: Algorithms may have actually improved *more*
>> than computational speed when it comes to performance [1]. That shows
>> the impressive scale and complexity of efficient code.
>>
>> So, I could possibly argue that the most important thing may well be
>> accessibility of algorithms. And I think that is what a large chunk of
>> Scientific Python packages are all about.
>>
>> Whether or not that has an impact on the environment...
>>
>> Cheers,
>>
>> Sebastian
>>
>> [1] This was the first resource I found, I am sure there are plenty:
>> https://www.lanl.gov/conferences/salishan/salishan2004/womble.pdf
>>
>>> Cheers,
>>>
>>> Jerome
>>>
>>> _______________________________________________
>>> NumPy-Discussion mailing list
>>> NumPy-Discussion at python.org
>>> https://mail.python.org/mailman/listinfo/numpy-discussion
>>
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at python.org
>> https://mail.python.org/mailman/listinfo/numpy-discussion
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From compl.yue at icloud.com  Wed Nov 25 03:06:51 2020
From: compl.yue at icloud.com (YueCompl)
Date: Wed, 25 Nov 2020 16:06:51 +0800
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: 
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
 <488160C0-E5A0-40DA-81EB-55266B81495D@icloud.com>
Message-ID: 

Great to know.

Skimmed through the project readme: so TACO is currently generating C code
as an intermediate language. If the purpose is about tensors, why not
Numba's llvmlite for it?

I'm aware that scheduling code tends not to be array programs, and
llvmlite may have been tailored too much toward optimizing more general
programs well. How is TACO going in this regard?

Compl

> On 2020-11-25, at 02:27, Hameer Abbasi wrote:
>
> Hello,
>
> We're trying to do a part of this in the TACO team, and with a Python
> wrapper in the form of PyData/Sparse.
> It will allow abstract array/scheduling to take place, but there are a
> bunch of constraints, the most important one being that a C compiler
> cannot be required at runtime.
>
> However, this may take a while to materialize, as we need an LLVM
> backend, and a Python wrapper (matching the NumPy API), and support for
> arbitrary functions (like universal functions).
>
> https://github.com/tensor-compiler/taco
> http://fredrikbk.com/publications/kjolstad-thesis.pdf
>
> --
> Sent from Canary
>
> On Tuesday, Nov. 24, 2020 at 7:22 PM, YueCompl wrote:
>> Is there some community interest to develop fusion-based
>> high-performance array programming? Something like
>> https://github.com/AccelerateHS/accelerate#an-embedded-language-for-accelerated-array-computations,
>> but that embedded DSL is far less pleasing compared to Python as the
>> surface language for optimized Numpy code in C.
>>
>> I imagine that we might be able to transpile a Numpy program into fused
>> LLVM IR, then deploy part as host code on CPUs and part as CUDA code on
>> GPUs?
>>
>> I know Numba is already doing the array part, but it is too limited in
>> addressing more complex non-array data structures. I had been
>> approaching ~20K separate data series with some intermediate variables
>> for each; it took up to 30+GB of RAM and kept compiling, yet gave no
>> result after 10+ hours.
>>
>> Compl
>>
>>> On 2020-11-24, at 23:47, PIERRE AUGIER wrote:
>>> [...]
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From einstein.edison at gmail.com  Wed Nov 25 04:17:22 2020
From: einstein.edison at gmail.com (Hameer Abbasi)
Date: Wed, 25 Nov 2020 10:17:22 +0100
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: 
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
 <488160C0-E5A0-40DA-81EB-55266B81495D@icloud.com>
Message-ID: <8cb3f78f-70a6-4e60-aff6-e5050501a5cf@Canary>

Hello,

TACO consists of three things:

- An array API
- A scheduling language
- A language for describing sparse modes of the tensor

So it combines arrays with scheduling, and also sparse tensors, for a lot
of different applications. It also includes an auto-scheduler. The code
thus generated is on par with or faster than, e.g., MKL and other
equivalent libraries, with the ability to do fusion for arbitrary
expressions. This is, for more complicated expressions involving sparse
operands, big-O superior to composing the operations.
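A rough illustration of the composition-versus-fusion point, sketched in
plain NumPy rather than TACO's API (the shapes here are arbitrary):
evaluating operation by operation materializes intermediates, while a
single fused expression lets the library pick an order that avoids them:

    import numpy as np

    rng = np.random.default_rng(0)
    A = rng.random((200, 300))
    B = rng.random((300, 400))
    x = rng.random(400)

    # Composed: materializes the (200, 400) intermediate A @ B.
    y1 = (A @ B) @ x

    # Fused expression: einsum sees the whole contraction and, with
    # optimize=True, picks an order that avoids the large intermediate.
    y2 = np.einsum('ij,jk,k->i', A, B, x, optimize=True)

    np.testing.assert_allclose(y1, y2)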
The limitations are:

- Right now, it can only compute Einstein-summation type expressions;
  we're (along with Rawn, another member of the TACO team) trying to
  extend that to any kind of point-wise expressions and reductions (such
  as exp(tensor), sum(tensor), ...).
- It requires a C compiler at runtime. We're writing an LLVM backend for
  it that will hopefully remove that requirement.
- It can't do arbitrary non-pointwise functions, e.g. SVD or inverse.
  This is a long way from being completely solved.

As for why not Numba/llvmlite: re-writing TACO is a large task that would
be hard to do; wrapping/extending it is much easier.

Best regards,
Hameer Abbasi

--
Sent from Canary (https://canarymail.io)

> On Wednesday, Nov. 25, 2020 at 9:07 AM, YueCompl wrote:
> [...]
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From compl.yue at icloud.com  Wed Nov 25 05:06:56 2020
From: compl.yue at icloud.com (YueCompl)
Date: Wed, 25 Nov 2020 18:06:56 +0800
Subject: [Numpy-discussion] Comment published in Nature Astronomy about The ecological impact of computing with Python
In-Reply-To: <8cb3f78f-70a6-4e60-aff6-e5050501a5cf@Canary>
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
 <488160C0-E5A0-40DA-81EB-55266B81495D@icloud.com>
 <8cb3f78f-70a6-4e60-aff6-e5050501a5cf@Canary>
Message-ID: <7BB5AABE-13CA-42B3-B030-3AD3F41C01F8@icloud.com>

Yeah, I get it. llvmlite would only do composition, while TACO is doing
fusion. This is more promising!

Best regards,
Compl

> On 2020-11-25, at 17:17, Hameer Abbasi wrote:
> [...]
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ralf.gommers at gmail.com  Wed Nov 25 13:21:21 2020
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Wed, 25 Nov 2020 18:21:21 +0000
Subject: [Numpy-discussion] NumPy Community Meeting Wednesday -- One hour earlier from now on!
In-Reply-To: <0f62311e1dd4f21833c6d6694fb1803316fc9a2f.camel@sipsolutions.net>
References: <0f62311e1dd4f21833c6d6694fb1803316fc9a2f.camel@sipsolutions.net>
Message-ID: 

On Wed, Nov 25, 2020 at 1:05 AM Sebastian Berg wrote:
> Hi all,
>
> There will be a NumPy Community meeting Wednesday November 25th at 12pm
> Pacific Time (19:00 UTC).

Should be 20:00 UTC (~1.5 hrs from now)

Cheers,
Ralf

> Everyone is invited and encouraged to join in and edit the
> work-in-progress meeting topics and notes at:
>
> https://hackmd.io/76o-IxCjQX2mOXO_wwkcpg?both
>
> Best wishes
>
> Sebastian
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ralf.gommers at gmail.com  Thu Nov 26 09:17:18 2020
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Thu, 26 Nov 2020 14:17:18 +0000
Subject: [Numpy-discussion] Add sliding_window_view method to numpy
In-Reply-To: 
References: <1cdb0b720f09845d03ccfdc2e171f98d7e925ee3.camel@sipsolutions.net>
 <32e8736e-55ed-1155-3da5-003d907c4e65@smhi.se>
Message-ID: 

On Fri, Nov 6, 2020 at 4:03 PM Zimmermann Klaus wrote:
> Hi,
>
> On 06/11/2020 15:58, Ralf Gommers wrote:
>> On Fri, Nov 6, 2020 at 9:51 AM Zimmermann Klaus wrote:
>>> I have absolutely no problem keeping this out of the main namespace.
>>>
>>> In fact I'd like to point out that it was not my idea. Rather, it was
>>> proposed by Bas van Beek in the comments [1,2] and received a little
>>> more scrutiny from Eric Wieser in [3].
>>
>> Thanks, between two PRs with that many comments, I couldn't figure that
>> out - just saw the commit that made the change.
>
> Understandable, no worries.
>
>>> On the subject matter, I am also curious about the potential for
>>> confusion. What other behavior could one expect from a sliding window
>>> view with this shape?
>>>
>>> As I said, I am completely fine with keeping this out of the main
>>> namespace, but I agree with Sebastian's comment, that
>>> `np.lib.stride_tricks` is perhaps not the best namespace.
>>
>> I agree that that's not a great namespace. There are multiple issues
>> with namespaces; we basically have three good ones (fft, linalg,
>> random) and a bunch of other ones that range from questionable to
>> terrible. See
>> https://github.com/numpy/numpy/blob/master/numpy/tests/test_public_api.py#L127
>> for details.
>>
>> This would be a good thing to work on - making the `numpy.lib`
>> namespace not bleed into `numpy` via `import *` is one thing to do
>> there, and there are many others. But given backwards compat
>> constraints it's not easy.
>
> I understand cleaning up all the namespaces is a giant task, so far, far
> out of scope here. As said before, I also completely agree to keep it
> out of the main namespace (though I will still argue below :P).
>
> I was just wondering if, off the top of your head, an existing, better
> fit comes to mind?

Not really. Outside of stride_tricks there's nothing that quite fits.
This function is more in scope for something like scipy.signal.

Cheers,
Ralf
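For context, basic usage of the function under discussion, as merged for
NumPy 1.20 in the np.lib.stride_tricks namespace (a small self-contained
example):

    import numpy as np
    from numpy.lib.stride_tricks import sliding_window_view

    x = np.arange(6)
    v = sliding_window_view(x, window_shape=3)
    # v is a read-only view of shape (4, 3):
    # [[0 1 2], [1 2 3], [2 3 4], [3 4 5]]
    print(v.mean(axis=-1))  # moving average, without copying x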
Thinking of my > > scientist colleagues, I think those are exactly the kind of users > that > > could benefit from such a prototyping tool. > > > > > > That phrasing is one of a number of concerns. NumPy is normally not in > > the business of providing things that are okay as a prototyping tool, > > but are potentially extremely slow (as pointed out in the Notes section > > of the docstring). A function like that would basically not be the right > > tool for almost anything in, e.g., SciPy - it requires an iterative > > algorithm. In NumPy we don't prefer performance at all costs, but in > > general it's pretty decent rather than "Numba or Cython may gain you > > 100x here". > > I still think that the performance concern is a bit overblown. Yes, > application with large windows can need more FLOPs by an equally large > factor. But most such applications will use small to moderate windows. > Furthermore, this view focuses only on FLOPs. In my current field of > climate science (and many others), that is almost never the limiting > factor. Memory demands are far more problematic and incidentally, those > are more likely to increase in other methods that require the storage of > ancillary, temporary data. > > > Other issues include: > > 2) It is very specific to NumPy's memory model (as pointed out by you > > and Sebastian) - just like the rest of stride_tricks > Not wrong, but on the other hand, that memory model is not exotic. C, > Fortran, and any number of other languages play very nicely with this, > just as important downstream libraries like dask. > > > 3) It has "view" in the name, which doesn't quite make sense for the > > main namespace (also connected to point 2 above). > Ok. > > > 4) The cost of putting something in the main namespace for other > > array/tensor libraries is large. Maybe other libraries, e.g. CuPy, Dask, > > TensorFlow, PyTorch, JAX, MXNet, aim to reimplement part or all of the > > main NumPy namespace as well as possible. This would trigger discussions > > and likely many person-weeks of work for others. > Agreed. Though I have to say that my whole motivation comes from > corresponding issues in dask that where specifically waiting for (the > older version of) this PR (see [1, 2,...]). But I understand that dask > is effectively much closer to the numpy memory model than, say, CuPy, so > don't take this to mean it should be in the main namespace. > > > 5) It's a useful function, but it's very much on the margins of NumPy's > > scope. It could easily have gone into, for example, scipy.signal. At > > this point the bar for functions going into the main namespace should > be> (and is) high. > I agree that the bar for the main namespace should be high! > > > All this taken together means it's not even a toss-up for me. If it were > > just one or two of these points, maybe. But given all the above, I'm > > pretty confident saying "it does not belong in the main namespace". > Again, I am happy with that. > > > Thanks for your thoughts and work! I really appreciate it! 
> > Cheers > Klaus > > [1] https://github.com/dask/dask/issues/4659 > [2] https://github.com/pydata/xarray/issues/3608 > [3] https://github.com/pandas-dev/pandas/issues/26959 > > > > > > > > Cheers > > Klaus > > > > > > > > [1] https://github.com/numpy/numpy/pull/17394#issuecomment-700998618 > > > > [2] https://github.com/numpy/numpy/pull/17394#discussion_r498215468 > > > > [3] https://github.com/numpy/numpy/pull/17394#discussion_r498724340 > > > > > > On 06/11/2020 01:39, Sebastian Berg wrote: > > > On Thu, 2020-11-05 at 17:35 -0600, Sebastian Berg wrote: > > >> On Thu, 2020-11-05 at 12:51 -0800, Stephan Hoyer wrote: > > >>> On Thu, Nov 5, 2020 at 11:16 AM Ralf Gommers < > > >>> ralf.gommers at gmail.com > > > >>> wrote: > > >>> > > >>>> On Thu, Nov 5, 2020 at 4:56 PM Sebastian Berg < > > >>>> sebastian at sipsolutions.net > > > >>>> wrote: > > >>>> > > >>>>> Hi all, > > >>>>> > > >>>>> just a brief note that I merged this proposal: > > >>>>> > > >>>>> https://github.com/numpy/numpy/pull/17394 > > > > >>>>> > > >>>>> adding `np.sliding_window_view` into the 1.20 release of NumPy. > > >>>>> > > >>>>> There was only one public API change, and that is that the > > >>>>> `shape` > > >>>>> argument is now called `window_shape`. > > >>>>> > > >>>>> This is still a good time for feedback in case you have a > > >>>>> better > > >>>>> idea > > >>>>> e.g. for the function or parameter names. > > >>>>> > > >>>> > > >>>> The old PR had this in the lib.stride_tricks namespace. Seeing > it > > >>>> in the > > >>>> main namespace is unexpected and likely will lead to > > >>>> issues/questions, > > >>>> given that such an overlapping view is going to do behave in > ways > > >>>> the > > >>>> average user will be surprised by. It may also lead to requests > > >>>> for > > >>>> other > > >>>> array/tensor libraries to implement this. I don't see any > > >>>> discussion on > > >>>> this in PR 17394, it looks like a decision by the PR author that > > >>>> no > > >>>> one > > >>>> commented on - reconsider that? > > >>>> > > >>>> Cheers, > > >>>> Ralf > > >>>> > > >>> > > >>> +1 let's keep this in the lib.stride_tricks namespace. > > >>> > > >> > > >> I have no reservations against having it in the main namespace > and am > > >> happy either way (it can still be exposed later in any case). It > is > > >> the > > >> conservative choice and maybe it is an uncommon enough function > that > > >> it > > >> deserves being a bit hidden... > > > > > > > > > In any case, its the safe bet for NumPy 1.20 at least so I opened > > a PR: > > > > > > https://github.com/numpy/numpy/pull/17720 > > > > > > > > Name changes, etc. are also possible of course. > > > > > > I still think it might be nice to find a better place for this > type of > > > function that `np.lib.stride_tricks` though, but dunno... > > > > > > - Sebastian > > > > > > > > > > > >> > > >> But I am curious, it sounds like you have both very strong > > >> reservations, and I would like to understand them better. > > >> > > >> The behaviour can be surprising, but that is why the default is a > > >> read- > > >> only view. I do not think it is worse than `np.broadcast_to` in > this > > >> regard. (It is nowhere near as dangerous as `as_strided`.) > > >> > > >> It is true that it is specific to NumPy (memory model). So that is > > >> maybe a good enough reason right now. But I am not sure that > > >> stuffing > > >> things into a pretty hidden `np.lib.*` namespaces is a great long > > >> term > > >> solution either. 
There is very little useful functionality hidden > > >> away > > >> in `np.lib.*` currently. > > >> > > >> Cheers, > > >> > > >> Sebastian > > >> > > >>>> > > >>>> > > >>>>> Cheers, > > >>>>> > > >>>>> Sebastian > > >>>>> > > >>>>> > > >>>>> > > >>>>> On Mon, 2020-10-12 at 08:39 +0000, Zimmermann Klaus wrote: > > >>>>>> Hello, > > >>>>>> > > >>>>>> I would like to draw the attention of this list to PR #17394 > > >>>>>> [1] that > > >>>>>> adds the implementation of a sliding window view to numpy. > > >>>>>> > > >>>>>> Having a sliding window view in numpy is a longstanding open > > >>>>>> issue > > >>>>>> (cf > > >>>>>> #7753 [2] from 2016). A brief summary of the discussions > > >>>>>> surrounding > > >>>>>> it > > >>>>>> can be found in the description of the PR. > > >>>>>> > > >>>>>> This PR implements a sliding window view based on stride > > >>>>>> tricks. > > >>>>>> Following the discussion in issue #7753, a first > > >>>>>> implementation > > >>>>>> was > > >>>>>> provided by Fanjin Zeng in PR #10771. After some discussion, > > >>>>>> that PR > > >>>>>> stalled and I picked up the issue in the present PR #17394. > > >>>>>> It > > >>>>>> is > > >>>>>> based > > >>>>>> on the first implementation, but follows the changed API as > > >>>>>> suggested > > >>>>>> by > > >>>>>> Eric Wieser. > > >>>>>> > > >>>>>> Code reviews have been provided by Bas van Beek, Stephen > > >>>>>> Hoyer, > > >>>>>> and > > >>>>>> Eric > > >>>>>> Wieser. Sebastian Berg added the "62 - Python API" label. > > >>>>>> > > >>>>>> > > >>>>>> Do you think this is suitable for inclusion in numpy? > > >>>>>> > > >>>>>> Do you consider the PR ready? > > >>>>>> > > >>>>>> Do you have suggestions or requests? > > >>>>>> > > >>>>>> > > >>>>>> Thanks for your time and consideration! 
> > >>>>>> Klaus
> > >>>>>>
> > >>>>>> [1] https://github.com/numpy/numpy/pull/17394
> > >>>>>> [2] https://github.com/numpy/numpy/issues/7753

_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at python.org
https://mail.python.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From pierre.augier at univ-grenoble-alpes.fr  Thu Nov 26 16:14:40 2020
From: pierre.augier at univ-grenoble-alpes.fr (PIERRE AUGIER)
Date: Thu, 26 Nov 2020 22:14:40 +0100 (CET)
Subject: [Numpy-discussion] Showing by examples how Python-Numpy can be efficient even for computationally intensive tasks
In-Reply-To: 
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
Message-ID: <304723263.47365520.1606425280293.JavaMail.zimbra@univ-grenoble-alpes.fr>

I changed the email subject because I'd like to focus less on CO2 (a very
interesting subject, but not my focus here) and more on computing...

----- Original Message -----
> From: "Andy Ray Terrel"
> To: "numpy-discussion"
> Sent: Tuesday, 24 November 2020 18:27:52
> Subject: Re: [Numpy-discussion] Comment published in Nature Astronomy
> about The ecological impact of computing with Python

> I think we, the community, do have to take it seriously. NumPy and the
> rest of the ecosystem are trying to raise money to hire developers. This
> sentiment, which is much wider than a single paper, is a prevalent
> roadblock.
>
> -- Andy

I agree. I don't know if it is a matter of scientific field, but I tend to
hear more and more people explaining that they don't use Python because of
performance, or telling me that they don't have performance problems
because they don't use Python.
Some communities (I won't give names) communicate a lot on the bad
performance of Python-Numpy. I am well aware that performance is in many
cases not so important, but it is not a good thing to have such a bad
reputation. I think we have to show what is doable with Python-Numpy code
to get very good performance.

----- Original Message -----
> From: "Sebastian Berg"
> Sent: Tuesday, 24 November 2020 18:25:02
> Subject: Re: [Numpy-discussion] Comment published in Nature Astronomy
> about The ecological impact of computing with Python

>> Is there already something planned to answer to Zwart (2020)?
>
> I don't think there is any need for rebuttal. The author is right:
> you should not write the core of an N-Body simulation in Python :).
> I completely disagree with the focus on programming languages/tooling,
> quite honestly.

I'm not a fan of this focus either. But we have to realize that many
people think like that and are sensitive to such arguments. Being so bad
in all benchmark games does not help the scientific Python community
(especially in the long term).

> A PhD who writes performance critical code must get the education
> necessary to do it well. That may mean learning something beyond Python,
> but not replacing Python entirely.

I'm really not sure. Or at least that depends on the type of performance
critical code. I see many students or scientists who sometimes need to
write a few functions that are not super inefficient. For many people, I
don't see why they would need to learn and use another language.

I did my PhD (in turbulence) with Fortran (and Matlab) and I have really
nothing against Fortran. However, I'm really happy that in my group we
code nearly everything in Python (+ a bit of C++ for the fun). For
example, Fluidsim (https://foss.heptapod.net/fluiddyn/fluidsim) is ~100%
Python and I know that it is very efficient (more efficient than many
alternatives written with a lot of C++/Fortran). I realize that it
wouldn't be possible for all kinds of code (and fluidsim uses fluidfft,
written in C++ / Cython / Python), but being 100% Python has a lot of
advantages (I won't list them here).

For an N-body simulation, why not use Python? Using Python, you get a
very readable, clear and efficient implementation (see
https://github.com/paugier/nbabel), even faster than what you can get
with easy C++/Fortran/Julia. IMHO, it is just what one needs for most
PhDs in astronomy. Of course, for many things, one needs native
languages! Have a look at the C++ code produced by Pythran, it's
beautiful! But I don't think every scientist who writes critical code has
to become an expert in C++ or Fortran (or Julia).

I also sometimes have to read and use C++ and Fortran codes written by
scientists. Sometimes (often), I tend to think that they would be more
productive with other tools to reach the same performance. One can say it
is only a matter of education and not of tooling, but using serious tools
does not make you a serious developer, and reaching the level in
C++/Fortran needed to write efficient, clean, readable and maintainable
code is not so easy for a PhD student or scientist who has other things
to do.

Python-Numpy is so slow for some algorithms that many Python-Numpy users
would benefit from knowing how to accelerate it.
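A minimal sketch of the kind of accelerated function Pierre describes
below, using Transonic's @jit (the loop body here is illustrative and
simplified, not the actual nbabel code):

    import numpy as np
    from transonic import jit

    @jit
    def compute_accelerations(accelerations, masses, positions):
        # Direct O(N^2) summation over all particle pairs.
        nb_particles = masses.size
        for i in range(nb_particles - 1):
            for j in range(i + 1, nb_particles):
                delta = positions[i] - positions[j]
                distance3 = (delta**2).sum() ** 1.5
                accelerations[i] -= masses[j] / distance3 * delta
                accelerations[j] += masses[i] / distance3 * delta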
Just an example, with some elapsed times (in s) for the N-body problem
(see https://github.com/paugier/nbabel#smaller-benchmarks-between-different-python-solutions):

| Transonic-Pythran | Transonic-Numba | High-level Numpy | PyPy OOP | PyPy lists |
|-------------------|-----------------|------------------|----------|------------|
| 0.48              | 3.91            | 686              | 87       | 15         |

For comparison, we have for this case `{"c++": 0.85, "Fortran": 0.62,
"Julia": 2.57}`.

Note that by just adding `from transonic import jit` to the simple
high-level Numpy code and then decorating the function
`compute_accelerations` with `@jit`, the elapsed time decreases to 8 s
(an ~85x speedup, with Pythran 0.9.8).

I conclude from these types of results that we need to tell Python users
how to accelerate their Python-Numpy codes when they feel the need of it.
I think acceleration tools should be mentioned on the Numpy website. I
also think we should spend a bit of energy to play some benchmark games.

It would be much better if we could change the widespread idea on Python
performance for numerical problems from "Python is very slow and
ineffective for most algorithms" to "interpreted Python can be very slow
but with the existing Python accelerators, one can be extremely efficient
with Python".

Pierre

> On Tue, Nov 24, 2020 at 11:12 AM Ilhan Polat <ilhanpolat at gmail.com>
> wrote:
>
> Do we have to take it seriously to start with? Because, with absolutely
> no offense meant, I am having significant difficulty doing so.
>
> On Tue, Nov 24, 2020 at 4:58 PM PIERRE AUGIER
> <pierre.augier at univ-grenoble-alpes.fr> wrote:
> [...]
It is also true for most of the Python solutions for the Computer > Language Benchmarks Game in [ > https://benchmarksgame-team.pages.debian.net/benchmarksgame/ | > https://benchmarksgame-team.pages.debian.net/benchmarksgame/ ] (codes here [ > https://salsa.debian.org/benchmarksgame-team/benchmarksgame#what-else | > https://salsa.debian.org/benchmarksgame-team/benchmarksgame#what-else ] ). > > We could try to fix this so that people see that in many cases, it is not > necessary to "abandon Python for a more environmentally friendly (compiled) > programming language". One of the longest and hardest task would be to > implement the different cases of the Computer Language Benchmarks Game in > standard and modern Python-Numpy. Then, optimizing and accelerating such code > should be doable and we should be able to get very good performance at least > for some cases. Good news for this project, (i) the first point can be done by > anyone with good knowledge in Python-Numpy (many potential workers), (ii) for > some cases, there are already good Python implementations and (iii) the work > can easily be parallelized. > > It is not a criticism, but the (beautiful and very nice) new Numpy website [ > https://numpy.org/ | https://numpy.org/ ] is not very convincing in terms of > performance. It's written "Performant The core of NumPy is well-optimized C > code. Enjoy the flexibility of Python with the speed of compiled code." It's > true that the core of Numpy is well-optimized C code but to seriously compete > with C++, Fortran or Julia in terms of numerical performance, one needs to use > other tools to move the compiled-interpreted boundary outside the hot loops. So > it could be reasonable to mention such tools (in particular Numba, Pythran, > Cython and Transonic). > > Is there already something planned to answer to Zwart (2020)? > > Any opinions or suggestions on this potential project? > > Pierre > > PS: Of course, alternative Python interpreters (PyPy, GraalPython, Pyjion, > Pyston, etc.) could also be used, especially if HPy ( [ > https://github.com/hpyproject/hpy | https://github.com/hpyproject/hpy ] ) is > successful (C core of Numpy written in HPy, Cython able to produce HPy code, > etc.). However, I tend to be a bit skeptical in the ability of such > technologies to reach very high performance for low-level Numpy code > (performance that can be reached by replacing whole Python functions with > optimized compiled code). Of course, I hope I'm wrong! IMHO, it does not remove > the need for a successful HPy! 
From ralf.gommers at gmail.com  Thu Nov 26 17:48:40 2020
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Thu, 26 Nov 2020 22:48:40 +0000
Subject: [Numpy-discussion] Showing by examples how Python-Numpy can be
 efficient even for computationally intensive tasks
In-Reply-To: <304723263.47365520.1606425280293.JavaMail.zimbra@univ-grenoble-alpes.fr>
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
 <304723263.47365520.1606425280293.JavaMail.zimbra@univ-grenoble-alpes.fr>
Message-ID: 

On Thu, Nov 26, 2020 at 9:15 PM PIERRE AUGIER
<pierre.augier at univ-grenoble-alpes.fr> wrote:

> I conclude from these types of results that we need to tell Python users
> how to accelerate their Python-Numpy codes when they feel the need for
> it. I think acceleration tools should be mentioned on the Numpy website.
> I also think we should spend a bit of energy to play some benchmark
> games.

Good point, added an issue for it on the website repo:
https://github.com/numpy/numpy.org/issues/370

Cheers,
Ralf

From ralf.gommers at gmail.com  Thu Nov 26 18:10:00 2020
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Thu, 26 Nov 2020 23:10:00 +0000
Subject: [Numpy-discussion] Added Rivest-Floyd selection algorithm as an
 option to numpy.partition
In-Reply-To: <18811251606219790@mail.yandex.ru>
References: <18811251606219790@mail.yandex.ru>
Message-ID: 

On Tue, Nov 24, 2020 at 12:56 PM Viktoriya Malyasova
<malyasova.viktoriya at yandex.ru> wrote:

> Hello everyone!
>
> I've implemented the Rivest-Floyd selection algorithm as a second option
> for the partition method. I found it works about 1.5 times faster on
> average for big array sizes; here are average times (in s) for finding a
> median:
>
> | array length | introselect | rivest_floyd |
> |--------------|-------------|--------------|
> | 10           | 4.6e-05     | 4.4e-05      |
> | 100          | 5.5e-05     | 4.7e-05      |
> | 1000         | 6.9e-05     | 6.5e-05      |
> | 10000        | 3.1e-04     | 2.3e-04      |
> | 100000       | 2.9e-03     | 2.0e-03      |
> | 1000000      | 2.9e-02     | 2.0e-02      |
>
> I've created a pull request https://github.com/numpy/numpy/pull/17813 and
> implemented the reviewers' suggestions and fixes. Do you think this
> feature should be added? I am new to open source, sorry if I am doing
> anything wrong.

Hi Viktoriya, welcome! It looks like you're doing everything right, and the
reviews so far are positive.

Cheers,
Ralf
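For context, selection via `np.partition` places the k-th smallest element
at index k, which is how a median can be found without a full sort. A small
sketch is below; note that `kind='rivest_floyd'` is the value proposed in
the PR and is hypothetical until it is merged (released NumPy only accepts
'introselect'):

```python
import numpy as np

a = np.random.default_rng(42).random(1_000_001)
k = a.size // 2

# np.partition puts the k-th order statistic at index k; for an
# odd-length array this element is exactly the median.
median = np.partition(a, k, kind="introselect")[k]
assert median == np.median(a)

# With the PR as proposed, the new algorithm would be selected the same
# way (hypothetical until merged):
# median = np.partition(a, k, kind="rivest_floyd")[k]
```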
From Jerome.Kieffer at esrf.fr  Fri Nov 27 02:20:20 2020
From: Jerome.Kieffer at esrf.fr (Jerome Kieffer)
Date: Fri, 27 Nov 2020 08:20:20 +0100
Subject: [Numpy-discussion] Showing by examples how Python-Numpy can be
 efficient even for computationally intensive tasks
In-Reply-To: <304723263.47365520.1606425280293.JavaMail.zimbra@univ-grenoble-alpes.fr>
References: <1941664094.43615172.1606232825156.JavaMail.zimbra@univ-grenoble-alpes.fr>
 <304723263.47365520.1606425280293.JavaMail.zimbra@univ-grenoble-alpes.fr>
Message-ID: <20201127082020.2b6dfaef@patagonia>

On Thu, 26 Nov 2020 22:14:40 +0100 (CET)
PIERRE AUGIER wrote:

> I changed the email subject because I'd like to focus less on CO2 (a very
> interesting subject, but not my focus here) and more on computing...

Hi Pierre,

We may turn the problem the other way around: one should focus more on the
algorithm than on the programming language. I would like to share one
example with you, where we published how to speed up a crystallographic
computation written in Python:

https://onlinelibrary.wiley.com/iucr/doi/10.1107/S1600576719008471

One referee asked us to validate against equivalent C and Fortran code. The
C code was as fast as Pythran or Cython, and Fortran was still faster (the
std of the Fortran-compiled runtime was much smaller, which allowed Fortran
to be faster by 3 std!). But I consider the difference to be marginal at
this level!

If one considers the "Moore law", i.e. the time needed for "performance" to
double in different aspects of computing, one gets 18 to 24 months for the
number of transistors in a processor, 18 years for compilers, and 2 years
(on average) for the development of new algorithms. In this sense one
should focus more on the algorithm used.

Table 1 of the article is especially interesting: pure Python is 10x slower
than proper Numpy code, and parallel Pythran is 50x faster than Numpy (on
the given computer), but using the proper algorithm, i.e. an FFT in this
case, is 13000x faster!

So I believe that Python, with its expressivity, helps a lot in
understanding the algorithm and hence in designing faster code.

Cheers,

Jerome
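To make this algorithm-versus-micro-optimization point self-contained, here
is a generic sketch (not the crystallographic code from the paper)
contrasting a direct O(N^2) autocorrelation with the mathematically
identical O(N log N) FFT route:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.standard_normal(4096)

# Direct autocorrelation: O(N**2) multiply-adds.
direct = np.correlate(x, x, mode="full")

# Same quantity via the FFT (Wiener-Khinchin): O(N log N).
nfft = 2 * x.size  # >= 2*N - 1, so circular wrap-around cannot mix lags
X = np.fft.rfft(x, nfft)
acf = np.fft.irfft(X * np.conj(X), nfft)
# Negative lags are stored at the end of the circular result; reassemble
# them in front of the non-negative lags to match np.correlate's layout.
fft_based = np.concatenate((acf[-(x.size - 1):], acf[:x.size]))

assert np.allclose(direct, fft_based)
```

Already at N = 4096 the FFT route is faster by a large factor, and the gap
only grows with N; no amount of compiling the direct loop closes an
O(N / log N) gap.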
From klaus.zimmermann at smhi.se  Fri Nov 27 10:10:24 2020
From: klaus.zimmermann at smhi.se (Zimmermann Klaus)
Date: Fri, 27 Nov 2020 15:10:24 +0000
Subject: [Numpy-discussion] Add sliding_window_view method to numpy
In-Reply-To: 
References: <1cdb0b720f09845d03ccfdc2e171f98d7e925ee3.camel@sipsolutions.net>
 <32e8736e-55ed-1155-3da5-003d907c4e65@smhi.se>
Message-ID: <98a7ea1f-2296-70e0-feea-8c34c0070301@smhi.se>

Hi Ralf,

On 26/11/2020 15:17, Ralf Gommers wrote:
> On Fri, Nov 6, 2020 at 4:03 PM Zimmermann Klaus wrote:
>
> > I was just wondering if, off the top of your head, an existing, better
> > fit comes to mind?
>
> Not really. Outside of stride_tricks there's nothing that quite fits.
> This function is more in scope for something like scipy.signal.

Alright, let's keep it as is then.

Thanks and cheers
Klaus
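(For readers joining the thread here: the function under discussion shipped
in NumPy 1.20 as `np.lib.stride_tricks.sliding_window_view`. A brief usage
sketch:)

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

x = np.arange(10.0)

# Every length-4 window at every valid offset, as a zero-copy view:
windows = sliding_window_view(x, window_shape=4)
print(windows.shape)          # (7, 4)

# Typical use: a moving average without materializing the windows.
print(windows.mean(axis=-1))  # [1.5 2.5 3.5 4.5 5.5 6.5 7.5]
```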
> Cheers,
> Ralf
>
>     The reason from my point of view is that stride tricks is really a
>     technical (and slightly ominous) name that might throw off more
>     application-oriented programmers from finding and using this
>     function. Thinking of my scientist colleagues, I think those are
>     exactly the kind of users that could benefit from such a prototyping
>     tool.
>
> > That phrasing is one of a number of concerns. NumPy is normally not in
> > the business of providing things that are okay as a prototyping tool
> > but are potentially extremely slow (as pointed out in the Notes
> > section of the docstring). A function like that would basically not be
> > the right tool for almost anything in, e.g., SciPy - it requires an
> > iterative algorithm. In NumPy we don't prefer performance at all
> > costs, but in general it's pretty decent rather than "Numba or Cython
> > may gain you 100x here".
>
> I still think that the performance concern is a bit overblown. Yes,
> applications with large windows can need more FLOPs by an equally large
> factor. But most such applications will use small to moderate windows.
> Furthermore, this view focuses only on FLOPs. In my current field of
> climate science (and many others), that is almost never the limiting
> factor. Memory demands are far more problematic, and incidentally, those
> are more likely to increase in other methods that require the storage of
> ancillary, temporary data.
>
> > Other issues include:
> > 2) It is very specific to NumPy's memory model (as pointed out by you
> > and Sebastian) - just like the rest of stride_tricks
>
> Not wrong, but on the other hand, that memory model is not exotic. C,
> Fortran, and any number of other languages play very nicely with this,
> as do important downstream libraries like dask.
>
> > 3) It has "view" in the name, which doesn't quite make sense for the
> > main namespace (also connected to point 2 above).
>
> Ok.
>
> > 4) The cost of putting something in the main namespace for other
> > array/tensor libraries is large. Many other libraries, e.g. CuPy,
> > Dask, TensorFlow, PyTorch, JAX, MXNet, aim to reimplement part or all
> > of the main NumPy namespace as well as possible. This would trigger
> > discussions and likely many person-weeks of work for others.
>
> Agreed. Though I have to say that my whole motivation comes from
> corresponding issues in dask that were specifically waiting for (the
> older version of) this PR (see [1, 2, ...]). But I understand that dask
> is effectively much closer to the numpy memory model than, say, CuPy, so
> don't take this to mean it should be in the main namespace.
>
> > 5) It's a useful function, but it's very much on the margins of
> > NumPy's scope. It could easily have gone into, for example,
> > scipy.signal. At this point the bar for functions going into the main
> > namespace should be (and is) high.
>
> I agree that the bar for the main namespace should be high!
>
> > All this taken together means it's not even a toss-up for me. If it
> > were just one or two of these points, maybe. But given all the above,
> > I'm pretty confident saying "it does not belong in the main namespace".
>
> Again, I am happy with that.
>
> Thanks for your thoughts and work! I really appreciate it!
>
> Cheers
> Klaus
>
> [1] https://github.com/dask/dask/issues/4659
> [2] https://github.com/pydata/xarray/issues/3608
> [3] https://github.com/pandas-dev/pandas/issues/26959
>
>     Cheers
>     Klaus
>
>     [1] https://github.com/numpy/numpy/pull/17394#issuecomment-700998618
>     [2] https://github.com/numpy/numpy/pull/17394#discussion_r498215468
>     [3] https://github.com/numpy/numpy/pull/17394#discussion_r498724340
>
>     On 06/11/2020 01:39, Sebastian Berg wrote:
>     > On Thu, 2020-11-05 at 17:35 -0600, Sebastian Berg wrote:
>     >> On Thu, 2020-11-05 at 12:51 -0800, Stephan Hoyer wrote:
>     >>> On Thu, Nov 5, 2020 at 11:16 AM Ralf Gommers
>     >>> <ralf.gommers at gmail.com> wrote:
>     >>>
>     >>>> On Thu, Nov 5, 2020 at 4:56 PM Sebastian Berg
>     >>>> <sebastian at sipsolutions.net> wrote:
>     >>>>
>     >>>>> Hi all,
>     >>>>>
>     >>>>> just a brief note that I merged this proposal:
>     >>>>>
>     >>>>>     https://github.com/numpy/numpy/pull/17394
>     >>>>>
>     >>>>> adding `np.sliding_window_view` into the 1.20 release of
>     >>>>> NumPy.
>     >>>>>
>     >>>>> There was only one public API change, and that is that the
>     >>>>> `shape` argument is now called `window_shape`.
>     >>>>>
>     >>>>> This is still a good time for feedback in case you have a
>     >>>>> better idea, e.g. for the function or parameter names.
>     >>>>
>     >>>> The old PR had this in the lib.stride_tricks namespace. Seeing
>     >>>> it in the main namespace is unexpected and likely will lead to
>     >>>> issues/questions, given that such an overlapping view is going
>     >>>> to behave in ways the average user will be surprised by. It may
>     >>>> also lead to requests for other array/tensor libraries to
>     >>>> implement this. I don't see any discussion on this in PR 17394,
>     >>>> it looks like a decision by the PR author that no one commented
>     >>>> on - reconsider that?
>     >>>>
>     >>>> Cheers,
>     >>>> Ralf
>     >>>
>     >>> +1 let's keep this in the lib.stride_tricks namespace.
>     >>
>     >> I have no reservations against having it in the main namespace
>     >> and am happy either way (it can still be exposed later in any
>     >> case). It is the conservative choice, and maybe it is an uncommon
>     >> enough function that it deserves being a bit hidden...
>     >
>     > In any case, it's the safe bet for NumPy 1.20 at least, so I
>     > opened a PR:
>     >
>     >     https://github.com/numpy/numpy/pull/17720
>     >
>     > Name changes, etc. are also possible of course.
>     >
>     > I still think it might be nice to find a better place for this
>     > type of function than `np.lib.stride_tricks` though, but dunno...
>     >
>     > - Sebastian
>     >
>     >> But I am curious: it sounds like you both have very strong
>     >> reservations, and I would like to understand them better.
>     >>
>     >> The behaviour can be surprising, but that is why the default is
>     >> a read-only view. I do not think it is worse than
>     >> `np.broadcast_to` in this regard. (It is nowhere near as
>     >> dangerous as `as_strided`.)
>     >>
>     >> It is true that it is specific to NumPy (memory model). So that
>     >> is maybe a good enough reason right now. But I am not sure that
>     >> stuffing things into a pretty hidden `np.lib.*` namespace is a
>     >> great long-term solution either. There is very little useful
>     >> functionality hidden away in `np.lib.*` currently.
>     >>
>     >> Cheers,
>     >>
>     >> Sebastian
>     >>>>> Cheers,
>     >>>>>
>     >>>>> Sebastian
>     >>>>>
>     >>>>> On Mon, 2020-10-12 at 08:39 +0000, Zimmermann Klaus wrote:
>     >>>>>> Hello,
>     >>>>>>
>     >>>>>> I would like to draw the attention of this list to PR #17394
>     >>>>>> [1] that adds the implementation of a sliding window view to
>     >>>>>> numpy.
>     >>>>>>
>     >>>>>> Having a sliding window view in numpy is a longstanding open
>     >>>>>> issue (cf #7753 [2] from 2016). A brief summary of the
>     >>>>>> discussions surrounding it can be found in the description
>     >>>>>> of the PR.
>     >>>>>>
>     >>>>>> This PR implements a sliding window view based on stride
>     >>>>>> tricks. Following the discussion in issue #7753, a first
>     >>>>>> implementation was provided by Fanjin Zeng in PR #10771.
>     >>>>>> After some discussion, that PR stalled and I picked up the
>     >>>>>> issue in the present PR #17394. It is based on the first
>     >>>>>> implementation, but follows the changed API as suggested by
>     >>>>>> Eric Wieser.
>     >>>>>>
>     >>>>>> Code reviews have been provided by Bas van Beek, Stephen
>     >>>>>> Hoyer, and Eric Wieser. Sebastian Berg added the "62 -
>     >>>>>> Python API" label.
>     >>>>>>
>     >>>>>> Do you think this is suitable for inclusion in numpy?
>     >>>>>>
>     >>>>>> Do you consider the PR ready?
>     >>>>>>
>     >>>>>> Do you have suggestions or requests?
>     >>>>>>
>     >>>>>> Thanks for your time and consideration!
>     >>>>>> Klaus
>     >>>>>>
>     >>>>>> [1] https://github.com/numpy/numpy/pull/17394
>     >>>>>> [2] https://github.com/numpy/numpy/issues/7753

_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at python.org
https://mail.python.org/mailman/listinfo/numpy-discussion

From charlesr.harris at gmail.com  Fri Nov 27 11:04:54 2020
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 27 Nov 2020 09:04:54 -0700
Subject: [Numpy-discussion] NumPy master branch is now open for 1.21
 development
Message-ID: 

Hi All,

The maintenance/1.20.x branch has been made and master is now open for 1.21
development.

Chuck