From ralf.gommers at gmail.com Sat Aug 1 07:21:43 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 1 Aug 2020 12:21:43 +0100 Subject: [Numpy-discussion] participating in AI Code-In? Message-ID: Hi all, We got an invitation to participate in AI Code-In ( https://aicode-in.github.io/AICode-In/). It's a new initiative, seems a bit GSoC like, but created by and for middle/high schoolers. We'd have to create tasks to work on (more like tagging/creation actionable issues than a full project), and provide some mentoring bandwidth. It seems well-organized and because it's a new initiative it may be smaller and more "early adopter" than GSoC. Would anyone be interested to participate as a mentor and/or lead the NumPy organization participation? Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From sabertooth2022 at gmail.com Sat Aug 1 07:25:22 2020 From: sabertooth2022 at gmail.com (Saber Tooth) Date: Sat, 1 Aug 2020 16:55:22 +0530 Subject: [Numpy-discussion] participating in AI Code-In? In-Reply-To: References: Message-ID: Hi Ralf , I'd be glad and more than interested to take part in CodeIn as a Mentor if there is no issue . Thanks , Mrinal On Sat, 1 Aug, 2020, 4:52 pm Ralf Gommers, wrote: > Hi all, > > We got an invitation to participate in AI Code-In ( > https://aicode-in.github.io/AICode-In/). It's a new initiative, seems a > bit GSoC like, but created by and for middle/high schoolers. We'd have to > create tasks to work on (more like tagging/creation actionable issues than > a full project), and provide some mentoring bandwidth. > > It seems well-organized and because it's a new initiative it may be > smaller and more "early adopter" than GSoC. Would anyone be interested to > participate as a mentor and/or lead the NumPy organization participation? > > Cheers, > Ralf > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sabertooth2022 at gmail.com Sat Aug 1 07:40:32 2020 From: sabertooth2022 at gmail.com (Saber Tooth) Date: Sat, 1 Aug 2020 17:10:32 +0530 Subject: [Numpy-discussion] participating in AI Code-In? In-Reply-To: References: Message-ID: Hi Ralf , I have quite some experience in Computer Vision where I developed a model to detect different type of currency notes and use object detection to augment 3d objects on these currency notes , there I relied up opencv python libraries and NumPy arrays for detection . I'd like to apply for mentoring role . https://github.com/mrityagi/ARnote Here is the link to my repo Thanks , Mrinal On Sat, 1 Aug, 2020, 4:55 pm Saber Tooth, wrote: > Hi Ralf , > I'd be glad and more than interested to take part in CodeIn as a Mentor if > there is no issue . > > Thanks , > Mrinal > > On Sat, 1 Aug, 2020, 4:52 pm Ralf Gommers, wrote: > >> Hi all, >> >> We got an invitation to participate in AI Code-In ( >> https://aicode-in.github.io/AICode-In/). It's a new initiative, seems a >> bit GSoC like, but created by and for middle/high schoolers. We'd have to >> create tasks to work on (more like tagging/creation actionable issues than >> a full project), and provide some mentoring bandwidth. >> >> It seems well-organized and because it's a new initiative it may be >> smaller and more "early adopter" than GSoC. 
Would anyone be interested to >> participate as a mentor and/or lead the NumPy organization participation? >> >> Cheers, >> Ralf >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From yashboss2000 at gmail.com Sat Aug 1 09:51:05 2020 From: yashboss2000 at gmail.com (yash varshney) Date: Sat, 1 Aug 2020 19:21:05 +0530 Subject: [Numpy-discussion] participating in AI Code-In? In-Reply-To: References: Message-ID: Hey Ralf, This is great, I would love to participate as a mentor in this wonderful opportunity. Brief about me: I am well experienced in Computer Vision as well as in NLP. I have done Tensorflow-in-Practice Specialization, How to win kaggle Competitions, NLP Specialization (ongoing) courses. I have participated in kaggle competitions and have studied 3 courses of Data Science in my college. Also, presently I'm working as a mentee in SPDX community ( under Linux Foundation) via CommunityBridge Mentorship program by Linux Foundation. I also a main contributor in DFFML (dataflow facilitator for Machine Learning) org under PSF. Thanks, would love to hear from you soon. Regards, Yash Varshney B18038 IIT Mandi, H.P., India On Sat, Aug 1, 2020, 4:52 PM Ralf Gommers wrote: > Hi all, > > We got an invitation to participate in AI Code-In ( > https://aicode-in.github.io/AICode-In/). It's a new initiative, seems a > bit GSoC like, but created by and for middle/high schoolers. We'd have to > create tasks to work on (more like tagging/creation actionable issues than > a full project), and provide some mentoring bandwidth. > > It seems well-organized and because it's a new initiative it may be > smaller and more "early adopter" than GSoC. Would anyone be interested to > participate as a mentor and/or lead the NumPy organization participation? > > Cheers, > Ralf > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sat Aug 1 14:52:11 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sat, 1 Aug 2020 19:52:11 +0100 Subject: [Numpy-discussion] a summary function to get a quick glimpse on the contents of a numpy array In-Reply-To: References: Message-ID: On Fri, Jul 31, 2020 at 1:40 PM Peter Steinbach wrote: > Dear numpy devs and interested readers, > > as a day-to-day user, it occurred to me that having a quick look into the > contents and extents of arrays is well doable with > numpy. numpy offers a rich set of methods for this. However, very often I > oversee myself and others that one just wants to see > if the values of an array have a certain min/max or mean or how wide the > range of values are. > > I hence sat down to write a summary function that returns a string of > hand-packed summary statistics for a quick inspection. I > propose to include it into numpy and would love to have your feedback on > this idea before I submit a PR. 
Here is the core > functionality: > > Examples > -------- > >>> a = np.random.normal(size=20) > >>> print(summary(a)) > min 25perc mean stdev median > 75perc max > -2.289870 -2.265757 -0.083213 1.115033 -0.162885 > -2.217532 1.639802 > >>> a = np.reshape(a, newshape=(4,5)) > >>> print(summary(a,axis=1)) > min 25perc mean stdev median > 75perc max > 0 -0.976279 -0.974090 0.293003 1.009383 0.466814 > -0.969712 1.519695 > 1 -0.468854 -0.467739 0.184139 0.649378 -0.036762 > -0.465510 1.303144 > 2 -2.289870 -2.276455 -0.324450 1.230031 -0.289008 > -2.249625 1.111107 > 3 -1.782239 -1.777304 -0.485546 1.259598 -1.236190 > -1.767434 1.639802 > > So you see, it is merely a tiny helper function that can aid practitioners > and data scientists to get a quick insight on what an > array contains. > > first off, here is the code: > > https://github.com/psteinb/numpy/blob/summary-function/numpy/lib/utils.py#L1021 > > I put it there as I am not sure at this point, if the community would > appreciate such a function or not. Judging from the tests, > lib/utils.py appears to a be place for undocumented functions. So to > resolve this and prepare a proper PR, please let me know > where this summary function could reside! > This seems to be more the domain of scipy.stats and statsmodels. Statsmodels already does a good job with this; in SciPy there's stats.describe ( https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.describe.html) which is quite similar to what you're proposing. Could you think about whether scipy.stats.describe does what you want, and if there's room to improve it (perhaps add a `__repr__` and/or a `__html_repr__` for pretty-printing)? Cheers, Ralf > Second, please give me your thoughts on the summary function's output? > Should the number of digits be configurable? Should the > columns be configurable? Is is ok to honor the axis parameter which is > found in so many numpy functions? > > Last but not least, let me stress that this is my first time contribution > to numpy. I love the library and would like to > contribute something back. So bear with me, if my code violates best > practices in your community for now. I'll bite my teeth > into the formalities of a github PR once I get support from the community > and the core devs. > > I think that a summary function would be a valuable addition to numpy! > Best, > Peter > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From b.sipocz+numpylist at gmail.com Sun Aug 2 03:03:48 2020 From: b.sipocz+numpylist at gmail.com (Brigitta Sipocz) Date: Sun, 2 Aug 2020 00:03:48 -0700 Subject: [Numpy-discussion] participating in AI Code-In? In-Reply-To: References: Message-ID: Hi, At first sight, the competition element seems a bit weird approach for the open source setting. Do you see a way how it can work out well? (Google also has a code-in for HS students. Has numpy ever tried it? (all the mentors I talked to at the last gsoc summit said it takes more time to mentor than gsoc, but I guess maybe it's partly due to the fact that is different, if we count all the pre-coding period efforts put into gsoc by the wider community, it also adds up significantly)). Cheers, Brigitta On Sat, 1 Aug 2020, 04:22 Ralf Gommers, wrote: > Hi all, > > We got an invitation to participate in AI Code-In ( > https://aicode-in.github.io/AICode-In/). 
It's a new initiative, seems a > bit GSoC like, but created by and for middle/high schoolers. We'd have to > create tasks to work on (more like tagging/creation actionable issues than > a full project), and provide some mentoring bandwidth. > > It seems well-organized and because it's a new initiative it may be > smaller and more "early adopter" than GSoC. Would anyone be interested to > participate as a mentor and/or lead the NumPy organization participation? > > Cheers, > Ralf > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at gmail.com Mon Aug 3 14:09:22 2020 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Mon, 3 Aug 2020 14:09:22 -0400 Subject: [Numpy-discussion] New random.Generator method: permuted Message-ID: In one of the previous weekly zoom meetings, it was suggested to ping the mailing list about an updated PR that implements the `permuted` method for the Generator class in numpy.random. The relevant issue is https://github.com/numpy/numpy/issues/5173 and the PR is https://github.com/numpy/numpy/pull/15121 The new method (as it would be called from Python) is permuted(x, axis=None, out=None) The CircleCI rendering of the docstring from the pull request is https://14745-908607-gh.circle-artifacts.com/0/doc/build/html/reference/random/generated/numpy.random.Generator.permuted.html The new method is an alternative to the existing `shuffle` and `permutation` methods. It handles the `axis` parameter similar to how the sort methods do, i.e. when `axis` is given, the slices along the axis are shuffled independently. This new documentation (added as part of the pull request) explains the API of the various related methods: https://14745-908607-gh.circle-artifacts.com/0/doc/build/html/reference/random/generator.html#permutations Additional feedback on the implementation of `permuted` in the pull request is welcome. Further discussion of the API should be held in the issue gh-5173 (but please familiarize yourself with the discussion of the API in gh-5173--there has already been quite a long discussion of several different APIs). Thanks, Warren From cv1038 at wildcats.unh.edu Mon Aug 3 20:39:51 2020 From: cv1038 at wildcats.unh.edu (Chris Vavaliaris) Date: Mon, 3 Aug 2020 17:39:51 -0700 (MST) Subject: [Numpy-discussion] Add Chebyshev (cosine) transforms implemented via FFTs Message-ID: <1596501591921-0.post@n7.nabble.com> PR #16999: https://github.com/numpy/numpy/pull/16999 Hello all, this PR adds the two 1D Chebyshev transform functions `chebyfft` and `ichebyfft` into the `numpy.fft` module, utilizing the real FFTs `rfft` and `irfft`, respectively. As far as I understand, `pockefft` does not support cosine transforms natively; for this reason, an even extension of the input vector is constructed, whose real FFT corresponds to a cosine transform. The motivation behind these two additions is the ability to quickly perform direct and inverse Chebyshev transforms with `numpy`, without the need to write scripts that do the necessary (although minor) modifications. Chebyshev transforms are used often e.g. in the spectral integration of PDE problems; thus, I believe having them implemented in `numpy` would be useful to many people in the community. 
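To make the even-extension trick concrete, here is a rough sketch (the
helper name and normalization below are only illustrative, not the exact
API proposed in the PR):

import numpy as np

def cheby_coeffs(x):
    # x holds samples f(cos(pi*j/N)), j = 0..N.
    # Build the even extension [x_0, ..., x_N, x_{N-1}, ..., x_1] of
    # length 2N; its real FFT is then a cosine transform of x.
    N = len(x) - 1
    ext = np.concatenate([x, x[-2:0:-1]])
    c = np.fft.rfft(ext).real / N
    c[0] /= 2   # halve the endpoint bins (standard DCT-I weighting)
    c[-1] /= 2
    return c

As a quick check, feeding it samples of f(x) = x on the Chebyshev points,
i.e. np.cos(np.pi * np.arange(N + 1) / N), returns [0, 1, 0, ..., 0] up to
rounding, which are the coefficients of T_1.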
I'm happy to get comments/feedback on this feature, and on whether it's something more people would be interested in. Also, I'm not entirely sure what part of this functionality is/isn't present in `scipy`, so that the two `fft` modules remain consistent with one another. Best, Chris -- Sent from: http://numpy-discussion.10968.n7.nabble.com/ From ralf.gommers at gmail.com Tue Aug 4 06:54:21 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 4 Aug 2020 11:54:21 +0100 Subject: [Numpy-discussion] Add Chebyshev (cosine) transforms implemented via FFTs In-Reply-To: <1596501591921-0.post@n7.nabble.com> References: <1596501591921-0.post@n7.nabble.com> Message-ID: On Tue, Aug 4, 2020 at 1:49 AM Chris Vavaliaris wrote: > PR #16999: https://github.com/numpy/numpy/pull/16999 > > Hello all, > this PR adds the two 1D Chebyshev transform functions `chebyfft` and > `ichebyfft` into the `numpy.fft` module, utilizing the real FFTs `rfft` and > `irfft`, respectively. As far as I understand, `pockefft` does not support > cosine transforms natively; for this reason, an even extension of the input > vector is constructed, whose real FFT corresponds to a cosine transform. > > The motivation behind these two additions is the ability to quickly perform > direct and inverse Chebyshev transforms with `numpy`, without the need to > write scripts that do the necessary (although minor) modifications. > Chebyshev transforms are used often e.g. in the spectral integration of PDE > problems; thus, I believe having them implemented in `numpy` would be > useful > to many people in the community. > > I'm happy to get comments/feedback on this feature, and on whether it's > something more people would be interested in. Also, I'm not entirely sure > what part of this functionality is/isn't present in `scipy`, so that the > two > `fft` modules remain consistent with one another. > Hi Chris, that's a good question. scipy.fft is a superset of numpy.fft, and the functionality included in NumPy is really only the basics that are needed in many fields. The reason for the duplication stems from way back when we had no wheels and SciPy was very hard to install. So I don't think there's anything we'd add to numpy.fft at this point. As I commented on your PR, it would be useful to add some references and applications, and then make your proposal on the scipy-dev list. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Tue Aug 4 15:51:54 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 04 Aug 2020 14:51:54 -0500 Subject: [Numpy-discussion] NumPy Community Meeting Wednesday Message-ID: <8dbf1486b3760142190eb7ca3fd1b34affda7f24.camel@sipsolutions.net> Hi all, There will be a NumPy Community meeting Wednesday Agust 5th at 1pm Pacific Time (20:00 UTC). Everyone is invited and encouraged to join in and edit the work-in-progress meeting topics and notes at: https://hackmd.io/76o-IxCjQX2mOXO_wwkcpg?both Best wishes Sebastian -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From charlesr.harris at gmail.com Tue Aug 4 21:09:58 2020 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 4 Aug 2020 19:09:58 -0600 Subject: [Numpy-discussion] Add Chebyshev (cosine) transforms implemented via FFTs In-Reply-To: References: <1596501591921-0.post@n7.nabble.com> Message-ID: On Tue, Aug 4, 2020 at 4:55 AM Ralf Gommers wrote: > > > On Tue, Aug 4, 2020 at 1:49 AM Chris Vavaliaris > wrote: > >> PR #16999: https://github.com/numpy/numpy/pull/16999 >> >> Hello all, >> this PR adds the two 1D Chebyshev transform functions `chebyfft` and >> `ichebyfft` into the `numpy.fft` module, utilizing the real FFTs `rfft` >> and >> `irfft`, respectively. As far as I understand, `pockefft` does not support >> cosine transforms natively; for this reason, an even extension of the >> input >> vector is constructed, whose real FFT corresponds to a cosine transform. >> >> The motivation behind these two additions is the ability to quickly >> perform >> direct and inverse Chebyshev transforms with `numpy`, without the need to >> write scripts that do the necessary (although minor) modifications. >> Chebyshev transforms are used often e.g. in the spectral integration of >> PDE >> problems; thus, I believe having them implemented in `numpy` would be >> useful >> to many people in the community. >> >> I'm happy to get comments/feedback on this feature, and on whether it's >> something more people would be interested in. Also, I'm not entirely sure >> what part of this functionality is/isn't present in `scipy`, so that the >> two >> `fft` modules remain consistent with one another. >> > > Hi Chris, that's a good question. scipy.fft is a superset of numpy.fft, > and the functionality included in NumPy is really only the basics that are > needed in many fields. The reason for the duplication stems from way back > when we had no wheels and SciPy was very hard to install. So I don't think > there's anything we'd add to numpy.fft at this point. > > As I commented on your PR, it would be useful to add some references and > applications, and then make your proposal on the scipy-dev list. > > Chebfun is based around this method, they use series with possibly thousands of terms. Trefethen is a big fan of Chebyshev polynomials. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From shoyer at gmail.com Tue Aug 4 22:15:02 2020 From: shoyer at gmail.com (Stephan Hoyer) Date: Tue, 4 Aug 2020 19:15:02 -0700 Subject: [Numpy-discussion] Add Chebyshev (cosine) transforms implemented via FFTs In-Reply-To: References: <1596501591921-0.post@n7.nabble.com> Message-ID: On Tue, Aug 4, 2020 at 6:10 PM Charles R Harris wrote: > > > On Tue, Aug 4, 2020 at 4:55 AM Ralf Gommers > wrote: > >> >> >> On Tue, Aug 4, 2020 at 1:49 AM Chris Vavaliaris >> wrote: >> >>> PR #16999: https://github.com/numpy/numpy/pull/16999 >>> >>> Hello all, >>> this PR adds the two 1D Chebyshev transform functions `chebyfft` and >>> `ichebyfft` into the `numpy.fft` module, utilizing the real FFTs `rfft` >>> and >>> `irfft`, respectively. As far as I understand, `pockefft` does not >>> support >>> cosine transforms natively; for this reason, an even extension of the >>> input >>> vector is constructed, whose real FFT corresponds to a cosine transform. 
>>> >>> The motivation behind these two additions is the ability to quickly >>> perform >>> direct and inverse Chebyshev transforms with `numpy`, without the need to >>> write scripts that do the necessary (although minor) modifications. >>> Chebyshev transforms are used often e.g. in the spectral integration of >>> PDE >>> problems; thus, I believe having them implemented in `numpy` would be >>> useful >>> to many people in the community. >>> >>> I'm happy to get comments/feedback on this feature, and on whether it's >>> something more people would be interested in. Also, I'm not entirely sure >>> what part of this functionality is/isn't present in `scipy`, so that the >>> two >>> `fft` modules remain consistent with one another. >>> >> >> Hi Chris, that's a good question. scipy.fft is a superset of numpy.fft, >> and the functionality included in NumPy is really only the basics that are >> needed in many fields. The reason for the duplication stems from way back >> when we had no wheels and SciPy was very hard to install. So I don't think >> there's anything we'd add to numpy.fft at this point. >> >> As I commented on your PR, it would be useful to add some references and >> applications, and then make your proposal on the scipy-dev list. >> >> > Chebfun is based around this method, > they use series with possibly thousands of terms. Trefethen is a big fan of > Chebyshev polynomials. > I am quite sure that Chebyshev transforms are useful, but it does feel like something more directly suitable for SciPy than NumPy. The current division for submodules like numpy.fft/scipy.fft and numpy.linalg/scipy.linalg exists for outdated historical reasons, but at this point it is easiest for users to understand if has SciPy has a strict superset of NumPy's functionality here. Chuck > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Wed Aug 5 12:13:56 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 05 Aug 2020 11:13:56 -0500 Subject: [Numpy-discussion] New random.Generator method: permuted In-Reply-To: References: Message-ID: <8a00e14e7e1c50b6dc82ff0f6c9fc62740912505.camel@sipsolutions.net> On Mon, 2020-08-03 at 14:09 -0400, Warren Weckesser wrote: > In one of the previous weekly zoom meetings, it was suggested > to ping the mailing list about an updated PR that implements > the `permuted` method for the Generator class in numpy.random. > The relevant issue is > > https://github.com/numpy/numpy/issues/5173 > > and the PR is > > https://github.com/numpy/numpy/pull/15121 > > The new method (as it would be called from Python) is > > permuted(x, axis=None, out=None) > I like the proposed API and name personally, and think we should go ahead with it. It is a useful complement to `shuffle` (and sorting). The followup questions of adding `shuffled`, and what to do about `permutation` are important, but I agree with viewing them as a second step. This API has been discussed a few times in various depths, so I assume that `permuted` as a name and API has largely settle down, and reached consensus (at last if there is not more activity here or on the PR). So, as a heads up, I am planning to review and push that forward in the next days, but more discussion is of course welcome. We still have time to decide differently. 
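For concreteness, here is a small usage sketch based on the docstring in
the PR (illustrative only, since the details can still change before the
release):

import numpy as np

rng = np.random.default_rng(12345)
x = np.arange(12).reshape(3, 4)

# shuffle the contents of each row independently; x itself is unchanged
y = rng.permuted(x, axis=1)

# shuffle the contents of each column independently, in place, via out=
rng.permuted(x, axis=0, out=x)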
Cheers, Sebastian > The CircleCI rendering of the docstring from the pull request is > > > https://14745-908607-gh.circle-artifacts.com/0/doc/build/html/reference/random/generated/numpy.random.Generator.permuted.html > > The new method is an alternative to the existing `shuffle` and > `permutation` methods. It handles the `axis` parameter similar > to how the sort methods do, i.e. when `axis` is given, the slices > along the axis are shuffled independently. This new documentation > (added as part of the pull request) explains the API of the various > related methods: > > > https://14745-908607-gh.circle-artifacts.com/0/doc/build/html/reference/random/generator.html#permutations > > Additional feedback on the implementation of `permuted` in the > pull request is welcome. Further discussion of the API should > be held in the issue gh-5173 (but please familiarize yourself > with the discussion of the API in gh-5173--there has already > been quite a long discussion of several different APIs). > > Thanks, > > Warren > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From ryan.c.cooper at uconn.edu Wed Aug 5 14:58:05 2020 From: ryan.c.cooper at uconn.edu (cooperrc) Date: Wed, 5 Aug 2020 11:58:05 -0700 (MST) Subject: [Numpy-discussion] Building Numpy Documentation Message-ID: <1596653885292-0.post@n7.nabble.com> I'm trying to build NumPy and its documentation from the current git repo, but I'm hitting a snag. I keep getting a RuntimeError: I'm trying to build NumPy inside the cloned repository from my fork. I'm running Arch (kernel 5.7.12) with gcc and gcc-libs installed. I'm using a fresh conda environment that has only installed Python 3.8 and Cython. Any way I can troubleshoot this issue? -- Sent from: http://numpy-discussion.10968.n7.nabble.com/ From ralf.gommers at gmail.com Wed Aug 5 15:02:10 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 5 Aug 2020 20:02:10 +0100 Subject: [Numpy-discussion] Building Numpy Documentation In-Reply-To: <1596653885292-0.post@n7.nabble.com> References: <1596653885292-0.post@n7.nabble.com> Message-ID: On Wed, Aug 5, 2020 at 7:58 PM cooperrc wrote: > I'm trying to build NumPy and its documentation from the current git repo, > but I'm hitting a snag. I keep getting a RuntimeError: > > > I'm trying to build NumPy inside the cloned repository from my fork. I'm > running Arch (kernel 5.7.12) with gcc and gcc-libs installed. I'm using a > fresh conda environment that has only installed Python 3.8 and Cython. > > Any way I can troubleshoot this issue? > Opening an issue and including the command you're running and the full build/test log ending in that RuntimeError would be the way to get the input you need. Cheers, Ralf > > > > -- > Sent from: http://numpy-discussion.10968.n7.nabble.com/ > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From numpy_gsod at bigriver.xyz Wed Aug 5 15:15:58 2020 From: numpy_gsod at bigriver.xyz (Ben Nathanson) Date: Wed, 5 Aug 2020 15:15:58 -0400 Subject: [Numpy-discussion] Add Chebyshev (cosine) transforms implemented via FFTs In-Reply-To: References: <1596501591921-0.post@n7.nabble.com> Message-ID: > scipy.fft is a superset of numpy.fft, and the functionality included in NumPy is really only the basics that are needed in many fields. Exactly this sentence might be useful on top of the FFT page. Is the right page reference/routines.fft.html? I can submit a PR. From ralf.gommers at gmail.com Wed Aug 5 16:01:41 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 5 Aug 2020 21:01:41 +0100 Subject: [Numpy-discussion] Add Chebyshev (cosine) transforms implemented via FFTs In-Reply-To: References: <1596501591921-0.post@n7.nabble.com> Message-ID: On Wed, Aug 5, 2020 at 8:16 PM Ben Nathanson wrote: > > scipy.fft is a superset of numpy.fft, and the functionality included in > NumPy is really only the basics that are needed in many fields. > > Exactly this sentence might be useful on top of the FFT page. > > Is the right page reference/routines.fft.html? I can submit a PR. > A PR would be great, thanks Ben. And yes, that's the right page. Cheers, Ralf _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ryan.c.cooper at uconn.edu Wed Aug 5 16:35:16 2020 From: ryan.c.cooper at uconn.edu (cooperrc) Date: Wed, 5 Aug 2020 13:35:16 -0700 (MST) Subject: [Numpy-discussion] Building Numpy Documentation In-Reply-To: References: <1596653885292-0.post@n7.nabble.com> Message-ID: <1596659716279-0.post@n7.nabble.com> Thanks! I'll post the issue and full output. -- Sent from: http://numpy-discussion.10968.n7.nabble.com/ From cv1038 at wildcats.unh.edu Wed Aug 5 20:16:01 2020 From: cv1038 at wildcats.unh.edu (Chris Val) Date: Wed, 5 Aug 2020 17:16:01 -0700 (MST) Subject: [Numpy-discussion] Add Chebyshev (cosine) transforms implemented via FFTs In-Reply-To: References: <1596501591921-0.post@n7.nabble.com> Message-ID: <1596672961102-0.post@n7.nabble.com> Stephan Hoyer-2 wrote > On Tue, Aug 4, 2020 at 6:10 PM Charles R Harris < > charlesr.harris@ > > > wrote: > >> >> >> On Tue, Aug 4, 2020 at 4:55 AM Ralf Gommers < > ralf.gommers@ > > >> wrote: >> >>> >>> >>> On Tue, Aug 4, 2020 at 1:49 AM Chris Vavaliaris < > cv1038 at .unh > > >>> wrote: >>> >>>> PR #16999: https://github.com/numpy/numpy/pull/16999 >>>> >>>> Hello all, >>>> this PR adds the two 1D Chebyshev transform functions `chebyfft` and >>>> `ichebyfft` into the `numpy.fft` module, utilizing the real FFTs `rfft` >>>> and >>>> `irfft`, respectively. As far as I understand, `pockefft` does not >>>> support >>>> cosine transforms natively; for this reason, an even extension of the >>>> input >>>> vector is constructed, whose real FFT corresponds to a cosine >>>> transform. >>>> >>>> The motivation behind these two additions is the ability to quickly >>>> perform >>>> direct and inverse Chebyshev transforms with `numpy`, without the need >>>> to >>>> write scripts that do the necessary (although minor) modifications. >>>> Chebyshev transforms are used often e.g. in the spectral integration of >>>> PDE >>>> problems; thus, I believe having them implemented in `numpy` would be >>>> useful >>>> to many people in the community. 
>>>> >>>> I'm happy to get comments/feedback on this feature, and on whether it's >>>> something more people would be interested in. Also, I'm not entirely >>>> sure >>>> what part of this functionality is/isn't present in `scipy`, so that >>>> the >>>> two >>>> `fft` modules remain consistent with one another. >>>> >>> >>> Hi Chris, that's a good question. scipy.fft is a superset of numpy.fft, >>> and the functionality included in NumPy is really only the basics that >>> are >>> needed in many fields. The reason for the duplication stems from way >>> back >>> when we had no wheels and SciPy was very hard to install. So I don't >>> think >>> there's anything we'd add to numpy.fft at this point. >>> >>> As I commented on your PR, it would be useful to add some references and >>> applications, and then make your proposal on the scipy-dev list. >>> >>> >> Chebfun <https://github.com/chebfun/chebfun> is based around this >> method, >> they use series with possibly thousands of terms. Trefethen is a big fan >> of >> Chebyshev polynomials. >> > > I am quite sure that Chebyshev transforms are useful, but it does feel > like > something more directly suitable for SciPy than NumPy. The current > division > for submodules like numpy.fft/scipy.fft and numpy.linalg/scipy.linalg > exists for outdated historical reasons, but at this point it is easiest > for > users to understand if has SciPy has a strict superset of NumPy's > functionality here. > > > Chuck >> _______________________________________________ >> NumPy-Discussion mailing list >> > NumPy-Discussion@ >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion@ > https://mail.python.org/mailman/listinfo/numpy-discussion Thank you all for the replies and feedback! I now have a better understanding of the differences between the NumPy and SciPy FFT modules; it certainly looks like SciPy would be a more appropriate place for such a feature. > Chebfun is based around this method, they use series with possibly > thousands of terms. Trefethen is a big fan of Chebyshev polynomials. > > Chuck Thank you Chuck for your comment; yes I'm aware of Chebfun and of Trefethen's work in general, it's mostly the work of his and some of his past grad students that got me interested in Chebyshev methods in the first place! Chris -- Sent from: http://numpy-discussion.10968.n7.nabble.com/ From kevin.k.sheppard at gmail.com Fri Aug 7 09:00:12 2020 From: kevin.k.sheppard at gmail.com (Kevin Sheppard) Date: Fri, 7 Aug 2020 14:00:12 +0100 Subject: [Numpy-discussion] Replacement for Rackspace Message-ID: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> An HTML attachment was scrubbed... URL: From andy.terrel at gmail.com Fri Aug 7 09:18:39 2020 From: andy.terrel at gmail.com (Andy Ray Terrel) Date: Fri, 7 Aug 2020 08:18:39 -0500 Subject: [Numpy-discussion] Replacement for Rackspace In-Reply-To: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> References: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> Message-ID: If you are looking for servers, I can help with the NumFOCUS allocation from AWS. But anaconda.org will mean less work managing infrastructure. 
On Fri, Aug 7, 2020 at 8:01 AM Kevin Sheppard wrote: > The Rackspace hosted wheel endpoints at > > > > > https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com/ > > > > and > > > > > https://3f23b170c54c2533c070-1c8a9b3114517dc5fe17b7c3f8c63a43.ssl.cf2.rackcdn.com/ > > > > seem to not be working. I know NumPy, SciPy, pandas and scikit-learn are > all using a common end point on anacona.org. Statsmodels is preparing > for release, and the wheel builder at > https://github.com/MacPython/statsmodels-wheels is failing at upload. Is > there any shared resource for uploading nightlies and release wheels? Or > should we just use a separate account on anaconda.org? > > > > Thanks, > > Kevin > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ryan.c.cooper at uconn.edu Fri Aug 7 10:12:46 2020 From: ryan.c.cooper at uconn.edu (cooperrc) Date: Fri, 7 Aug 2020 07:12:46 -0700 (MST) Subject: [Numpy-discussion] Building Numpy Documentation In-Reply-To: <1596659716279-0.post@n7.nabble.com> References: <1596653885292-0.post@n7.nabble.com> <1596659716279-0.post@n7.nabble.com> Message-ID: <1596809566264-0.post@n7.nabble.com> For future reference, I opened and closed issue 17016 on github. The culprit was a `Broken Toolchain` due to a mismatch between Arch's newer ld and conda's older ld. Solution was to move the default ~/conda/envs/doc-build-38/compiler_compat/ld to ~/conda/envs/doc-build-38/compiler_compat/bak_ld Then, the build went smoothly. -- Sent from: http://numpy-discussion.10968.n7.nabble.com/ From p.j.a.cock at googlemail.com Fri Aug 7 12:23:23 2020 From: p.j.a.cock at googlemail.com (Peter Cock) Date: Fri, 7 Aug 2020 17:23:23 +0100 Subject: [Numpy-discussion] Replacement for Rackspace In-Reply-To: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> References: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> Message-ID: Ah - this is unwelcome news. See https://mail.python.org/pipermail/scipy-dev/2020-February/023990.html and https://github.com/matthew-brett/multibuild/issues/304 There are quite a few project's using the multibuild system now... Peter On Fri, Aug 7, 2020 at 2:01 PM Kevin Sheppard wrote: > The Rackspace hosted wheel endpoints at > > > > > https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com/ > > > > and > > > > > https://3f23b170c54c2533c070-1c8a9b3114517dc5fe17b7c3f8c63a43.ssl.cf2.rackcdn.com/ > > > > seem to not be working. I know NumPy, SciPy, pandas and scikit-learn are > all using a common end point on anacona.org. Statsmodels is preparing > for release, and the wheel builder at > https://github.com/MacPython/statsmodels-wheels is failing at upload. Is > there any shared resource for uploading nightlies and release wheels? Or > should we just use a separate account on anaconda.org? > > > > Thanks, > > Kevin > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From ralf.gommers at gmail.com Fri Aug 7 17:35:26 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 7 Aug 2020 22:35:26 +0100 Subject: [Numpy-discussion] Replacement for Rackspace In-Reply-To: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> References: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> Message-ID: On Fri, Aug 7, 2020 at 2:00 PM Kevin Sheppard wrote: > The Rackspace hosted wheel endpoints at > > > > > https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com/ > > > > and > > > > > https://3f23b170c54c2533c070-1c8a9b3114517dc5fe17b7c3f8c63a43.ssl.cf2.rackcdn.com/ > > > > seem to not be working. I know NumPy, SciPy, pandas and scikit-learn are > all using a common end point on anacona.org. Statsmodels is preparing > for release, and the wheel builder at > https://github.com/MacPython/statsmodels-wheels is failing at upload. Is > there any shared resource for uploading nightlies and release wheels? Or > should we just use a separate account on anaconda.org? > Copying the numpy-wheels Azure/TravisCI code for this should work, it's pretty concise, e.g.: https://github.com/MacPython/numpy-wheels/blob/master/azure/posix.yml#L87 Not sure about the account credentials, Matti would know. Cheers, Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ilhanpolat at gmail.com Sun Aug 9 18:15:06 2020 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Mon, 10 Aug 2020 00:15:06 +0200 Subject: [Numpy-discussion] Type declaration to include all valid numerical NumPy types for Cython Message-ID: Hi all, As you might have seen my recent mails in Cython list, I'm trying to cook up an input validator for the linalg.solve() function. The machinery of SciPy linalg is as follows: Some input comes in passes through np.asarray() then depending on the resulting dtype of the numpy array we choose a LAPACK flavor (s,d,c,z) and off it goes through f2py to lalaland and comes back with some result. For the backslash polyalgorithm I need the arrays to be contiguous (C- or F- doesn't matter) and any of the four (possibly via making new copies) float, double, float complex, double complex after the intake because we are using wrapped fortran code (LAPACK) in SciPy. So my difficulty is how to type such function input, say, ctypedef fused numeric_numpy_t: bint cnp.npy_bool cnp.int_t cnp.intp_t cnp.int8_t cnp.int16_t cnp.int32_t cnp.int64_t cnp.uint8_t cnp.uint16_t cnp.uint32_t cnp.uint64_t cnp.float32_t cnp.float64_t cnp.complex64_t cnp.complex128_t Is this acceptable or something else needs to be used? Then there is the storyof np.complex256 and mysterious np.float16. Then there is the Linux vs Windows platform dependence issue and possibly some more that I can't comprehend. Then there are datetime, str, unicode etc. that need to be rejected. So this is quickly getting out of hand for my small brain. To be honest, I am a bit running out of steam working with this issue even though I managed to finish the actual difficult algorithmic part but got stuck here. I am quite surprised how fantastically complicated and confusing both NumPy and Cython docs about this stuff. Shouldn't we keep a generic fused type for such usage? Or maybe there already exists but I don't know and would be really grateful for pointers. 
Here I wrote a dummy typed Cython function just for type checking: cpdef inline bint ncc( numeric_numpy_t[:, :] a): print(a.is_f_contig()) print(a.is_c_contig()) return a.is_f_contig() or a.is_c_contig() And this is a dummy loop (with aliases) just to check whether fused type is working or not (on windows I couldn't make it work for float16). for x in (np.uint, np.uintc, np.uintp, np.uint0, np.uint8, np.uint16, np.uint32, np.uint64, np.int, np.intc, np.intp, np.int0, np.int8, np.int16, np.int32,np.int64, np.float, np.float32, np.float64, np.float_, np.complex, np.complex64, np.complex128, np.complex_): print(x) C = np.arange(25., dtype=x).reshape(5, 5) ncc(C) Thanks in advance, ilhan -------------- next part -------------- An HTML attachment was scrubbed... URL: From ewm at redtetrahedron.org Sun Aug 9 20:49:37 2020 From: ewm at redtetrahedron.org (Eric Moore) Date: Sun, 9 Aug 2020 20:49:37 -0400 Subject: [Numpy-discussion] Type declaration to include all valid numerical NumPy types for Cython In-Reply-To: References: Message-ID: If that is really all you need, then the version in python is: def convert_one(a): """ Converts input with arbitrary layout and dtype to a blas/lapack compatible dtype with either C or F order. Acceptable objects are passed through without making copies. """ a_arr = np.asarray(a) dtype = np.result_type(a_arr, 1.0) # need to handle these separately if dtype == np.longdouble: dtype = np.dtype('d') elif dtype == np.clongdouble: dtype = np.dtype('D') elif dtype == np.float16: dtype = np.dtype('f') # explicitly force a copy if a_arr isn't one segment return np.array(a_arr, dtype, copy=not a_arr.flags.forc, order='K') In Cython, you could just run exactly this code and it's probably fine. The could also be rewritten using the C calls if you really wanted. You need to either provide your own or use a casting table and the copy / conversion routines from somewhere. Cython, to my knowledge, doesn't provide these things, but Numpy does. Eric On Sun, Aug 9, 2020 at 6:16 PM Ilhan Polat wrote: > Hi all, > > As you might have seen my recent mails in Cython list, I'm trying to cook > up an input validator for the linalg.solve() function. The machinery of > SciPy linalg is as follows: > > Some input comes in passes through np.asarray() then depending on the > resulting dtype of the numpy array we choose a LAPACK flavor (s,d,c,z) and > off it goes through f2py to lalaland and comes back with some result. > > For the backslash polyalgorithm I need the arrays to be contiguous (C- or > F- doesn't matter) and any of the four (possibly via making new copies) > float, double, float complex, double complex after the intake because we > are using wrapped fortran code (LAPACK) in SciPy. So my difficulty is how > to type such function input, say, > > ctypedef fused numeric_numpy_t: > bint > cnp.npy_bool > cnp.int_t > cnp.intp_t > cnp.int8_t > cnp.int16_t > cnp.int32_t > cnp.int64_t > cnp.uint8_t > cnp.uint16_t > cnp.uint32_t > cnp.uint64_t > cnp.float32_t > cnp.float64_t > cnp.complex64_t > cnp.complex128_t > > Is this acceptable or something else needs to be used? Then there is the > storyof np.complex256 and mysterious np.float16. Then there is the Linux vs > Windows platform dependence issue and possibly some more that I can't > comprehend. Then there are datetime, str, unicode etc. that need to be > rejected. So this is quickly getting out of hand for my small brain. 
> > To be honest, I am a bit running out of steam working with this issue even > though I managed to finish the actual difficult algorithmic part but got > stuck here. I am quite surprised how fantastically complicated and > confusing both NumPy and Cython docs about this stuff. Shouldn't we keep a > generic fused type for such usage? Or maybe there already exists but I > don't know and would be really grateful for pointers. > > Here I wrote a dummy typed Cython function just for type checking: > > cpdef inline bint ncc( numeric_numpy_t[:, :] a): > print(a.is_f_contig()) > print(a.is_c_contig()) > > return a.is_f_contig() or a.is_c_contig() > > And this is a dummy loop (with aliases) just to check whether fused type > is working or not (on windows I couldn't make it work for float16). > > for x in (np.uint, np.uintc, np.uintp, np.uint0, np.uint8, np.uint16, > np.uint32, > np.uint64, np.int, np.intc, np.intp, np.int0, np.int8, np.int16, > np.int32,np.int64, np.float, np.float32, np.float64, np.float_, > np.complex, np.complex64, np.complex128, np.complex_): > print(x) > C = np.arange(25., dtype=x).reshape(5, 5) > ncc(C) > > > Thanks in advance, > ilhan > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ilhanpolat at gmail.com Mon Aug 10 05:24:19 2020 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Mon, 10 Aug 2020 11:24:19 +0200 Subject: [Numpy-discussion] Type declaration to include all valid numerical NumPy types for Cython In-Reply-To: References: Message-ID: Yes it seems like I don't have any other option anyways. There is a bit of a penalty but I guess this should do the trick. Thanks Eric (again! :D) On Mon, Aug 10, 2020 at 2:51 AM Eric Moore wrote: > If that is really all you need, then the version in python is: > > def convert_one(a): > """ > Converts input with arbitrary layout and dtype to a blas/lapack > compatible dtype with either C or F order. Acceptable objects are > passed > through without making copies. > """ > > a_arr = np.asarray(a) > dtype = np.result_type(a_arr, 1.0) > > # need to handle these separately > if dtype == np.longdouble: > dtype = np.dtype('d') > elif dtype == np.clongdouble: > dtype = np.dtype('D') > elif dtype == np.float16: > dtype = np.dtype('f') > > # explicitly force a copy if a_arr isn't one segment > return np.array(a_arr, dtype, copy=not a_arr.flags.forc, order='K') > > In Cython, you could just run exactly this code and it's probably fine. > The could also be rewritten using the C calls if you really wanted. > > You need to either provide your own or use a casting table and the copy / > conversion routines from somewhere. Cython, to my knowledge, doesn't > provide these things, but Numpy does. > > Eric > > On Sun, Aug 9, 2020 at 6:16 PM Ilhan Polat wrote: > >> Hi all, >> >> As you might have seen my recent mails in Cython list, I'm trying to cook >> up an input validator for the linalg.solve() function. The machinery of >> SciPy linalg is as follows: >> >> Some input comes in passes through np.asarray() then depending on the >> resulting dtype of the numpy array we choose a LAPACK flavor (s,d,c,z) and >> off it goes through f2py to lalaland and comes back with some result. 
>> >> For the backslash polyalgorithm I need the arrays to be contiguous (C- or >> F- doesn't matter) and any of the four (possibly via making new copies) >> float, double, float complex, double complex after the intake because we >> are using wrapped fortran code (LAPACK) in SciPy. So my difficulty is how >> to type such function input, say, >> >> ctypedef fused numeric_numpy_t: >> bint >> cnp.npy_bool >> cnp.int_t >> cnp.intp_t >> cnp.int8_t >> cnp.int16_t >> cnp.int32_t >> cnp.int64_t >> cnp.uint8_t >> cnp.uint16_t >> cnp.uint32_t >> cnp.uint64_t >> cnp.float32_t >> cnp.float64_t >> cnp.complex64_t >> cnp.complex128_t >> >> Is this acceptable or something else needs to be used? Then there is the >> storyof np.complex256 and mysterious np.float16. Then there is the Linux vs >> Windows platform dependence issue and possibly some more that I can't >> comprehend. Then there are datetime, str, unicode etc. that need to be >> rejected. So this is quickly getting out of hand for my small brain. >> >> To be honest, I am a bit running out of steam working with this issue >> even though I managed to finish the actual difficult algorithmic part but >> got stuck here. I am quite surprised how fantastically complicated and >> confusing both NumPy and Cython docs about this stuff. Shouldn't we keep a >> generic fused type for such usage? Or maybe there already exists but I >> don't know and would be really grateful for pointers. >> >> Here I wrote a dummy typed Cython function just for type checking: >> >> cpdef inline bint ncc( numeric_numpy_t[:, :] a): >> print(a.is_f_contig()) >> print(a.is_c_contig()) >> >> return a.is_f_contig() or a.is_c_contig() >> >> And this is a dummy loop (with aliases) just to check whether fused type >> is working or not (on windows I couldn't make it work for float16). >> >> for x in (np.uint, np.uintc, np.uintp, np.uint0, np.uint8, np.uint16, >> np.uint32, >> np.uint64, np.int, np.intc, np.intp, np.int0, np.int8, >> np.int16, >> np.int32,np.int64, np.float, np.float32, np.float64, np.float_, >> np.complex, np.complex64, np.complex128, np.complex_): >> print(x) >> C = np.arange(25., dtype=x).reshape(5, 5) >> ncc(C) >> >> >> Thanks in advance, >> ilhan >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Mon Aug 10 11:30:23 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 10 Aug 2020 10:30:23 -0500 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions Message-ID: Hi all, as a heads up that Peter Entschev has a PR open to add `like=` to most array creation functions, my current plan is to merge it soon as a preliminary API and bring it up again before the actual release (in a few months). This allows overriding for array-likes, e.g. 
it will allow: arr = np.asarray([3], like=dask_array) type(arr) is dask.array.Array This was proposed in NEP 35: https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html Although that has not been accepted as of now, the PR is: https://github.com/numpy/numpy/pull/16935 This was discussed in a smaller group, and is an attempt to see how we can make the array-function protocol viable to allow packages such as sklearn to work with non-NumPy arrays. As of now, this would be experimental and can revisit it before the actual NumPy release. We should probably discuss accepting NEP 35 more. At this time, I hope that we can put in the functionality to facilitate this discussion, rather the other way around. If anyone feels nervous about this step, I would be happy to document that we will not include it in the next release unless the NEP is accepted first, or at least hide it behind an environment variable. Cheers, Sebastian -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From einstein.edison at gmail.com Mon Aug 10 11:35:14 2020 From: einstein.edison at gmail.com (Hameer Abbasi) Date: Mon, 10 Aug 2020 17:35:14 +0200 Subject: [Numpy-discussion] Experimental =?utf-8?Q?=60like=3D=60_?=attribute for array creation functions In-Reply-To: References: Message-ID: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> Hi, We should have a higher-bandwidth meeting/communication for all stakeholders, and particularly some library authors, to see what would be good for them. We should definitely have language in the NEP that says it won?t be in a release unless the NEP is accepted. Best regards, Hameer Abbasi -- Sent from Canary (https://canarymail.io) > On Monday, Aug 10, 2020 at 5:31 PM, Sebastian Berg wrote: > Hi all, > > as a heads up that Peter Entschev has a PR open to add `like=` to > most array creation functions, my current plan is to merge it soon as a preliminary API and bring it up again before the actual release (in a few months). This allows overriding for array-likes, e.g. it will allow: > > > arr = np.asarray([3], like=dask_array) > type(arr) is dask.array.Array > > This was proposed in NEP 35: > > https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html > > Although that has not been accepted as of now, the PR is: > > https://github.com/numpy/numpy/pull/16935 > > > This was discussed in a smaller group, and is an attempt to see how we > can make the array-function protocol viable to allow packages such as > sklearn to work with non-NumPy arrays. > > As of now, this would be experimental and can revisit it before the > actual NumPy release. We should probably discuss accepting NEP 35 > more. At this time, I hope that we can put in the functionality to > facilitate this discussion, rather the other way around. > > If anyone feels nervous about this step, I would be happy to document > that we will not include it in the next release unless the NEP is > accepted first, or at least hide it behind an environment variable. > > Cheers, > > Sebastian > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sebastian at sipsolutions.net Mon Aug 10 15:36:41 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 10 Aug 2020 14:36:41 -0500 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> Message-ID: <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> On Mon, 2020-08-10 at 17:35 +0200, Hameer Abbasi wrote: > Hi, > > We should have a higher-bandwidth meeting/communication for all > stakeholders, and particularly some library authors, to see what > would be good for them. > > We should definitely have language in the NEP that says it won?t be > in a release unless the NEP is accepted. In that case, I think the important part is to have language right now in the implementation, although that can refer to the NEP itself of course. You can't expect everyone who may be tempted to use it to actually read the NEP draft, at least not without pointing it out. I will say that I think it is not very high risk, because I think annoying or not, the argument could be deprecated again with a transition short phase. Admittedly, that argument only works if we have a replacement solution. Cheers, Sebastian > > Best regards, > Hameer Abbasi > > -- > Sent from Canary (https://canarymail.io) > > > On Monday, Aug 10, 2020 at 5:31 PM, Sebastian Berg < > > sebastian at sipsolutions.net (mailto:sebastian at sipsolutions.net)> > > wrote: > > Hi all, > > > > as a heads up that Peter Entschev has a PR open to add `like=` to > > most array creation functions, my current plan is to merge it soon > > as a preliminary API and bring it up again before the actual > > release (in a few months). This allows overriding for array-likes, > > e.g. it will allow: > > > > > > arr = np.asarray([3], like=dask_array) > > type(arr) is dask.array.Array > > > > This was proposed in NEP 35: > > > > https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html > > > > Although that has not been accepted as of now, the PR is: > > > > https://github.com/numpy/numpy/pull/16935 > > > > > > This was discussed in a smaller group, and is an attempt to see how > > we > > can make the array-function protocol viable to allow packages such > > as > > sklearn to work with non-NumPy arrays. > > > > As of now, this would be experimental and can revisit it before the > > actual NumPy release. We should probably discuss accepting NEP 35 > > more. At this time, I hope that we can put in the functionality to > > facilitate this discussion, rather the other way around. > > > > If anyone feels nervous about this step, I would be happy to > > document > > that we will not include it in the next release unless the NEP is > > accepted first, or at least hide it behind an environment variable. > > > > Cheers, > > > > Sebastian > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From pwang at anaconda.com Mon Aug 10 15:54:38 2020 From: pwang at anaconda.com (Peter Wang) Date: Mon, 10 Aug 2020 14:54:38 -0500 Subject: [Numpy-discussion] Replacement for Rackspace In-Reply-To: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> References: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> Message-ID: FWIW, we're happy to provide wheel hosting for statsmodels on anaconda.org. -Peter On Fri, Aug 7, 2020 at 8:01 AM Kevin Sheppard wrote: > The Rackspace hosted wheel endpoints at > > > > > https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com/ > > > > and > > > > > https://3f23b170c54c2533c070-1c8a9b3114517dc5fe17b7c3f8c63a43.ssl.cf2.rackcdn.com/ > > > > seem to not be working. I know NumPy, SciPy, pandas and scikit-learn are > all using a common end point on anacona.org. Statsmodels is preparing > for release, and the wheel builder at > https://github.com/MacPython/statsmodels-wheels is failing at upload. Is > there any shared resource for uploading nightlies and release wheels? Or > should we just use a separate account on anaconda.org? > > > > Thanks, > > Kevin > > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matti.picus at gmail.com Mon Aug 10 16:19:42 2020 From: matti.picus at gmail.com (Matti Picus) Date: Mon, 10 Aug 2020 23:19:42 +0300 Subject: [Numpy-discussion] Replacement for Rackspace In-Reply-To: References: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> Message-ID: <304cbb1c-1871-0da4-097c-70d1c7c6d8e9@gmail.com> On 8/10/20 10:54 PM, Peter Wang wrote: > FWIW, we're happy to provide wheel hosting for statsmodels on > anaconda.org . > > -Peter > > On Fri, Aug 7, 2020 at 8:01 AM Kevin Sheppard > > wrote: > > The Rackspace hosted wheel endpoints at > > https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com/ > > and > > https://3f23b170c54c2533c070-1c8a9b3114517dc5fe17b7c3f8c63a43.ssl.cf2.rackcdn.com/ > > seem to not be working.? I know NumPy, SciPy, pandas and > scikit-learn are all using a common end point on anacona.org > . Statsmodels is preparing for? release, and > the wheel builder at > https://github.com/MacPython/statsmodels-wheels is failing at > upload.? Is there any shared resource for uploading nightlies and > release wheels?? Or should we just use a separate account on > anaconda.org ? > > Thanks, > > Kevin > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > > Thanks Peter, anaconda is generously hosting projects at https://anaconda.org/scipy-wheels-nightly/ (for weekly development releases that can be used to test downstream projects) and https://anaconda.org/multibuild-wheels-staging (for staging wheels to be tested for release on PyPI). The trick is that CI needs a token so it can upload to those organizations. Kevin, we can either add you to the groups you can create a token, or one of the current members could create tokens and transport them safely to Kevin. Please disucss it with me (or one of the other members https://anaconda.org/multibuild-wheels-staging/groups). 
Matti From p.j.a.cock at googlemail.com Mon Aug 10 17:39:17 2020 From: p.j.a.cock at googlemail.com (Peter Cock) Date: Mon, 10 Aug 2020 22:39:17 +0100 Subject: [Numpy-discussion] Replacement for Rackspace In-Reply-To: <304cbb1c-1871-0da4-097c-70d1c7c6d8e9@gmail.com> References: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> <304cbb1c-1871-0da4-097c-70d1c7c6d8e9@gmail.com> Message-ID: Hi Matti, Is this an open invitation to the wider Numpy ecosystem? I am interested on behalf of Biopython which was using the donated Rackspace for multibuild wheel staging prior to PyPy release (although having weekly test releases sounds interesting too). I would be happy to continue this discussion off list if you prefer, Thank you, Peter On Mon, Aug 10, 2020 at 9:20 PM Matti Picus wrote: > > On 8/10/20 10:54 PM, Peter Wang wrote: > > FWIW, we're happy to provide wheel hosting for statsmodels on > > anaconda.org . > > > > -Peter > > > > On Fri, Aug 7, 2020 at 8:01 AM Kevin Sheppard > > > wrote: > > > > The Rackspace hosted wheel endpoints at > > > > > https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com/ > > > > and > > > > > https://3f23b170c54c2533c070-1c8a9b3114517dc5fe17b7c3f8c63a43.ssl.cf2.rackcdn.com/ > > > > seem to not be working. I know NumPy, SciPy, pandas and > > scikit-learn are all using a common end point on anacona.org > > . Statsmodels is preparing for release, and > > the wheel builder at > > https://github.com/MacPython/statsmodels-wheels is failing at > > upload. Is there any shared resource for uploading nightlies and > > release wheels? Or should we just use a separate account on > > anaconda.org ? > > > > Thanks, > > > > Kevin > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > Thanks Peter, anaconda is generously hosting projects at > https://anaconda.org/scipy-wheels-nightly/ (for weekly development > releases that can be used to test downstream projects) and > https://anaconda.org/multibuild-wheels-staging (for staging wheels to be > tested for release on PyPI). > > > The trick is that CI needs a token so it can upload to those > organizations. Kevin, we can either add you to the groups you can create > a token, or one of the current members could create tokens and transport > them safely to Kevin. Please disucss it with me (or one of the other > members https://anaconda.org/multibuild-wheels-staging/groups). > > Matti > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Aug 10 18:16:34 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 10 Aug 2020 23:16:34 +0100 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> Message-ID: On Mon, Aug 10, 2020 at 8:37 PM Sebastian Berg wrote: > On Mon, 2020-08-10 at 17:35 +0200, Hameer Abbasi wrote: > > Hi, > > > > We should have a higher-bandwidth meeting/communication for all > > stakeholders, and particularly some library authors, to see what > > would be good for them. 
> I'm not sure that helps. At this point there's little progress since the last meeting, I think the plan is unchanged: we need implementations of all the options on offer, and then try them out in PRs for scikit-learn, SciPy and perhaps another package who's maintainers are interested, to test like=, __array_module__ in realistic situations. > > > We should definitely have language in the NEP that says it won?t be > > in a release unless the NEP is accepted. > > In that case, I think the important part is to have language right now > in the implementation, although that can refer to the NEP itself of > course. > You can't expect everyone who may be tempted to use it to actually read > the NEP draft, at least not without pointing it out. > Agreed, I think the decision is on this list not in the NEP, and to make sure we won't forget we need an issue opened with the 1.20 milestone. Cheers, Ralf > I will say that I think it is not very high risk, because I think > annoying or not, the argument could be deprecated again with a > transition short phase. Admittedly, that argument only works if we have > a replacement solution. > > Cheers, > > Sebastian > > > > > > Best regards, > > Hameer Abbasi > > > > -- > > Sent from Canary (https://canarymail.io) > > > > > On Monday, Aug 10, 2020 at 5:31 PM, Sebastian Berg < > > > sebastian at sipsolutions.net (mailto:sebastian at sipsolutions.net)> > > > wrote: > > > Hi all, > > > > > > as a heads up that Peter Entschev has a PR open to add `like=` to > > > most array creation functions, my current plan is to merge it soon > > > as a preliminary API and bring it up again before the actual > > > release (in a few months). This allows overriding for array-likes, > > > e.g. it will allow: > > > > > > > > > arr = np.asarray([3], like=dask_array) > > > type(arr) is dask.array.Array > > > > > > This was proposed in NEP 35: > > > > > > > https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html > > > > > > Although that has not been accepted as of now, the PR is: > > > > > > https://github.com/numpy/numpy/pull/16935 > > > > > > > > > This was discussed in a smaller group, and is an attempt to see how > > > we > > > can make the array-function protocol viable to allow packages such > > > as > > > sklearn to work with non-NumPy arrays. > > > > > > As of now, this would be experimental and can revisit it before the > > > actual NumPy release. We should probably discuss accepting NEP 35 > > > more. At this time, I hope that we can put in the functionality to > > > facilitate this discussion, rather the other way around. > > > > > > If anyone feels nervous about this step, I would be happy to > > > document > > > that we will not include it in the next release unless the NEP is > > > accepted first, or at least hide it behind an environment variable. > > > > > > Cheers, > > > > > > Sebastian > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From matti.picus at gmail.com Tue Aug 11 01:33:42 2020 From: matti.picus at gmail.com (Matti Picus) Date: Tue, 11 Aug 2020 08:33:42 +0300 Subject: [Numpy-discussion] Replacement for Rackspace In-Reply-To: References: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> <304cbb1c-1871-0da4-097c-70d1c7c6d8e9@gmail.com> Message-ID: An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Tue Aug 11 17:15:41 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 11 Aug 2020 16:15:41 -0500 Subject: [Numpy-discussion] NumPy Development Meeting Today - Triage Focus Message-ID: <191985aa84e634e8c3f4aff72389f66f4c114b32.camel@sipsolutions.net> Hi all, Our bi-weekly triage-focused NumPy development meeting is tomorrow (Wednesday, August 12th) at 11 am Pacific Time (18:00 UTC). Everyone is invited to join in and edit the work-in-progress meeting topics and notes: https://hackmd.io/68i_JvOYQfy9ERiHgXMPvg I encourage everyone to notify us of issues or PRs that you feel should be prioritized or simply discussed briefly. Just comment on it so we can label it, or add your PR/issue to this weeks topics for discussion. Best regards Sebastian -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From ilhanpolat at gmail.com Wed Aug 12 19:24:57 2020 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Thu, 13 Aug 2020 01:24:57 +0200 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> Message-ID: For what is worth, as a potential consumer in SciPy, it really doesn't say anything (both in NEP and the PR) about how the regular users of NumPy will benefit from this. If only and only 3rd parties are going to benefit from it, I am not sure adding a new keyword to an already confusing function is the right thing to do. Let me clarify, - This is already a very (I mean extremely very) easy keyword name to confuse with ones_like, zeros_like and by its nature any other interpretation. It is not signalling anything about the functionality that is being discussed. I would seriously consider reserving such obvious names for really obvious tasks. Because you would also expect the shape and ndim would be mimicked by the "like"d argument but it turns out it is acting more like "typeof=" and not "like=" at all. Because if we follow the semantics it reads as "make your argument asarray like the other thing" but it is actually doing, "make your argument an array with the other thing's type" which might not be an array after all. - Again, if this is meant for downstream libraries (because that's what I got out of the PR discussion, cupy, dask, and JAX were the only examples I could read) then hiding it in another function and writing with capital letters "this is not meant for numpy users" would be a much more convenient way to separate the target audience and regular users. numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may be would be quite clean and to the point with no ambiguous keywords. I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. 
Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. best, ilhan On Tue, Aug 11, 2020 at 12:18 AM Ralf Gommers wrote: > > > On Mon, Aug 10, 2020 at 8:37 PM Sebastian Berg > wrote: > >> On Mon, 2020-08-10 at 17:35 +0200, Hameer Abbasi wrote: >> > Hi, >> > >> > We should have a higher-bandwidth meeting/communication for all >> > stakeholders, and particularly some library authors, to see what >> > would be good for them. >> > > I'm not sure that helps. At this point there's little progress since the > last meeting, I think the plan is unchanged: we need implementations of all > the options on offer, and then try them out in PRs for scikit-learn, SciPy > and perhaps another package who's maintainers are interested, to test > like=, __array_module__ in realistic situations. > > > > >> > We should definitely have language in the NEP that says it won?t be >> > in a release unless the NEP is accepted. >> >> In that case, I think the important part is to have language right now >> in the implementation, although that can refer to the NEP itself of >> course. >> You can't expect everyone who may be tempted to use it to actually read >> the NEP draft, at least not without pointing it out. >> > > Agreed, I think the decision is on this list not in the NEP, and to make > sure we won't forget we need an issue opened with the 1.20 milestone. > > Cheers, > Ralf > > >> I will say that I think it is not very high risk, because I think >> annoying or not, the argument could be deprecated again with a >> transition short phase. Admittedly, that argument only works if we have >> a replacement solution. >> >> Cheers, >> >> Sebastian >> >> >> > >> > Best regards, >> > Hameer Abbasi >> > >> > -- >> > Sent from Canary (https://canarymail.io) >> > >> > > On Monday, Aug 10, 2020 at 5:31 PM, Sebastian Berg < >> > > sebastian at sipsolutions.net (mailto:sebastian at sipsolutions.net)> >> > > wrote: >> > > Hi all, >> > > >> > > as a heads up that Peter Entschev has a PR open to add `like=` to >> > > most array creation functions, my current plan is to merge it soon >> > > as a preliminary API and bring it up again before the actual >> > > release (in a few months). This allows overriding for array-likes, >> > > e.g. it will allow: >> > > >> > > >> > > arr = np.asarray([3], like=dask_array) >> > > type(arr) is dask.array.Array >> > > >> > > This was proposed in NEP 35: >> > > >> > > >> https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html >> > > >> > > Although that has not been accepted as of now, the PR is: >> > > >> > > https://github.com/numpy/numpy/pull/16935 >> > > >> > > >> > > This was discussed in a smaller group, and is an attempt to see how >> > > we >> > > can make the array-function protocol viable to allow packages such >> > > as >> > > sklearn to work with non-NumPy arrays. >> > > >> > > As of now, this would be experimental and can revisit it before the >> > > actual NumPy release. We should probably discuss accepting NEP 35 >> > > more. At this time, I hope that we can put in the functionality to >> > > facilitate this discussion, rather the other way around. 
>> > > >> > > If anyone feels nervous about this step, I would be happy to >> > > document >> > > that we will not include it in the next release unless the NEP is >> > > accepted first, or at least hide it behind an environment variable. >> > > >> > > Cheers, >> > > >> > > Sebastian >> > > >> > > _______________________________________________ >> > > NumPy-Discussion mailing list >> > > NumPy-Discussion at python.org >> > > https://mail.python.org/mailman/listinfo/numpy-discussion >> > >> > _______________________________________________ >> > NumPy-Discussion mailing list >> > NumPy-Discussion at python.org >> > https://mail.python.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jni at fastmail.com Wed Aug 12 21:44:33 2020 From: jni at fastmail.com (Juan Nunez-Iglesias) Date: Thu, 13 Aug 2020 11:44:33 +1000 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> Message-ID: <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> I?ve generally been on the ?let the NumPy devs worry about it? side of things, but I do agree with Ilhan that `like=` is confusing and `typeof=` would be a much more appropriate name for that parameter. I do think library writers are NumPy users and so I wouldn?t really make that distinction, though. Users writing their own analysis code could very well be interested in writing code using numpy functions that will transparently work when the input is a CuPy array or whatever. I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. Food for thought. Juan. > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: > > For what is worth, as a potential consumer in SciPy, it really doesn't say anything (both in NEP and the PR) about how the regular users of NumPy will benefit from this. If only and only 3rd parties are going to benefit from it, I am not sure adding a new keyword to an already confusing function is the right thing to do. > > Let me clarify, > > - This is already a very (I mean extremely very) easy keyword name to confuse with ones_like, zeros_like and by its nature any other interpretation. It is not signalling anything about the functionality that is being discussed. I would seriously consider reserving such obvious names for really obvious tasks. Because you would also expect the shape and ndim would be mimicked by the "like"d argument but it turns out it is acting more like "typeof=" and not "like=" at all. 
Because if we follow the semantics it reads as "make your argument asarray like the other thing" but it is actually doing, "make your argument an array with the other thing's type" which might not be an array after all. > > - Again, if this is meant for downstream libraries (because that's what I got out of the PR discussion, cupy, dask, and JAX were the only examples I could read) then hiding it in another function and writing with capital letters "this is not meant for numpy users" would be a much more convenient way to separate the target audience and regular users. numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may be would be quite clean and to the point with no ambiguous keywords. > > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. > > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. > > best, > ilhan > > > > > > > > On Tue, Aug 11, 2020 at 12:18 AM Ralf Gommers > wrote: > > > On Mon, Aug 10, 2020 at 8:37 PM Sebastian Berg > wrote: > On Mon, 2020-08-10 at 17:35 +0200, Hameer Abbasi wrote: > > Hi, > > > > We should have a higher-bandwidth meeting/communication for all > > stakeholders, and particularly some library authors, to see what > > would be good for them. > > I'm not sure that helps. At this point there's little progress since the last meeting, I think the plan is unchanged: we need implementations of all the options on offer, and then try them out in PRs for scikit-learn, SciPy and perhaps another package who's maintainers are interested, to test like=, __array_module__ in realistic situations. > > > > > > We should definitely have language in the NEP that says it won?t be > > in a release unless the NEP is accepted. > > In that case, I think the important part is to have language right now > in the implementation, although that can refer to the NEP itself of > course. > You can't expect everyone who may be tempted to use it to actually read > the NEP draft, at least not without pointing it out. > > Agreed, I think the decision is on this list not in the NEP, and to make sure we won't forget we need an issue opened with the 1.20 milestone. > > Cheers, > Ralf > > > I will say that I think it is not very high risk, because I think > annoying or not, the argument could be deprecated again with a > transition short phase. Admittedly, that argument only works if we have > a replacement solution. > > Cheers, > > Sebastian > > > > > > Best regards, > > Hameer Abbasi > > > > -- > > Sent from Canary (https://canarymail.io ) > > > > > On Monday, Aug 10, 2020 at 5:31 PM, Sebastian Berg < > > > sebastian at sipsolutions.net (mailto:sebastian at sipsolutions.net )> > > > wrote: > > > Hi all, > > > > > > as a heads up that Peter Entschev has a PR open to add `like=` to > > > most array creation functions, my current plan is to merge it soon > > > as a preliminary API and bring it up again before the actual > > > release (in a few months). This allows overriding for array-likes, > > > e.g. 
it will allow: > > > > > > > > > arr = np.asarray([3], like=dask_array) > > > type(arr) is dask.array.Array > > > > > > This was proposed in NEP 35: > > > > > > https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html > > > > > > Although that has not been accepted as of now, the PR is: > > > > > > https://github.com/numpy/numpy/pull/16935 > > > > > > > > > This was discussed in a smaller group, and is an attempt to see how > > > we > > > can make the array-function protocol viable to allow packages such > > > as > > > sklearn to work with non-NumPy arrays. > > > > > > As of now, this would be experimental and can revisit it before the > > > actual NumPy release. We should probably discuss accepting NEP 35 > > > more. At this time, I hope that we can put in the functionality to > > > facilitate this discussion, rather the other way around. > > > > > > If anyone feels nervous about this step, I would be happy to > > > document > > > that we will not include it in the next release unless the NEP is > > > accepted first, or at least hide it behind an environment variable. > > > > > > Cheers, > > > > > > Sebastian > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From peter at entschev.com Thu Aug 13 06:56:26 2020 From: peter at entschev.com (Peter Andreas Entschev) Date: Thu, 13 Aug 2020 12:56:26 +0200 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: > I am not sure adding a new keyword to an already confusing function is the right thing to do. Could you clarify what is the confusing function in question? > This is already a very (I mean extremely very) easy keyword name to confuse with ones_like, zeros_like and by its nature any other interpretation. To be fair, the usage is the same. Therefore empty_like(downstream_array, ...) and empty(downstream_array, ..., like=downstream_array) should have the exact same behavior, which is arguably redundant now. > It is not signalling anything about the functionality that is being discussed. I would seriously consider reserving such obvious names for really obvious tasks. Because you would also expect the shape and ndim would be mimicked by the "like"d argument but it turns out it is acting more like "typeof=" and not "like=" at all. 
I understand this can be confusing, and naming was one of the hardest discussions as there's no clear unambiguous name to use for this keyword, "like=" was simply the name that got closer to converging during discussions. At the same time I think "typeof=" is perhaps a better name than "like=", it could be very much confusing with "dtype=", and that would possibly just shift the confusion. > Again, if this is meant for downstream libraries (because that's what I got out of the PR discussion, cupy, dask, and JAX were the only examples I could read) then hiding it in another function and writing with capital letters "this is not meant for numpy users" would be a much more convenient way to separate the target audience and regular users. The problem with this approach is that the __array_function__ protocol relies on downstream libraries implementing functions with the same signature (for example, Dask and CuPy both implement an "array" function that matches NumPy). The purpose of __array_function__ and NEP-35 is to introduce only minimal changes to both NumPy's API and downstream libraries. Of course adding new functions for such cases would work, but IMO it would defeat the purpose of __array_function__ in general as it would require a considerable amount of work in downstream libraries, and we discussed this previously deciding that an argument is better than many new functions [1]. > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. This is what I intended to do in the Usage Guidance [2] section. Could you elaborate on what more information you'd want to see there? Or is it just a matter of reorganizing the NEP a bit to try and summarize such things right at the top? > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. TBH, I don't really know how to solve that point, so if you have any specific suggestions, that's certainly welcome. I understand the frustration for a reader trying to understand all the details, with many being only described in NEP-18 [3], but we also strive to avoid rewriting things that are written elsewhere, which would also overburden those who are aware of what's being discussed. > I've generally been on the 'let the NumPy devs worry about it' side of things, but I do agree with Ilhan that `like=` is confusing and `typeof=` would be a much more appropriate name for that parameter. To be clear, I have no strong opinion on renaming it, I'm fine either way but I think it's unrealistic to expect that we find somewhat short, unambiguous and properly descriptive names in a single name. If the preference now shifts towards the "typeof=" name, we can change it, but "like=" was really named after "empty_like" and similar functions. > I do think library writers are NumPy users and so I wouldn't really make that distinction, though. Users writing their own analysis code could very well be interested in writing code using numpy functions that will transparently work when the input is a CuPy array or whatever.
I'm guessing this is somewhat of a loose definition of "library", to some extent if you really need "like=" it means that you're writing your own functions around the NumPy API (and that IMO is a library, even if you call it something else), rather than just writing your application on top of the existing NumPy API. I'm also happy to rephrase that in the NEP if people feel it should be done. > I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. This is a good point, and we do always notify people over the mailing list of new NEPs as per NEP-0 [4], which was done for NEP-35 [5] (originally NEP-33, but renamed due to other open NEPs at that time), unfortunately not many concerns were raised about that back then. Best, Peter [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 [2] https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance [3] https://numpy.org/neps/nep-0018-array-function-protocol.html [4] https://numpy.org/neps/nep-0000.html#nep-workflow [5] https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias wrote: > > I?ve generally been on the ?let the NumPy devs worry about it? side of things, but I do agree with Ilhan that `like=` is confusing and `typeof=` would be a much more appropriate name for that parameter. > > I do think library writers are NumPy users and so I wouldn?t really make that distinction, though. Users writing their own analysis code could very well be interested in writing code using numpy functions that will transparently work when the input is a CuPy array or whatever. > > I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. > > Food for thought. > > Juan. > > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: > > For what is worth, as a potential consumer in SciPy, it really doesn't say anything (both in NEP and the PR) about how the regular users of NumPy will benefit from this. If only and only 3rd parties are going to benefit from it, I am not sure adding a new keyword to an already confusing function is the right thing to do. > > Let me clarify, > > - This is already a very (I mean extremely very) easy keyword name to confuse with ones_like, zeros_like and by its nature any other interpretation. It is not signalling anything about the functionality that is being discussed. I would seriously consider reserving such obvious names for really obvious tasks. 
Because you would also expect the shape and ndim would be mimicked by the "like"d argument but it turns out it is acting more like "typeof=" and not "like=" at all. Because if we follow the semantics it reads as "make your argument asarray like the other thing" but it is actually doing, "make your argument an array with the other thing's type" which might not be an array after all. > > - Again, if this is meant for downstream libraries (because that's what I got out of the PR discussion, cupy, dask, and JAX were the only examples I could read) then hiding it in another function and writing with capital letters "this is not meant for numpy users" would be a much more convenient way to separate the target audience and regular users. numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may be would be quite clean and to the point with no ambiguous keywords. > > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. > > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. > > best, > ilhan > > > > > > > > On Tue, Aug 11, 2020 at 12:18 AM Ralf Gommers wrote: >> >> >> >> On Mon, Aug 10, 2020 at 8:37 PM Sebastian Berg wrote: >>> >>> On Mon, 2020-08-10 at 17:35 +0200, Hameer Abbasi wrote: >>> > Hi, >>> > >>> > We should have a higher-bandwidth meeting/communication for all >>> > stakeholders, and particularly some library authors, to see what >>> > would be good for them. >> >> >> I'm not sure that helps. At this point there's little progress since the last meeting, I think the plan is unchanged: we need implementations of all the options on offer, and then try them out in PRs for scikit-learn, SciPy and perhaps another package who's maintainers are interested, to test like=, __array_module__ in realistic situations. >> >> >>> > >>> > We should definitely have language in the NEP that says it won?t be >>> > in a release unless the NEP is accepted. >>> >>> In that case, I think the important part is to have language right now >>> in the implementation, although that can refer to the NEP itself of >>> course. >>> You can't expect everyone who may be tempted to use it to actually read >>> the NEP draft, at least not without pointing it out. >> >> >> Agreed, I think the decision is on this list not in the NEP, and to make sure we won't forget we need an issue opened with the 1.20 milestone. >> >> Cheers, >> Ralf >> >>> >>> I will say that I think it is not very high risk, because I think >>> annoying or not, the argument could be deprecated again with a >>> transition short phase. Admittedly, that argument only works if we have >>> a replacement solution. 
>>> >>> Cheers, >>> >>> Sebastian >>> >>> >>> > >>> > Best regards, >>> > Hameer Abbasi >>> > >>> > -- >>> > Sent from Canary (https://canarymail.io) >>> > >>> > > On Monday, Aug 10, 2020 at 5:31 PM, Sebastian Berg < >>> > > sebastian at sipsolutions.net (mailto:sebastian at sipsolutions.net)> >>> > > wrote: >>> > > Hi all, >>> > > >>> > > as a heads up that Peter Entschev has a PR open to add `like=` to >>> > > most array creation functions, my current plan is to merge it soon >>> > > as a preliminary API and bring it up again before the actual >>> > > release (in a few months). This allows overriding for array-likes, >>> > > e.g. it will allow: >>> > > >>> > > >>> > > arr = np.asarray([3], like=dask_array) >>> > > type(arr) is dask.array.Array >>> > > >>> > > This was proposed in NEP 35: >>> > > >>> > > https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html >>> > > >>> > > Although that has not been accepted as of now, the PR is: >>> > > >>> > > https://github.com/numpy/numpy/pull/16935 >>> > > >>> > > >>> > > This was discussed in a smaller group, and is an attempt to see how >>> > > we >>> > > can make the array-function protocol viable to allow packages such >>> > > as >>> > > sklearn to work with non-NumPy arrays. >>> > > >>> > > As of now, this would be experimental and can revisit it before the >>> > > actual NumPy release. We should probably discuss accepting NEP 35 >>> > > more. At this time, I hope that we can put in the functionality to >>> > > facilitate this discussion, rather the other way around. >>> > > >>> > > If anyone feels nervous about this step, I would be happy to >>> > > document >>> > > that we will not include it in the next release unless the NEP is >>> > > accepted first, or at least hide it behind an environment variable. >>> > > >>> > > Cheers, >>> > > >>> > > Sebastian >>> > > >>> > > _______________________________________________ >>> > > NumPy-Discussion mailing list >>> > > NumPy-Discussion at python.org >>> > > https://mail.python.org/mailman/listinfo/numpy-discussion >>> > >>> > _______________________________________________ >>> > NumPy-Discussion mailing list >>> > NumPy-Discussion at python.org >>> > https://mail.python.org/mailman/listinfo/numpy-discussion >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From ralf.gommers at gmail.com Thu Aug 13 08:21:56 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 13 Aug 2020 13:21:56 +0100 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: Thanks for raising these concerns Ilhan and Juan, and for answering Peter. Let me give my perspective as well. 
To start with, this is not specifically about Peter's NEP and PR. NEP 35 simply follows the pattern set by previous PRs, and given its tight scope is less difficult to understand than other NEPs on such technical topics. Peter has done a lot of things right, and is close to the finish line. On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev wrote: > > > I think, arriving to an agreement would be much faster if there is an > executive summary of who this is intended for and what the regular usage > is. Because with no offense, all I see is "dispatch", "_array_function_" > and a lot of technical details of which I am absolutely ignorant. > > This is what I intended to do in the Usage Guidance [2] section. Could > you elaborate on what more information you'd want to see there? Or is > it just a matter of reorganizing the NEP a bit to try and summarize > such things right at the top? > We adapted the NEP template [6] several times last year to try and improve this. And specified in there as well that NEP content set to the mailing list should only contain the sections: Abstract, Motivation and Scope, Usage and Impact, and Backwards compatibility. This to ensure we fully understand the "why" and "what" before the "how". Unfortunately that template and procedure hasn't been exercised much yet, only in NEP 38 [7] and partially in NEP 41 [8]. If we have long-time maintainers of SciPy (Ilhan and myself), scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't understand the goals, relevance, target audience, or how they're supposed to use a new feature, that indicates that the people doing the writing and having the discussion are doing something wrong at a very fundamental level. At this point I'm pretty disappointed in and tired of how we write and discuss NEPs on technical topics like dispatching, dtypes and the like. People literally refuse to write down concrete motivations, goals and non-goals, code that's problematic now and will be better/working post-NEP and usage examples before launching into extensive discussion of the gory details of the internals. I'm not sure what to do about it. Completely separate API and behavior proposals from implementation proposals? Make separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo on the API team which then needs to approve every API change in new NEPs? Offer to co-write NEPs if someone is willing but doesn't understand how to go about it? Keep the current structure/process but veto further approvals until NEP authors get it right? I want to make an exception for merging the current NEP, for which the plan is to merge it as experimental to try in downstream PRs and get more experience. That does mean that master will be in an unreleasable state by the way, which is unusual and it'd be nice to get Chuck's explicit OK for that. But after that, I think we need a change here. I would like to hear what everyone thinks is the shape that change should take - any of my above suggestions, or something else? > > Finally as a minor point, I know we are mostly (ex-)academics but this > necessity of formal language on NEPs is self-imposed (probably PEPs are to > blame) and not quite helping. It can be a bit more descriptive in my > external opinion. > > TBH, I don't really know how to solve that point, so if you have any > specific suggestions, that's certainly welcome. 
I understand the > frustration for a reader trying to understand all the details, with > many being only described in NEP-18 [3], but we also strive to avoid > rewriting things that are written elsewhere, which would also > overburden those who are aware of what's being discussed. > > > > I also share Ilhan?s concern (and I mentioned this in a previous NEP > discussion) that NEPs are getting pretty inaccessible. In a sense these are > difficult topics and readers should be expected to have *some* familiarity > with the topics being discussed, but perhaps more effort should be put into > the context/motivation/background of a NEP before accepting it. One way to > ensure this might be to require a final proofreading step by someone who > has not been involved at all in the discussions, like peer review does for > papers. > Some variant of this proposal would be my preference. Cheers, Ralf > [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 > [2] > https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance > [3] https://numpy.org/neps/nep-0018-array-function-protocol.html > [4] https://numpy.org/neps/nep-0000.html#nep-workflow > [5] > https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst [7] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst [8] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst > > > On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias > wrote: > > > > I?ve generally been on the ?let the NumPy devs worry about it? side of > things, but I do agree with Ilhan that `like=` is confusing and `typeof=` > would be a much more appropriate name for that parameter. > > > > I do think library writers are NumPy users and so I wouldn?t really make > that distinction, though. Users writing their own analysis code could very > well be interested in writing code using numpy functions that will > transparently work when the input is a CuPy array or whatever. > > > > I also share Ilhan?s concern (and I mentioned this in a previous NEP > discussion) that NEPs are getting pretty inaccessible. In a sense these are > difficult topics and readers should be expected to have *some* familiarity > with the topics being discussed, but perhaps more effort should be put into > the context/motivation/background of a NEP before accepting it. One way to > ensure this might be to require a final proofreading step by someone who > has not been involved at all in the discussions, like peer review does for > papers. > > > > Food for thought. > > > > Juan. > > > > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: > > > > For what is worth, as a potential consumer in SciPy, it really doesn't > say anything (both in NEP and the PR) about how the regular users of NumPy > will benefit from this. If only and only 3rd parties are going to benefit > from it, I am not sure adding a new keyword to an already confusing > function is the right thing to do. > > > > Let me clarify, > > > > - This is already a very (I mean extremely very) easy keyword name to > confuse with ones_like, zeros_like and by its nature any other > interpretation. It is not signalling anything about the functionality that > is being discussed. I would seriously consider reserving such obvious names > for really obvious tasks. 
Because you would also expect the shape and ndim > would be mimicked by the "like"d argument but it turns out it is acting > more like "typeof=" and not "like=" at all. Because if we follow the > semantics it reads as "make your argument asarray like the other thing" but > it is actually doing, "make your argument an array with the other thing's > type" which might not be an array after all. > > > > - Again, if this is meant for downstream libraries (because that's what > I got out of the PR discussion, cupy, dask, and JAX were the only examples > I could read) then hiding it in another function and writing with capital > letters "this is not meant for numpy users" would be a much more convenient > way to separate the target audience and regular users. > numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may > be would be quite clean and to the point with no ambiguous keywords. > > > > I think, arriving to an agreement would be much faster if there is an > executive summary of who this is intended for and what the regular usage > is. Because with no offense, all I see is "dispatch", "_array_function_" > and a lot of technical details of which I am absolutely ignorant. > > > > Finally as a minor point, I know we are mostly (ex-)academics but this > necessity of formal language on NEPs is self-imposed (probably PEPs are to > blame) and not quite helping. It can be a bit more descriptive in my > external opinion. > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ilhanpolat at gmail.com Thu Aug 13 09:43:56 2020 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Thu, 13 Aug 2020 15:43:56 +0200 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: To maybe lighten up the discussion a bit and to make my outsider confusion more tangible, let me start by apologizing for diving head first without weighing the past luggage :-) I always forget how much effort goes into these things and for outsiders like me, it's a matter of dipping the finger and tasting it just before starting to complain how much salt is missing etc. What I was mentioning about NEPs wasn't only related specifically to this one by the way. It's the generic feeling that I have. First let me start what I mean by NumPy users and downstreamers distinction. This is very much related to how data-science and huge-array users are magnetizing every tool out there in the Python world which is fine though the majority of number-crunchers have nothing to do with any of GPU/Parallelism/ClusterUsage etc. Hence when I mention NumPy users, think of people who use NumPy as its own right with no duck-typing and nothing related to subclassing. Just straightforward array creation and lots of ops on these arrays. For those people (I'm one of them), this option brings in a keyword that we would never use. And it gets into many major functions (linspace and others mentioned somewhere). So it has a very appealing name but has nothing to do with me in an already very crowded namespace and keyword catalogue. That's basically a UX issue to be addressed (under the assumption that users like me are the majority). Either making its name as esoteric as possible so I naturally stay away from it or I don't see it. 
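To make that concrete with a tiny sketch (the dask part is only my reading of the proposal, borrowing Sebastian's earlier example, so treat it as illustrative rather than the definitive behaviour):

    import numpy as np

    # The plain NumPy user: array creation as always, `like=` never enters the picture.
    x = np.linspace(0.0, 1.0, 101)
    y = np.zeros((3, 3))

    # The duck-array user that NEP 35 targets (hypothetical here, this is what the PR would enable):
    # import dask.array as da
    # d = da.ones(10)
    # arr = np.asarray([3], like=d)  # would come back as a dask.array.Array, not an ndarray

So the first group only sees one more keyword in already crowded signatures, which is the UX point I am trying to make.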
This has absolutely nothing to do with looking down on the downstream libraries. They are flat-out amazing and the more we can support them the merrier. Using yet another metaphor, I was hoping that NumPy would have a loading dock for heavy duty deliveries for downstream projects or specialized array creations and won't disturb the regular customer entrance. Because if I look at this page https://numpy.org/doc/stable/referenc/routines.array-creation.html, there are a lot of functions and I think most of them are candidates to gain this keyword. I wish I can comment on a viable alternative but I really cannot understand the _array_xxxx_ discussions since they fly way over my head no matter how many times I tried. So that's why I naively mentioned the "np.astypedarray" or "np.asarray_but_not_numpy_array" or whatever. Now I see that it is even more complicated and I generated extra noise. So you can just ignore my previous suggestions. Except that I want to draw attention to the UX problem and I'd like to leave it at that. The other point is about the NEP stuff. I think I need to elaborate. If the NEPs are meant for internal NumPy discussions, then by all means, crank up the pointer*-meter to 11 and dive into it, totally fine with me. But if you also want to get feedback from outside, then probably a few lines of code examples for mere mortals would go a long way. Also it would make the discussion much more streamlined in my humble opinion. What I was trying to get at was that almost all NEPs read like a legal document that I want to agree as soon as possible. Because they often come without any or minimal amount of code in it. In NEP35 for example, there are nice code blocks in function dispatching but I guess it's not meant for me. Because it is only decorating asarray with some black magic happening there somehow (I guess). So I can't even comprehend what the proposition would mean for the regular, friendly, anti-duck users. But I am pretty sure it is about dispatching something because the word is repeated ~20 times :-) Thus the feedback would be limited. That was also what I meant there. But again I totally understand the complexity of these issues. So I'm not expecting to understand all details of NumPy machinery in a single NEP. But anyways, hope this clarifies a few things that I failed to convey in my previous mail. ilhan On Thu, Aug 13, 2020 at 2:23 PM Ralf Gommers wrote: > Thanks for raising these concerns Ilhan and Juan, and for answering Peter. > Let me give my perspective as well. > > To start with, this is not specifically about Peter's NEP and PR. NEP 35 > simply follows the pattern set by previous PRs, and given its tight scope > is less difficult to understand than other NEPs on such technical topics. > Peter has done a lot of things right, and is close to the finish line. > > > On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev < > peter at entschev.com> wrote: > >> >> > I think, arriving to an agreement would be much faster if there is an >> executive summary of who this is intended for and what the regular usage >> is. Because with no offense, all I see is "dispatch", "_array_function_" >> and a lot of technical details of which I am absolutely ignorant. >> >> This is what I intended to do in the Usage Guidance [2] section. Could >> you elaborate on what more information you'd want to see there? Or is >> it just a matter of reorganizing the NEP a bit to try and summarize >> such things right at the top? 
>> > > We adapted the NEP template [6] several times last year to try and improve > this. And specified in there as well that NEP content set to the mailing > list should only contain the sections: Abstract, Motivation and Scope, > Usage and Impact, and Backwards compatibility. This to ensure we fully > understand the "why" and "what" before the "how". Unfortunately that > template and procedure hasn't been exercised much yet, only in NEP 38 [7] > and partially in NEP 41 [8]. > > If we have long-time maintainers of SciPy (Ilhan and myself), scikit-image > (Juan) and CuPy (Leo, on the PR review) all saying they don't understand > the goals, relevance, target audience, or how they're supposed to use a new > feature, that indicates that the people doing the writing and having the > discussion are doing something wrong at a very fundamental level. > > At this point I'm pretty disappointed in and tired of how we write and > discuss NEPs on technical topics like dispatching, dtypes and the like. > People literally refuse to write down concrete motivations, goals and > non-goals, code that's problematic now and will be better/working post-NEP > and usage examples before launching into extensive discussion of the gory > details of the internals. I'm not sure what to do about it. Completely > separate API and behavior proposals from implementation proposals? Make > separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo > on the API team which then needs to approve every API change in new NEPs? > Offer to co-write NEPs if someone is willing but doesn't understand how to > go about it? Keep the current structure/process but veto further approvals > until NEP authors get it right? > > I want to make an exception for merging the current NEP, for which the > plan is to merge it as experimental to try in downstream PRs and get more > experience. That does mean that master will be in an unreleasable state by > the way, which is unusual and it'd be nice to get Chuck's explicit OK for > that. But after that, I think we need a change here. I would like to hear > what everyone thinks is the shape that change should take - any of my above > suggestions, or something else? > > > >> > Finally as a minor point, I know we are mostly (ex-)academics but this >> necessity of formal language on NEPs is self-imposed (probably PEPs are to >> blame) and not quite helping. It can be a bit more descriptive in my >> external opinion. >> >> TBH, I don't really know how to solve that point, so if you have any >> specific suggestions, that's certainly welcome. I understand the >> frustration for a reader trying to understand all the details, with >> many being only described in NEP-18 [3], but we also strive to avoid >> rewriting things that are written elsewhere, which would also >> overburden those who are aware of what's being discussed. >> >> >> > I also share Ilhan?s concern (and I mentioned this in a previous NEP >> discussion) that NEPs are getting pretty inaccessible. In a sense these are >> difficult topics and readers should be expected to have *some* familiarity >> with the topics being discussed, but perhaps more effort should be put into >> the context/motivation/background of a NEP before accepting it. One way to >> ensure this might be to require a final proofreading step by someone who >> has not been involved at all in the discussions, like peer review does for >> papers. >> > > Some variant of this proposal would be my preference. 
> > Cheers, > Ralf > > >> [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 >> [2] >> https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance >> [3] https://numpy.org/neps/nep-0018-array-function-protocol.html >> [4] https://numpy.org/neps/nep-0000.html#nep-workflow >> [5] >> https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html > > > [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst > [7] > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst > [8] > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst > > > >> >> >> On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias >> wrote: >> > >> > I?ve generally been on the ?let the NumPy devs worry about it? side of >> things, but I do agree with Ilhan that `like=` is confusing and `typeof=` >> would be a much more appropriate name for that parameter. >> > >> > I do think library writers are NumPy users and so I wouldn?t really >> make that distinction, though. Users writing their own analysis code could >> very well be interested in writing code using numpy functions that will >> transparently work when the input is a CuPy array or whatever. >> > >> > I also share Ilhan?s concern (and I mentioned this in a previous NEP >> discussion) that NEPs are getting pretty inaccessible. In a sense these are >> difficult topics and readers should be expected to have *some* familiarity >> with the topics being discussed, but perhaps more effort should be put into >> the context/motivation/background of a NEP before accepting it. One way to >> ensure this might be to require a final proofreading step by someone who >> has not been involved at all in the discussions, like peer review does for >> papers. >> > >> > Food for thought. >> > >> > Juan. >> > >> > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: >> > >> > For what is worth, as a potential consumer in SciPy, it really doesn't >> say anything (both in NEP and the PR) about how the regular users of NumPy >> will benefit from this. If only and only 3rd parties are going to benefit >> from it, I am not sure adding a new keyword to an already confusing >> function is the right thing to do. >> > >> > Let me clarify, >> > >> > - This is already a very (I mean extremely very) easy keyword name to >> confuse with ones_like, zeros_like and by its nature any other >> interpretation. It is not signalling anything about the functionality that >> is being discussed. I would seriously consider reserving such obvious names >> for really obvious tasks. Because you would also expect the shape and ndim >> would be mimicked by the "like"d argument but it turns out it is acting >> more like "typeof=" and not "like=" at all. Because if we follow the >> semantics it reads as "make your argument asarray like the other thing" but >> it is actually doing, "make your argument an array with the other thing's >> type" which might not be an array after all. >> > >> > - Again, if this is meant for downstream libraries (because that's what >> I got out of the PR discussion, cupy, dask, and JAX were the only examples >> I could read) then hiding it in another function and writing with capital >> letters "this is not meant for numpy users" would be a much more convenient >> way to separate the target audience and regular users. 
>> numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may >> be would be quite clean and to the point with no ambiguous keywords. >> > >> > I think, arriving to an agreement would be much faster if there is an >> executive summary of who this is intended for and what the regular usage >> is. Because with no offense, all I see is "dispatch", "_array_function_" >> and a lot of technical details of which I am absolutely ignorant. >> > >> > Finally as a minor point, I know we are mostly (ex-)academics but this >> necessity of formal language on NEPs is self-imposed (probably PEPs are to >> blame) and not quite helping. It can be a bit more descriptive in my >> external opinion. >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From peter at entschev.com Thu Aug 13 09:47:02 2020 From: peter at entschev.com (Peter Andreas Entschev) Date: Thu, 13 Aug 2020 15:47:02 +0200 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: > We adapted the NEP template [6] several times last year to try and improve this. And specified in there as well that NEP content set to the mailing list should only contain the sections: Abstract, Motivation and Scope, Usage and Impact, and Backwards compatibility. This to ensure we fully understand the "why" and "what" before the "how". Unfortunately that template and procedure hasn't been exercised much yet, only in NEP 38 [7] and partially in NEP 41 [8]. > > If we have long-time maintainers of SciPy (Ilhan and myself), scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't understand the goals, relevance, target audience, or how they're supposed to use a new feature, that indicates that the people doing the writing and having the discussion are doing something wrong at a very fundamental level. I'm more than happy to edit the NEP and try to clarify all the concerns. However, it gets pretty difficult to do so when I as an author don't understand where the difficulty is. Ilhan, Juan and Ralf now pointed out things that are missing/unclear, but no comment was made in that regard when I sent the NEP, my point being: I couldn't fix what I didn't know was a problem to others. > At this point I'm pretty disappointed in and tired of how we write and discuss NEPs on technical topics like dispatching, dtypes and the like. People literally refuse to write down concrete motivations, goals and non-goals, code that's problematic now and will be better/working post-NEP and usage examples before launching into extensive discussion of the gory details of the internals. I'm not sure what to do about it. Honestly, I don't really understand this. From my perspective, there are two ways to deal with such things: 1. Templates are to be taken mainly as _guidelines_ rather than _hardlines_, and the current text of NEP-35 definitely falls in the first category; 2. Templates are _hardlines_ and to be guided/enforced by maintainers at some point (maybe before merging the PR?). 
If 2 is the desired case for NumPy, which sounds a lot like what is wanted from NEP-35 and other NEPs generally, maintainers should let the authors know as early as possible that something isn't following the template's hardlines and it should be corrected. I don't mean any of this to remove myself of any responsibility, but would like to express my frustration that a 10 month-old NEP is only now getting so much pushback for being unclear after its implementation is nearing completion. > I want to make an exception for merging the current NEP, for which the plan is to merge it as experimental to try in downstream PRs and get more experience. That does mean that master will be in an unreleasable state by the way, which is unusual and it'd be nice to get Chuck's explicit OK for that. I don't quite understand this either, why would that leave master in an unreleasable state? Best, Peter On Thu, Aug 13, 2020 at 2:21 PM Ralf Gommers wrote: > > Thanks for raising these concerns Ilhan and Juan, and for answering Peter. Let me give my perspective as well. > > To start with, this is not specifically about Peter's NEP and PR. NEP 35 simply follows the pattern set by previous PRs, and given its tight scope is less difficult to understand than other NEPs on such technical topics. Peter has done a lot of things right, and is close to the finish line. > > > On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev wrote: >> >> >> > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. >> >> This is what I intended to do in the Usage Guidance [2] section. Could >> you elaborate on what more information you'd want to see there? Or is >> it just a matter of reorganizing the NEP a bit to try and summarize >> such things right at the top? > > > We adapted the NEP template [6] several times last year to try and improve this. And specified in there as well that NEP content set to the mailing list should only contain the sections: Abstract, Motivation and Scope, Usage and Impact, and Backwards compatibility. This to ensure we fully understand the "why" and "what" before the "how". Unfortunately that template and procedure hasn't been exercised much yet, only in NEP 38 [7] and partially in NEP 41 [8]. > > If we have long-time maintainers of SciPy (Ilhan and myself), scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't understand the goals, relevance, target audience, or how they're supposed to use a new feature, that indicates that the people doing the writing and having the discussion are doing something wrong at a very fundamental level. > > At this point I'm pretty disappointed in and tired of how we write and discuss NEPs on technical topics like dispatching, dtypes and the like. People literally refuse to write down concrete motivations, goals and non-goals, code that's problematic now and will be better/working post-NEP and usage examples before launching into extensive discussion of the gory details of the internals. I'm not sure what to do about it. Completely separate API and behavior proposals from implementation proposals? Make separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo on the API team which then needs to approve every API change in new NEPs? 
Offer to co-write NEPs if someone is willing but doesn't understand how to go about it? Keep the current structure/process but veto further approvals until NEP authors get it right? > > I want to make an exception for merging the current NEP, for which the plan is to merge it as experimental to try in downstream PRs and get more experience. That does mean that master will be in an unreleasable state by the way, which is unusual and it'd be nice to get Chuck's explicit OK for that. But after that, I think we need a change here. I would like to hear what everyone thinks is the shape that change should take - any of my above suggestions, or something else? > > >> >> > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. >> >> TBH, I don't really know how to solve that point, so if you have any >> specific suggestions, that's certainly welcome. I understand the >> frustration for a reader trying to understand all the details, with >> many being only described in NEP-18 [3], but we also strive to avoid >> rewriting things that are written elsewhere, which would also >> overburden those who are aware of what's being discussed. >> >> >> > I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. > > > Some variant of this proposal would be my preference. > > Cheers, > Ralf > >> >> [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 >> [2] https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance >> [3] https://numpy.org/neps/nep-0018-array-function-protocol.html >> [4] https://numpy.org/neps/nep-0000.html#nep-workflow >> [5] https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html > > > [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst > [7] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst > [8] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst > > >> >> >> >> On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias wrote: >> > >> > I?ve generally been on the ?let the NumPy devs worry about it? side of things, but I do agree with Ilhan that `like=` is confusing and `typeof=` would be a much more appropriate name for that parameter. >> > >> > I do think library writers are NumPy users and so I wouldn?t really make that distinction, though. Users writing their own analysis code could very well be interested in writing code using numpy functions that will transparently work when the input is a CuPy array or whatever. >> > >> > I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. 
One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. >> > >> > Food for thought. >> > >> > Juan. >> > >> > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: >> > >> > For what is worth, as a potential consumer in SciPy, it really doesn't say anything (both in NEP and the PR) about how the regular users of NumPy will benefit from this. If only and only 3rd parties are going to benefit from it, I am not sure adding a new keyword to an already confusing function is the right thing to do. >> > >> > Let me clarify, >> > >> > - This is already a very (I mean extremely very) easy keyword name to confuse with ones_like, zeros_like and by its nature any other interpretation. It is not signalling anything about the functionality that is being discussed. I would seriously consider reserving such obvious names for really obvious tasks. Because you would also expect the shape and ndim would be mimicked by the "like"d argument but it turns out it is acting more like "typeof=" and not "like=" at all. Because if we follow the semantics it reads as "make your argument asarray like the other thing" but it is actually doing, "make your argument an array with the other thing's type" which might not be an array after all. >> > >> > - Again, if this is meant for downstream libraries (because that's what I got out of the PR discussion, cupy, dask, and JAX were the only examples I could read) then hiding it in another function and writing with capital letters "this is not meant for numpy users" would be a much more convenient way to separate the target audience and regular users. numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may be would be quite clean and to the point with no ambiguous keywords. >> > >> > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. >> > >> > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From ralf.gommers at gmail.com Thu Aug 13 10:13:07 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 13 Aug 2020 15:13:07 +0100 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: On Thu, Aug 13, 2020 at 2:47 PM Peter Andreas Entschev wrote: > > We adapted the NEP template [6] several times last year to try and > improve this. And specified in there as well that NEP content set to the > mailing list should only contain the sections: Abstract, Motivation and > Scope, Usage and Impact, and Backwards compatibility. This to ensure we > fully understand the "why" and "what" before the "how". Unfortunately that > template and procedure hasn't been exercised much yet, only in NEP 38 [7] > and partially in NEP 41 [8]. 
> > > > If we have long-time maintainers of SciPy (Ilhan and myself), > scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't > understand the goals, relevance, target audience, or how they're supposed > to use a new feature, that indicates that the people doing the writing and > having the discussion are doing something wrong at a very fundamental level. > > I'm more than happy to edit the NEP and try to clarify all the > concerns. Thanks Peter. Let me reiterate, you did a lot of things right, have been happy to adapt when given feedback, and your willingness to go back and fix things up now is much appreciated (and I'm happy to help). No criticism of your work or attitude intended, on the contrary. > However, it gets pretty difficult to do so when I as an > author don't understand where the difficulty is. Ilhan, Juan and Ralf > now pointed out things that are missing/unclear, but no comment was > made in that regard when I sent the NEP, my point being: I couldn't > fix what I didn't know was a problem to others. > Yes of course, I totally understand that. > > At this point I'm pretty disappointed in and tired of how we write and > discuss NEPs on technical topics like dispatching, dtypes and the like. > People literally refuse to write down concrete motivations, goals and > non-goals, code that's problematic now and will be better/working post-NEP > and usage examples before launching into extensive discussion of the gory > details of the internals. I'm not sure what to do about it. > > Honestly, I don't really understand this. From my perspective, there > are two ways to deal with such things: > > 1. Templates are to be taken mainly as _guidelines_ rather than > _hardlines_, and the current text of NEP-35 definitely falls in the > first category; > 2. Templates are _hardlines_ and to be guided/enforced by maintainers > at some point (maybe before merging the PR?). > > If 2 is the desired case for NumPy, which sounds a lot like what is > wanted from NEP-35 and other NEPs generally, maintainers should let > the authors know as early as possible that something isn't following > the template's hardlines and it should be corrected. Yes agreed, maintainers should do this. It was always meant as something in between, "please follow but deviate if needed". If essential elements are missing, I think that should be flagged earlier going forward. As a concrete example: Stephan (the main author of __array_function__) was still fuzzy on the functions covered and whether it solves array coercion, in the last 24 hours*. You answered by pointing to concrete code in Dask and Xarray. That code, why it doesn't work well now but will work with like=, should be at the top of the NEP as concrete problem statement / code examples. It's quite unfortunate that no maintainer explicitly requested this many months ago. * https://github.com/numpy/numpy/pull/16935#issuecomment-673379038 I don't mean any of this to remove myself of any responsibility, but would > like to > express my frustration that a 10 month-old NEP is only now getting so > much pushback for being unclear after its implementation is nearing > completion. > Totally understandable. I think part of the problem is that people only weigh in when they see concrete "this part is for you, and here's how you use it to solve problem X". As for me personally, if I'm saying things now that I didn't manage to respond to earlier (specific to your NEP), I apologize. 
10 months ago I was in the middle of an intercontinental move and a new-ish job getting busier fast. Again, apologies and no criticism of your work. > > > I want to make an exception for merging the current NEP, for which the > plan is to merge it as experimental to try in downstream PRs and get more > experience. That does mean that master will be in an unreleasable state by > the way, which is unusual and it'd be nice to get Chuck's explicit OK for > that. > > I don't quite understand this either, why would that leave master in > an unreleasable state? > That's what Sebastian proposed yesterday: let's merge right now, open issues for all the things being brought up right now, and deal with them pre-1.20-release. I'm saying I'm fine with that, but then we actually need to go back and finalize the discussions before the next release. Cheers, Ralf > Best, > Peter > > On Thu, Aug 13, 2020 at 2:21 PM Ralf Gommers > wrote: > > > > Thanks for raising these concerns Ilhan and Juan, and for answering > Peter. Let me give my perspective as well. > > > > To start with, this is not specifically about Peter's NEP and PR. NEP 35 > simply follows the pattern set by previous PRs, and given its tight scope > is less difficult to understand than other NEPs on such technical topics. > Peter has done a lot of things right, and is close to the finish line. > > > > > > On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev < > peter at entschev.com> wrote: > >> > >> > >> > I think, arriving to an agreement would be much faster if there is an > executive summary of who this is intended for and what the regular usage > is. Because with no offense, all I see is "dispatch", "_array_function_" > and a lot of technical details of which I am absolutely ignorant. > >> > >> This is what I intended to do in the Usage Guidance [2] section. Could > >> you elaborate on what more information you'd want to see there? Or is > >> it just a matter of reorganizing the NEP a bit to try and summarize > >> such things right at the top? > > > > > > We adapted the NEP template [6] several times last year to try and > improve this. And specified in there as well that NEP content set to the > mailing list should only contain the sections: Abstract, Motivation and > Scope, Usage and Impact, and Backwards compatibility. This to ensure we > fully understand the "why" and "what" before the "how". Unfortunately that > template and procedure hasn't been exercised much yet, only in NEP 38 [7] > and partially in NEP 41 [8]. > > > > If we have long-time maintainers of SciPy (Ilhan and myself), > scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't > understand the goals, relevance, target audience, or how they're supposed > to use a new feature, that indicates that the people doing the writing and > having the discussion are doing something wrong at a very fundamental level. > > > > At this point I'm pretty disappointed in and tired of how we write and > discuss NEPs on technical topics like dispatching, dtypes and the like. > People literally refuse to write down concrete motivations, goals and > non-goals, code that's problematic now and will be better/working post-NEP > and usage examples before launching into extensive discussion of the gory > details of the internals. I'm not sure what to do about it. Completely > separate API and behavior proposals from implementation proposals? 
Make > separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo > on the API team which then needs to approve every API change in new NEPs? > Offer to co-write NEPs if someone is willing but doesn't understand how to > go about it? Keep the current structure/process but veto further approvals > until NEP authors get it right? > > > > I want to make an exception for merging the current NEP, for which the > plan is to merge it as experimental to try in downstream PRs and get more > experience. That does mean that master will be in an unreleasable state by > the way, which is unusual and it'd be nice to get Chuck's explicit OK for > that. But after that, I think we need a change here. I would like to hear > what everyone thinks is the shape that change should take - any of my above > suggestions, or something else? > > > > > >> > >> > Finally as a minor point, I know we are mostly (ex-)academics but > this necessity of formal language on NEPs is self-imposed (probably PEPs > are to blame) and not quite helping. It can be a bit more descriptive in my > external opinion. > >> > >> TBH, I don't really know how to solve that point, so if you have any > >> specific suggestions, that's certainly welcome. I understand the > >> frustration for a reader trying to understand all the details, with > >> many being only described in NEP-18 [3], but we also strive to avoid > >> rewriting things that are written elsewhere, which would also > >> overburden those who are aware of what's being discussed. > >> > >> > >> > I also share Ilhan?s concern (and I mentioned this in a previous NEP > discussion) that NEPs are getting pretty inaccessible. In a sense these are > difficult topics and readers should be expected to have *some* familiarity > with the topics being discussed, but perhaps more effort should be put into > the context/motivation/background of a NEP before accepting it. One way to > ensure this might be to require a final proofreading step by someone who > has not been involved at all in the discussions, like peer review does for > papers. > > > > > > Some variant of this proposal would be my preference. > > > > Cheers, > > Ralf > > > >> > >> [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 > >> [2] > https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance > >> [3] https://numpy.org/neps/nep-0018-array-function-protocol.html > >> [4] https://numpy.org/neps/nep-0000.html#nep-workflow > >> [5] > https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html > > > > > > [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst > > [7] > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst > > [8] > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst > > > > > >> > >> > >> > >> On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias > wrote: > >> > > >> > I?ve generally been on the ?let the NumPy devs worry about it? side > of things, but I do agree with Ilhan that `like=` is confusing and > `typeof=` would be a much more appropriate name for that parameter. > >> > > >> > I do think library writers are NumPy users and so I wouldn?t really > make that distinction, though. Users writing their own analysis code could > very well be interested in writing code using numpy functions that will > transparently work when the input is a CuPy array or whatever. 
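Concretely, the kind of end-user code meant there might look something like this. A sketch only: `rescale` is a made-up name, and it assumes NumPy with the experimental `like=` plus an input from a library such as CuPy that implements `__array_function__`:

    import numpy as np

    def rescale(x):
        # x can be a numpy.ndarray, a cupy.ndarray, ... (1-D for simplicity).
        # Building the grid with like=x keeps the result on x's backend
        # instead of silently coercing everything back to a NumPy array.
        grid = np.linspace(0.0, 1.0, num=x.shape[0], like=x)
        return (x - x.min()) * grid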
> >> > > >> > I also share Ilhan?s concern (and I mentioned this in a previous NEP > discussion) that NEPs are getting pretty inaccessible. In a sense these are > difficult topics and readers should be expected to have *some* familiarity > with the topics being discussed, but perhaps more effort should be put into > the context/motivation/background of a NEP before accepting it. One way to > ensure this might be to require a final proofreading step by someone who > has not been involved at all in the discussions, like peer review does for > papers. > >> > > >> > Food for thought. > >> > > >> > Juan. > >> > > >> > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: > >> > > >> > For what is worth, as a potential consumer in SciPy, it really > doesn't say anything (both in NEP and the PR) about how the regular users > of NumPy will benefit from this. If only and only 3rd parties are going to > benefit from it, I am not sure adding a new keyword to an already confusing > function is the right thing to do. > >> > > >> > Let me clarify, > >> > > >> > - This is already a very (I mean extremely very) easy keyword name to > confuse with ones_like, zeros_like and by its nature any other > interpretation. It is not signalling anything about the functionality that > is being discussed. I would seriously consider reserving such obvious names > for really obvious tasks. Because you would also expect the shape and ndim > would be mimicked by the "like"d argument but it turns out it is acting > more like "typeof=" and not "like=" at all. Because if we follow the > semantics it reads as "make your argument asarray like the other thing" but > it is actually doing, "make your argument an array with the other thing's > type" which might not be an array after all. > >> > > >> > - Again, if this is meant for downstream libraries (because that's > what I got out of the PR discussion, cupy, dask, and JAX were the only > examples I could read) then hiding it in another function and writing with > capital letters "this is not meant for numpy users" would be a much more > convenient way to separate the target audience and regular users. > numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may > be would be quite clean and to the point with no ambiguous keywords. > >> > > >> > I think, arriving to an agreement would be much faster if there is an > executive summary of who this is intended for and what the regular usage > is. Because with no offense, all I see is "dispatch", "_array_function_" > and a lot of technical details of which I am absolutely ignorant. > >> > > >> > Finally as a minor point, I know we are mostly (ex-)academics but > this necessity of formal language on NEPs is self-imposed (probably PEPs > are to blame) and not quite helping. It can be a bit more descriptive in my > external opinion. > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From peter at entschev.com Thu Aug 13 10:14:15 2020 From: peter at entschev.com (Peter Andreas Entschev) Date: Thu, 13 Aug 2020 16:14:15 +0200 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: Ilhan, Thanks, that does clarify things. I think the main point -- and correct me here if I'm still wrong -- is that we want the NEP to have some very clear example of when/why/how to use it, preferably as early in the text as possible, maybe just below the Abstract, in a Motivation and Scope section, as the NEP Template [6] pointed out by Ralf earlier suggests. That is a totally valid ask, and I'll try to address it as soon as possible (hopefully today or tomorrow). To the point of whether NEPs are to be read by users, I normally don't expect users to be required to read and understand those NEPs other than by pure curiosity. If we need them to do so, then there's definitely a big problem in the API. This may sound counterintuitive with what I said before about the "like=" name, but that's really the piece of the NumPy API that I, with a somewhat reasonable understanding of arrays, don't quite get or like: for instance "asarray" and "like" sound like exactly the same thing, but they're not in the NumPy context, and on the other hand it's quite difficult to find a reasonable name to clarify that. And once more, I do like the "typeof=" suggestion more than "like=" to be perfectly honest, I'm just afraid it could be mistaken for the "dtype=" keyword somehow and thus still not solve the clarity problem. Going back to users reading NEPs or not, I would really expect that the docstring from the function is sufficiently clear to keep users off of it, but still give them an understanding of why that exists. The current docstring is in [9]; please do comment on it if you have ideas of how to make it more accessible to users. You also mentioned you'd like the name to be as esoteric as possible, do you have any suggestions for an esoteric name that is hopefully unambiguous too? Naming has definitely been very much on the table since the NEP was written, but the consensus was more that "like=" is reasonably similar enough in both application and the name itself to "empty_like" and derived functions, that's why we just stuck to it. Best, Peter [9] https://github.com/numpy/numpy/pull/16935/files#diff-e5969453e399f2d32519d305b2582da9R16-R22 On Thu, Aug 13, 2020 at 3:43 PM Ilhan Polat wrote: > > To maybe lighten up the discussion a bit and to make my outsider confusion more tangible, let me start by apologizing for diving head first without weighing the past luggage :-) I always forget how much effort goes into these things and for outsiders like me, it's a matter of dipping the finger and tasting it just before starting to complain how much salt is missing etc. What I was mentioning about NEPs wasn't only related specifically to this one by the way. It's the generic feeling that I have. > > First let me start what I mean by NumPy users and downstreamers distinction. This is very much related to how data-science and huge-array users are magnetizing every tool out there in the Python world which is fine though the majority of number-crunchers have nothing to do with any of GPU/Parallelism/ClusterUsage etc. 
Hence when I mention NumPy users, think of people who use NumPy as its own right with no duck-typing and nothing related to subclassing. Just straightforward array creation and lots of ops on these arrays. For those people (I'm one of them), this option brings in a keyword that we would never use. And it gets into many major functions (linspace and others mentioned somewhere). So it has a very appealing name but has nothing to do with me in an already very crowded namespace and keyword catalogue. That's basically a UX issue to be addressed (under the assumption that users like me are the majority). Either making its name as esoteric as possible so I naturally stay away from it or I don't see it. This has absolutely nothing to do with looking down on the downstream libraries. They are flat-out amazing and the more we can support them the merrier. > > Using yet another metaphor, I was hoping that NumPy would have a loading dock for heavy duty deliveries for downstream projects or specialized array creations and won't disturb the regular customer entrance. Because if I look at this page https://numpy.org/doc/stable/referenc/routines.array-creation.html, there are a lot of functions and I think most of them are candidates to gain this keyword. I wish I can comment on a viable alternative but I really cannot understand the _array_xxxx_ discussions since they fly way over my head no matter how many times I tried. So that's why I naively mentioned the "np.astypedarray" or "np.asarray_but_not_numpy_array" or whatever. Now I see that it is even more complicated and I generated extra noise. So you can just ignore my previous suggestions. Except that I want to draw attention to the UX problem and I'd like to leave it at that. > > The other point is about the NEP stuff. I think I need to elaborate. If the NEPs are meant for internal NumPy discussions, then by all means, crank up the pointer*-meter to 11 and dive into it, totally fine with me. But if you also want to get feedback from outside, then probably a few lines of code examples for mere mortals would go a long way. Also it would make the discussion much more streamlined in my humble opinion. What I was trying to get at was that almost all NEPs read like a legal document that I want to agree as soon as possible. Because they often come without any or minimal amount of code in it. In NEP35 for example, there are nice code blocks in function dispatching but I guess it's not meant for me. Because it is only decorating asarray with some black magic happening there somehow (I guess). So I can't even comprehend what the proposition would mean for the regular, friendly, anti-duck users. But I am pretty sure it is about dispatching something because the word is repeated ~20 times :-) Thus the feedback would be limited. That was also what I meant there. But again I totally understand the complexity of these issues. So I'm not expecting to understand all details of NumPy machinery in a single NEP. > > But anyways, hope this clarifies a few things that I failed to convey in my previous mail. > ilhan > > > > On Thu, Aug 13, 2020 at 2:23 PM Ralf Gommers wrote: >> >> Thanks for raising these concerns Ilhan and Juan, and for answering Peter. Let me give my perspective as well. >> >> To start with, this is not specifically about Peter's NEP and PR. NEP 35 simply follows the pattern set by previous PRs, and given its tight scope is less difficult to understand than other NEPs on such technical topics. 
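For what it's worth, plain NumPy usage is meant to stay untouched: `like=` is keyword-only and defaults to None, so code that never passes it behaves exactly as before. A quick sketch, assuming a NumPy version that has the experimental keyword:

    import numpy as np

    a = np.asarray([1, 2, 3])   # unchanged behaviour, plain ndarray
    b = np.linspace(0, 1, 5)    # unchanged behaviour, plain ndarray

    # Only code that explicitly opts in passes a reference array; with a
    # NumPy reference the result is still just an ndarray:
    c = np.zeros(4, like=np.eye(2))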
Peter has done a lot of things right, and is close to the finish line. >> >> >> On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev wrote: >>> >>> >>> > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. >>> >>> This is what I intended to do in the Usage Guidance [2] section. Could >>> you elaborate on what more information you'd want to see there? Or is >>> it just a matter of reorganizing the NEP a bit to try and summarize >>> such things right at the top? >> >> >> We adapted the NEP template [6] several times last year to try and improve this. And specified in there as well that NEP content set to the mailing list should only contain the sections: Abstract, Motivation and Scope, Usage and Impact, and Backwards compatibility. This to ensure we fully understand the "why" and "what" before the "how". Unfortunately that template and procedure hasn't been exercised much yet, only in NEP 38 [7] and partially in NEP 41 [8]. >> >> If we have long-time maintainers of SciPy (Ilhan and myself), scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't understand the goals, relevance, target audience, or how they're supposed to use a new feature, that indicates that the people doing the writing and having the discussion are doing something wrong at a very fundamental level. >> >> At this point I'm pretty disappointed in and tired of how we write and discuss NEPs on technical topics like dispatching, dtypes and the like. People literally refuse to write down concrete motivations, goals and non-goals, code that's problematic now and will be better/working post-NEP and usage examples before launching into extensive discussion of the gory details of the internals. I'm not sure what to do about it. Completely separate API and behavior proposals from implementation proposals? Make separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo on the API team which then needs to approve every API change in new NEPs? Offer to co-write NEPs if someone is willing but doesn't understand how to go about it? Keep the current structure/process but veto further approvals until NEP authors get it right? >> >> I want to make an exception for merging the current NEP, for which the plan is to merge it as experimental to try in downstream PRs and get more experience. That does mean that master will be in an unreleasable state by the way, which is unusual and it'd be nice to get Chuck's explicit OK for that. But after that, I think we need a change here. I would like to hear what everyone thinks is the shape that change should take - any of my above suggestions, or something else? >> >> >>> >>> > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. >>> >>> TBH, I don't really know how to solve that point, so if you have any >>> specific suggestions, that's certainly welcome. I understand the >>> frustration for a reader trying to understand all the details, with >>> many being only described in NEP-18 [3], but we also strive to avoid >>> rewriting things that are written elsewhere, which would also >>> overburden those who are aware of what's being discussed. 
>>> >>> >>> > I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. >> >> >> Some variant of this proposal would be my preference. >> >> Cheers, >> Ralf >> >>> >>> [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 >>> [2] https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance >>> [3] https://numpy.org/neps/nep-0018-array-function-protocol.html >>> [4] https://numpy.org/neps/nep-0000.html#nep-workflow >>> [5] https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html >> >> >> [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst >> [7] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst >> [8] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst >> >> >>> >>> >>> >>> On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias wrote: >>> > >>> > I?ve generally been on the ?let the NumPy devs worry about it? side of things, but I do agree with Ilhan that `like=` is confusing and `typeof=` would be a much more appropriate name for that parameter. >>> > >>> > I do think library writers are NumPy users and so I wouldn?t really make that distinction, though. Users writing their own analysis code could very well be interested in writing code using numpy functions that will transparently work when the input is a CuPy array or whatever. >>> > >>> > I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. >>> > >>> > Food for thought. >>> > >>> > Juan. >>> > >>> > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: >>> > >>> > For what is worth, as a potential consumer in SciPy, it really doesn't say anything (both in NEP and the PR) about how the regular users of NumPy will benefit from this. If only and only 3rd parties are going to benefit from it, I am not sure adding a new keyword to an already confusing function is the right thing to do. >>> > >>> > Let me clarify, >>> > >>> > - This is already a very (I mean extremely very) easy keyword name to confuse with ones_like, zeros_like and by its nature any other interpretation. It is not signalling anything about the functionality that is being discussed. I would seriously consider reserving such obvious names for really obvious tasks. Because you would also expect the shape and ndim would be mimicked by the "like"d argument but it turns out it is acting more like "typeof=" and not "like=" at all. 
Because if we follow the semantics it reads as "make your argument asarray like the other thing" but it is actually doing, "make your argument an array with the other thing's type" which might not be an array after all. >>> > >>> > - Again, if this is meant for downstream libraries (because that's what I got out of the PR discussion, cupy, dask, and JAX were the only examples I could read) then hiding it in another function and writing with capital letters "this is not meant for numpy users" would be a much more convenient way to separate the target audience and regular users. numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may be would be quite clean and to the point with no ambiguous keywords. >>> > >>> > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. >>> > >>> > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From peter at entschev.com Thu Aug 13 10:33:52 2020 From: peter at entschev.com (Peter Andreas Entschev) Date: Thu, 13 Aug 2020 16:33:52 +0200 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: Ralf, I know none of it is a criticism of my work or directly of anybody else's work. I was just making a couple of general points (or questions really): 1. What is accepted as a reasonably clear NEP? It seems to point that a NEP _must_ follow the Template 2. Should the NEP Template be followed as a hardline? Personally, I think that would be fine in general, and diverging seems to be only an option of when additional information is necessary, but less should not be acceptable. And to be perfectly clear, none of what I said is a criticism to anybody in particular, but it's a frustration about the process seemingly not clear in itself for either authors or maintainers, thus my two points above. I apologize if any of what I said so far has been taken as a personal criticism to someone, it was definitely not meant that way. Finally, I like Juan's previous suggestion that someone not involved in the discussion proof-reading would be a great idea, I'm not sure if that's achievable in practice though. However, I think that discussion is a bit out of context, so I'll try to address the unclear parts of this NEP in a PR and we could continue the general discussion of the NEP process in a different thread if people wish to do so. Best, Peter On Thu, Aug 13, 2020 at 4:13 PM Ralf Gommers wrote: > > > > On Thu, Aug 13, 2020 at 2:47 PM Peter Andreas Entschev wrote: >> >> > We adapted the NEP template [6] several times last year to try and improve this. 
And specified in there as well that NEP content set to the mailing list should only contain the sections: Abstract, Motivation and Scope, Usage and Impact, and Backwards compatibility. This to ensure we fully understand the "why" and "what" before the "how". Unfortunately that template and procedure hasn't been exercised much yet, only in NEP 38 [7] and partially in NEP 41 [8]. >> > >> > If we have long-time maintainers of SciPy (Ilhan and myself), scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't understand the goals, relevance, target audience, or how they're supposed to use a new feature, that indicates that the people doing the writing and having the discussion are doing something wrong at a very fundamental level. >> >> I'm more than happy to edit the NEP and try to clarify all the >> concerns. > > > Thanks Peter. Let me reiterate, you did a lot of things right, have been happy to adapt when given feedback, and your willingness to go back and fix things up now is much appreciated (and I'm happy to help). No criticism of your work or attitude intended, on the contract. > >> >> However, it gets pretty difficult to do so when I as an >> author don't understand where the difficulty is. Ilhan, Juan and Ralf >> now pointed out things that are missing/unclear, but no comment was >> made in that regard when I sent the NEP, my point being: I couldn't >> fix what I didn't know was a problem to others. > > > Yes of course, I totally understand that. > >> >> > At this point I'm pretty disappointed in and tired of how we write and discuss NEPs on technical topics like dispatching, dtypes and the like. People literally refuse to write down concrete motivations, goals and non-goals, code that's problematic now and will be better/working post-NEP and usage examples before launching into extensive discussion of the gory details of the internals. I'm not sure what to do about it. >> >> Honestly, I don't really understand this. From my perspective, there >> are two ways to deal with such things: >> >> 1. Templates are to be taken mainly as _guidelines_ rather than >> _hardlines_, and the current text of NEP-35 definitely falls in the >> first category; >> 2. Templates are _hardlines_ and to be guided/enforced by maintainers >> at some point (maybe before merging the PR?). >> >> If 2 is the desired case for NumPy, which sounds a lot like what is >> wanted from NEP-35 and other NEPs generally, maintainers should let >> the authors know as early as possible that something isn't following >> the template's hardlines and it should be corrected. > > > Yes agreed, maintainers should do this. It was always meant as something in between, "please follow but deviate if needed". If essential elements are missing, I think that should be flagged earlier going forward. > > As a concrete example: Stephan (the main author of __array_function__) was still fuzzy on the functions covered and whether it solves array coercion, in the last 24 hours*. You answered by pointing to concrete code in Dask and Xarray. That code, why it doesn't work well now but will work with like=, should be at the top of the NEP as concrete problem statement / code examples. It's quite unfortunate that no maintainer explicitly requested this many months ago. 
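Something along these lines is probably what that problem statement would show. A sketch for illustration only, not the actual Dask or Xarray code referred to above (`pad_edges` is a made-up helper, and it assumes NumPy with the experimental `like=`):

    import numpy as np

    def pad_edges(x, n):
        # x may be a NumPy, Dask or CuPy array (1-D here).  Today np.zeros()
        # can only return a NumPy ndarray, so for a CuPy or Dask x the padding
        # ends up on the wrong backend (host memory, eager instead of lazy, ...)
        # unless the helper special-cases every library:
        #     edge = np.zeros(n)
        # With the proposed keyword, the creation call defers to x's own library:
        edge = np.zeros(n, like=x)
        # np.concatenate already dispatches via __array_function__ (NEP 18), so
        # the rest runs on whichever backend x came from.
        return np.concatenate([edge, x, edge])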
> > * https://github.com/numpy/numpy/pull/16935#issuecomment-673379038 > >> I don't mean any of this to remove myself of any responsibility, but would like to >> express my frustration that a 10 month-old NEP is only now getting so >> much pushback for being unclear after its implementation is nearing >> completion. > > > Totally understandable. I think part of the problem is that people only weigh in when they see concrete "this part is for you, and here's how you use it to solve problem X". > > As for me personally, if I'm saying things now that I didn't manage to respond to earlier (specific to your NEP), I apologize. 10 months ago I was in the middle of an intercontinental move and a new-ish job getting busier fast. Again, apologies and no criticism of your work. > >> >> >> > I want to make an exception for merging the current NEP, for which the plan is to merge it as experimental to try in downstream PRs and get more experience. That does mean that master will be in an unreleasable state by the way, which is unusual and it'd be nice to get Chuck's explicit OK for that. >> >> I don't quite understand this either, why would that leave master in >> an unreleasable state? > > > That's what Sebastian proposed yesterday: let's merge right now, open issues for all the things being brought up right now, and deal with them pre-1.20-release. I'm saying I'm fine with that, but then we actually need to go back and finalize the discussions before the next release. > > Cheers, > Ralf > > > > >> >> Best, >> Peter >> >> On Thu, Aug 13, 2020 at 2:21 PM Ralf Gommers wrote: >> > >> > Thanks for raising these concerns Ilhan and Juan, and for answering Peter. Let me give my perspective as well. >> > >> > To start with, this is not specifically about Peter's NEP and PR. NEP 35 simply follows the pattern set by previous PRs, and given its tight scope is less difficult to understand than other NEPs on such technical topics. Peter has done a lot of things right, and is close to the finish line. >> > >> > >> > On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev wrote: >> >> >> >> >> >> > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. >> >> >> >> This is what I intended to do in the Usage Guidance [2] section. Could >> >> you elaborate on what more information you'd want to see there? Or is >> >> it just a matter of reorganizing the NEP a bit to try and summarize >> >> such things right at the top? >> > >> > >> > We adapted the NEP template [6] several times last year to try and improve this. And specified in there as well that NEP content set to the mailing list should only contain the sections: Abstract, Motivation and Scope, Usage and Impact, and Backwards compatibility. This to ensure we fully understand the "why" and "what" before the "how". Unfortunately that template and procedure hasn't been exercised much yet, only in NEP 38 [7] and partially in NEP 41 [8]. >> > >> > If we have long-time maintainers of SciPy (Ilhan and myself), scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't understand the goals, relevance, target audience, or how they're supposed to use a new feature, that indicates that the people doing the writing and having the discussion are doing something wrong at a very fundamental level. 
>> > >> > At this point I'm pretty disappointed in and tired of how we write and discuss NEPs on technical topics like dispatching, dtypes and the like. People literally refuse to write down concrete motivations, goals and non-goals, code that's problematic now and will be better/working post-NEP and usage examples before launching into extensive discussion of the gory details of the internals. I'm not sure what to do about it. Completely separate API and behavior proposals from implementation proposals? Make separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo on the API team which then needs to approve every API change in new NEPs? Offer to co-write NEPs if someone is willing but doesn't understand how to go about it? Keep the current structure/process but veto further approvals until NEP authors get it right? >> > >> > I want to make an exception for merging the current NEP, for which the plan is to merge it as experimental to try in downstream PRs and get more experience. That does mean that master will be in an unreleasable state by the way, which is unusual and it'd be nice to get Chuck's explicit OK for that. But after that, I think we need a change here. I would like to hear what everyone thinks is the shape that change should take - any of my above suggestions, or something else? >> > >> > >> >> >> >> > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. >> >> >> >> TBH, I don't really know how to solve that point, so if you have any >> >> specific suggestions, that's certainly welcome. I understand the >> >> frustration for a reader trying to understand all the details, with >> >> many being only described in NEP-18 [3], but we also strive to avoid >> >> rewriting things that are written elsewhere, which would also >> >> overburden those who are aware of what's being discussed. >> >> >> >> >> >> > I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. >> > >> > >> > Some variant of this proposal would be my preference. >> > >> > Cheers, >> > Ralf >> > >> >> >> >> [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 >> >> [2] https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance >> >> [3] https://numpy.org/neps/nep-0018-array-function-protocol.html >> >> [4] https://numpy.org/neps/nep-0000.html#nep-workflow >> >> [5] https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html >> > >> > >> > [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst >> > [7] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst >> > [8] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst >> > >> > >> >> >> >> >> >> >> >> On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias wrote: >> >> > >> >> > I?ve generally been on the ?let the NumPy devs worry about it? 
side of things, but I do agree with Ilhan that `like=` is confusing and `typeof=` would be a much more appropriate name for that parameter. >> >> > >> >> > I do think library writers are NumPy users and so I wouldn?t really make that distinction, though. Users writing their own analysis code could very well be interested in writing code using numpy functions that will transparently work when the input is a CuPy array or whatever. >> >> > >> >> > I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. >> >> > >> >> > Food for thought. >> >> > >> >> > Juan. >> >> > >> >> > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: >> >> > >> >> > For what is worth, as a potential consumer in SciPy, it really doesn't say anything (both in NEP and the PR) about how the regular users of NumPy will benefit from this. If only and only 3rd parties are going to benefit from it, I am not sure adding a new keyword to an already confusing function is the right thing to do. >> >> > >> >> > Let me clarify, >> >> > >> >> > - This is already a very (I mean extremely very) easy keyword name to confuse with ones_like, zeros_like and by its nature any other interpretation. It is not signalling anything about the functionality that is being discussed. I would seriously consider reserving such obvious names for really obvious tasks. Because you would also expect the shape and ndim would be mimicked by the "like"d argument but it turns out it is acting more like "typeof=" and not "like=" at all. Because if we follow the semantics it reads as "make your argument asarray like the other thing" but it is actually doing, "make your argument an array with the other thing's type" which might not be an array after all. >> >> > >> >> > - Again, if this is meant for downstream libraries (because that's what I got out of the PR discussion, cupy, dask, and JAX were the only examples I could read) then hiding it in another function and writing with capital letters "this is not meant for numpy users" would be a much more convenient way to separate the target audience and regular users. numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may be would be quite clean and to the point with no ambiguous keywords. >> >> > >> >> > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. >> >> > >> >> > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. 
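For readers trying to picture what the keyword under discussion actually does: the sketch below is only an illustration, not text from the NEP or the PR. It assumes the experimental `like=` argument proposed in NEP 35 is available; `downstream_func` is a hypothetical helper and `x` stands for any array object that implements `__array_function__` (a CuPy or Dask array, say).

    import numpy as np

    def downstream_func(x):
        # With `like=x`, np.arange is dispatched through x's
        # __array_function__, so `weights` is created by whichever
        # library implements `x` (CuPy, Dask, ...) instead of always
        # being a NumPy ndarray.
        weights = np.arange(len(x), like=x)
        return x * weights

Without `like=`, the intermediate array here would always be a NumPy ndarray, which, depending on the library `x` comes from, either triggers an unwanted conversion or fails outright; closing that gap is the point of the keyword, and it is also why it behaves more like "use the type of this object" than "make something that looks like it".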
>> > >> > _______________________________________________ >> > NumPy-Discussion mailing list >> > NumPy-Discussion at python.org >> > https://mail.python.org/mailman/listinfo/numpy-discussion >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From sebastian at sipsolutions.net Thu Aug 13 10:47:43 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 13 Aug 2020 09:47:43 -0500 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: <859079d7b3ab9b4b08e32b46bf78b529e5f69955.camel@sipsolutions.net> On Thu, 2020-08-13 at 15:47 +0200, Peter Andreas Entschev wrote: > > We adapted the NEP template [6] several times last year to try and > > improve this. And specified in there as well that NEP content set > > to the mailing list should only contain the sections: Abstract, > > Motivation and Scope, Usage and Impact, and Backwards > > compatibility. This to ensure we fully understand the "why" and > > "what" before the "how". Unfortunately that template and procedure > > hasn't been exercised much yet, only in NEP 38 [7] and partially in > > NEP 41 [8]. > > > > If we have long-time maintainers of SciPy (Ilhan and myself), > > scikit-image (Juan) and CuPy (Leo, on the PR review) all saying > > they don't understand the goals, relevance, target audience, or how > > they're supposed to use a new feature, that indicates that the > > people doing the writing and having the discussion are doing > > something wrong at a very fundamental level. > > I'm more than happy to edit the NEP and try to clarify all the > concerns. However, it gets pretty difficult to do so when I as an > author don't understand where the difficulty is. Ilhan, Juan and Ralf > now pointed out things that are missing/unclear, but no comment was > made in that regard when I sent the NEP, my point being: I couldn't > fix what I didn't know was a problem to others. > > > At this point I'm pretty disappointed in and tired of how we write > > and discuss NEPs on technical topics like dispatching, dtypes and > > the like. People literally refuse to write down concrete > > motivations, goals and non-goals, code that's problematic now and > > will be better/working post-NEP and usage examples before launching > > into extensive discussion of the gory details of the internals. I'm > > not sure what to do about it. > > Honestly, I don't really understand this. From my perspective, there > are two ways to deal with such things: > > 1. Templates are to be taken mainly as _guidelines_ rather than > _hardlines_, and the current text of NEP-35 definitely falls in the > first category; > 2. Templates are _hardlines_ and to be guided/enforced by maintainers > at some point (maybe before merging the PR?). > > If 2 is the desired case for NumPy, which sounds a lot like what is > wanted from NEP-35 and other NEPs generally, maintainers should let > the authors know as early as possible that something isn't following > the template's hardlines and it should be corrected. 
I don't mean any > of this to remove myself of any responsibility, but would like to > express my frustration that a 10 month-old NEP is only now getting so > much pushback for being unclear after its implementation is nearing > completion. > > > I want to make an exception for merging the current NEP, for which > > the plan is to merge it as experimental to try in downstream PRs > > and get more experience. That does mean that master will be in an > > unreleasable state by the way, which is unusual and it'd be nice to > > get Chuck's explicit OK for that. > > I don't quite understand this either, why would that leave master in > an unreleasable state? > Well, a few points are not discussed to the end yet. The name is one that did not get much attention yet. Maybe because nobody had much concerns about it yet, or maybe it was just lower on the priority list. To be clear: I am fully prepared to pull this out of master before release or probably rather disable it in release versions. An alternative could be an environment variable (an env variable will not stop actual adoption, but we may be fine with that). And unless NEP 35 is accepted, that probably has to be the default, fortunately there is still some time until the next release. - Sebastian > Best, > Peter > > On Thu, Aug 13, 2020 at 2:21 PM Ralf Gommers > wrote: > > Thanks for raising these concerns Ilhan and Juan, and for answering > > Peter. Let me give my perspective as well. > > > > To start with, this is not specifically about Peter's NEP and PR. > > NEP 35 simply follows the pattern set by previous PRs, and given > > its tight scope is less difficult to understand than other NEPs on > > such technical topics. Peter has done a lot of things right, and is > > close to the finish line. > > > > > > On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev < > > peter at entschev.com> wrote: > > > > > > > I think, arriving to an agreement would be much faster if there > > > > is an executive summary of who this is intended for and what > > > > the regular usage is. Because with no offense, all I see is > > > > "dispatch", "_array_function_" and a lot of technical details > > > > of which I am absolutely ignorant. > > > > > > This is what I intended to do in the Usage Guidance [2] section. > > > Could > > > you elaborate on what more information you'd want to see there? > > > Or is > > > it just a matter of reorganizing the NEP a bit to try and > > > summarize > > > such things right at the top? > > > > We adapted the NEP template [6] several times last year to try and > > improve this. And specified in there as well that NEP content set > > to the mailing list should only contain the sections: Abstract, > > Motivation and Scope, Usage and Impact, and Backwards > > compatibility. This to ensure we fully understand the "why" and > > "what" before the "how". Unfortunately that template and procedure > > hasn't been exercised much yet, only in NEP 38 [7] and partially in > > NEP 41 [8]. > > > > If we have long-time maintainers of SciPy (Ilhan and myself), > > scikit-image (Juan) and CuPy (Leo, on the PR review) all saying > > they don't understand the goals, relevance, target audience, or how > > they're supposed to use a new feature, that indicates that the > > people doing the writing and having the discussion are doing > > something wrong at a very fundamental level. > > > > At this point I'm pretty disappointed in and tired of how we write > > and discuss NEPs on technical topics like dispatching, dtypes and > > the like. 
People literally refuse to write down concrete > > motivations, goals and non-goals, code that's problematic now and > > will be better/working post-NEP and usage examples before launching > > into extensive discussion of the gory details of the internals. I'm > > not sure what to do about it. Completely separate API and behavior > > proposals from implementation proposals? Make separate "API" and > > "internals" teams with the likes of Juan, Ilhan and Leo on the API > > team which then needs to approve every API change in new NEPs? > > Offer to co-write NEPs if someone is willing but doesn't understand > > how to go about it? Keep the current structure/process but veto > > further approvals until NEP authors get it right? > > > > I want to make an exception for merging the current NEP, for which > > the plan is to merge it as experimental to try in downstream PRs > > and get more experience. That does mean that master will be in an > > unreleasable state by the way, which is unusual and it'd be nice to > > get Chuck's explicit OK for that. But after that, I think we need a > > change here. I would like to hear what everyone thinks is the shape > > that change should take - any of my above suggestions, or something > > else? > > > > > > > > Finally as a minor point, I know we are mostly (ex-)academics > > > > but this necessity of formal language on NEPs is self-imposed > > > > (probably PEPs are to blame) and not quite helping. It can be a > > > > bit more descriptive in my external opinion. > > > > > > TBH, I don't really know how to solve that point, so if you have > > > any > > > specific suggestions, that's certainly welcome. I understand the > > > frustration for a reader trying to understand all the details, > > > with > > > many being only described in NEP-18 [3], but we also strive to > > > avoid > > > rewriting things that are written elsewhere, which would also > > > overburden those who are aware of what's being discussed. > > > > > > > > > > I also share Ilhan?s concern (and I mentioned this in a > > > > previous NEP discussion) that NEPs are getting pretty > > > > inaccessible. In a sense these are difficult topics and readers > > > > should be expected to have *some* familiarity with the topics > > > > being discussed, but perhaps more effort should be put into the > > > > context/motivation/background of a NEP before accepting it. One > > > > way to ensure this might be to require a final proofreading > > > > step by someone who has not been involved at all in the > > > > discussions, like peer review does for papers. > > > > Some variant of this proposal would be my preference. > > > > Cheers, > > Ralf > > > > > [1] > > > https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 > > > [2] > > > https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance > > > [3] https://numpy.org/neps/nep-0018-array-function-protocol.html > > > [4] https://numpy.org/neps/nep-0000.html#nep-workflow > > > [5] > > > https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html > > > > [6] > > https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst > > [7] > > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst > > [8] > > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst > > > > > > > > > > > > > On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias < > > > jni at fastmail.com> wrote: > > > > I?ve generally been on the ?let the NumPy devs worry about it? 
> > > > side of things, but I do agree with Ilhan that `like=` is > > > > confusing and `typeof=` would be a much more appropriate name > > > > for that parameter. > > > > > > > > I do think library writers are NumPy users and so I wouldn?t > > > > really make that distinction, though. Users writing their own > > > > analysis code could very well be interested in writing code > > > > using numpy functions that will transparently work when the > > > > input is a CuPy array or whatever. > > > > > > > > I also share Ilhan?s concern (and I mentioned this in a > > > > previous NEP discussion) that NEPs are getting pretty > > > > inaccessible. In a sense these are difficult topics and readers > > > > should be expected to have *some* familiarity with the topics > > > > being discussed, but perhaps more effort should be put into the > > > > context/motivation/background of a NEP before accepting it. One > > > > way to ensure this might be to require a final proofreading > > > > step by someone who has not been involved at all in the > > > > discussions, like peer review does for papers. > > > > > > > > Food for thought. > > > > > > > > Juan. > > > > > > > > On 13 Aug 2020, at 9:24 am, Ilhan Polat > > > > wrote: > > > > > > > > For what is worth, as a potential consumer in SciPy, it really > > > > doesn't say anything (both in NEP and the PR) about how the > > > > regular users of NumPy will benefit from this. If only and only > > > > 3rd parties are going to benefit from it, I am not sure adding > > > > a new keyword to an already confusing function is the right > > > > thing to do. > > > > > > > > Let me clarify, > > > > > > > > - This is already a very (I mean extremely very) easy keyword > > > > name to confuse with ones_like, zeros_like and by its nature > > > > any other interpretation. It is not signalling anything about > > > > the functionality that is being discussed. I would seriously > > > > consider reserving such obvious names for really obvious tasks. > > > > Because you would also expect the shape and ndim would be > > > > mimicked by the "like"d argument but it turns out it is acting > > > > more like "typeof=" and not "like=" at all. Because if we > > > > follow the semantics it reads as "make your argument asarray > > > > like the other thing" but it is actually doing, "make your > > > > argument an array with the other thing's type" which might not > > > > be an array after all. > > > > > > > > - Again, if this is meant for downstream libraries (because > > > > that's what I got out of the PR discussion, cupy, dask, and JAX > > > > were the only examples I could read) then hiding it in another > > > > function and writing with capital letters "this is not meant > > > > for numpy users" would be a much more convenient way to > > > > separate the target audience and regular users. > > > > numpy.astypedarray([[some data], [...]], type_of=x) or whatever > > > > else it may be would be quite clean and to the point with no > > > > ambiguous keywords. > > > > > > > > I think, arriving to an agreement would be much faster if there > > > > is an executive summary of who this is intended for and what > > > > the regular usage is. Because with no offense, all I see is > > > > "dispatch", "_array_function_" and a lot of technical details > > > > of which I am absolutely ignorant. > > > > > > > > Finally as a minor point, I know we are mostly (ex-)academics > > > > but this necessity of formal language on NEPs is self-imposed > > > > (probably PEPs are to blame) and not quite helping. 
It can be a > > > > bit more descriptive in my external opinion. > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From ilhanpolat at gmail.com Thu Aug 13 12:46:30 2020 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Thu, 13 Aug 2020 18:46:30 +0200 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: Yes, the underlying gory details should be spelled out of course but if it is also modifying/adding to API then it is best to sound the horn and invite zombies to take a stab at it. Often people arrive with interesting use-cases that you wouldn't have thought about. And I am very familiar with the pushback feeling you are having right now, probably internally shouting "where have you been all this time you slackers?". As you might have seen me asking questions here and Cython lists, when I am done with some new feature over SciPy, it is also going to be a very very long and tiring process. I am really not looking forward to it :-) but I guess it is part of the deal. Maybe I can give some comfort that if more people start to flock over that means it has morphed into a finished product so people can shoot. But, I honestly thought this was a new NEP, that's a mistake on my part. For the like, typeof and other candidates, by esoteric I mean foreign enough to most users. We already have a nice candidate I think; ehm... "dispatch" or "dispatch_like" or something like that, nobody sober enough would confuse this with any other. And since this won't be typed in daily usage, or so I understood, I guess it is ok to make it verbose. But still take it as an initial guess and feel free to dismiss. I still would be in a platonic love with "numpy.DIY" or "numpy.hermes" namespace with a nice "bring your own _array_function_" service. On Thu, Aug 13, 2020 at 4:16 PM Peter Andreas Entschev wrote: > Ilhan, > > Thanks, that does clarify things. > > I think the main point -- and correct me here if I'm still wrong -- is > that we want the NEP to have some very clear example of when/why/how > to use it, preferably as early in the text as possible, maybe just > below the Abstract, in a Motivation and Scope section, as the NEP > Template [6] pointed out to by Ralf earlier suggests. That is a > totally valid ask, and I'll try to address it as soon as possible > (hopefully today or tomorrow). > > To the point of whether NEPs are to be read by users, I normally don't > expect users to be required to read and understand those NEPs other > than by pure curiosity. If we need them to do so, then there's > definitely a big problem in the API. 
This may sound counterintuitive > with what I said before about the "like=" name, but that's really the > piece of the NumPy API that I with a somewhat reasonable understand of > arrays don't quite get or like, for instance "asarray" and "like" > sound exactly the same thing, but they're not in the NumPy context, > and on the other hand it's quite difficult to find a reasonable name > to clarify that. And once more, I do like the "typeof=" suggestion > more than "like=" to be perfectly honest, I'm just afraid it could be > mistaken by the "dtype=" keyword somehow and thus still not solve the > clarity problem. Going back to users reading NEPs or not, I would > really expect that the docstring from the function is sufficiently > clear to keep users off of it, but still give them an understanding of > why that exists, the current docstring is in [9], please do comment on > it if you have ideas of how to make it more accessible to users. > > You also mentioned you'd like that the name is as esoteric as > possible, do you have any suggestions for an esoteric name that is > hopefully unambiguous too? Naming has definitely been very much on the > table since the NEP was written, but the consensus was more that > "like=" is reasonably similar enough in both application and the name > itself to "empty_like" and derived functions, that's why we just stuck > to it. > > Best, > Peter > > [9] > https://github.com/numpy/numpy/pull/16935/files#diff-e5969453e399f2d32519d305b2582da9R16-R22 > > On Thu, Aug 13, 2020 at 3:43 PM Ilhan Polat wrote: > > > > To maybe lighten up the discussion a bit and to make my outsider > confusion more tangible, let me start by apologizing for diving head first > without weighing the past luggage :-) I always forget how much effort goes > into these things and for outsiders like me, it's a matter of dipping the > finger and tasting it just before starting to complain how much salt is > missing etc. What I was mentioning about NEPs wasn't only related > specifically to this one by the way. It's the generic feeling that I have. > > > > First let me start what I mean by NumPy users and downstreamers > distinction. This is very much related to how data-science and huge-array > users are magnetizing every tool out there in the Python world which is > fine though the majority of number-crunchers have nothing to do with any of > GPU/Parallelism/ClusterUsage etc. Hence when I mention NumPy users, think > of people who use NumPy as its own right with no duck-typing and nothing > related to subclassing. Just straightforward array creation and lots of ops > on these arrays. For those people (I'm one of them), this option brings in > a keyword that we would never use. And it gets into many major functions > (linspace and others mentioned somewhere). So it has a very appealing name > but has nothing to do with me in an already very crowded namespace and > keyword catalogue. That's basically a UX issue to be addressed (under the > assumption that users like me are the majority). Either making its name as > esoteric as possible so I naturally stay away from it or I don't see it. > This has absolutely nothing to do with looking down on the downstream > libraries. They are flat-out amazing and the more we can support them the > merrier. > > > > Using yet another metaphor, I was hoping that NumPy would have a loading > dock for heavy duty deliveries for downstream projects or specialized array > creations and won't disturb the regular customer entrance. 
Because if I > look at this page > https://numpy.org/doc/stable/referenc/routines.array-creation.html, there > are a lot of functions and I think most of them are candidates to gain this > keyword. I wish I can comment on a viable alternative but I really cannot > understand the _array_xxxx_ discussions since they fly way over my head no > matter how many times I tried. So that's why I naively mentioned the > "np.astypedarray" or "np.asarray_but_not_numpy_array" or whatever. Now I > see that it is even more complicated and I generated extra noise. So you > can just ignore my previous suggestions. Except that I want to draw > attention to the UX problem and I'd like to leave it at that. > > > > The other point is about the NEP stuff. I think I need to elaborate. If > the NEPs are meant for internal NumPy discussions, then by all means, crank > up the pointer*-meter to 11 and dive into it, totally fine with me. But if > you also want to get feedback from outside, then probably a few lines of > code examples for mere mortals would go a long way. Also it would make the > discussion much more streamlined in my humble opinion. What I was trying to > get at was that almost all NEPs read like a legal document that I want to > agree as soon as possible. Because they often come without any or minimal > amount of code in it. In NEP35 for example, there are nice code blocks in > function dispatching but I guess it's not meant for me. Because it is only > decorating asarray with some black magic happening there somehow (I guess). > So I can't even comprehend what the proposition would mean for the regular, > friendly, anti-duck users. But I am pretty sure it is about dispatching > something because the word is repeated ~20 times :-) Thus the feedback > would be limited. That was also what I meant there. But again I totally > understand the complexity of these issues. So I'm not expecting to > understand all details of NumPy machinery in a single NEP. > > > > But anyways, hope this clarifies a few things that I failed to convey in > my previous mail. > > ilhan > > > > > > > > On Thu, Aug 13, 2020 at 2:23 PM Ralf Gommers > wrote: > >> > >> Thanks for raising these concerns Ilhan and Juan, and for answering > Peter. Let me give my perspective as well. > >> > >> To start with, this is not specifically about Peter's NEP and PR. NEP > 35 simply follows the pattern set by previous PRs, and given its tight > scope is less difficult to understand than other NEPs on such technical > topics. Peter has done a lot of things right, and is close to the finish > line. > >> > >> > >> On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev < > peter at entschev.com> wrote: > >>> > >>> > >>> > I think, arriving to an agreement would be much faster if there is > an executive summary of who this is intended for and what the regular usage > is. Because with no offense, all I see is "dispatch", "_array_function_" > and a lot of technical details of which I am absolutely ignorant. > >>> > >>> This is what I intended to do in the Usage Guidance [2] section. Could > >>> you elaborate on what more information you'd want to see there? Or is > >>> it just a matter of reorganizing the NEP a bit to try and summarize > >>> such things right at the top? > >> > >> > >> We adapted the NEP template [6] several times last year to try and > improve this. And specified in there as well that NEP content set to the > mailing list should only contain the sections: Abstract, Motivation and > Scope, Usage and Impact, and Backwards compatibility. 
This to ensure we > fully understand the "why" and "what" before the "how". Unfortunately that > template and procedure hasn't been exercised much yet, only in NEP 38 [7] > and partially in NEP 41 [8]. > >> > >> If we have long-time maintainers of SciPy (Ilhan and myself), > scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't > understand the goals, relevance, target audience, or how they're supposed > to use a new feature, that indicates that the people doing the writing and > having the discussion are doing something wrong at a very fundamental level. > >> > >> At this point I'm pretty disappointed in and tired of how we write and > discuss NEPs on technical topics like dispatching, dtypes and the like. > People literally refuse to write down concrete motivations, goals and > non-goals, code that's problematic now and will be better/working post-NEP > and usage examples before launching into extensive discussion of the gory > details of the internals. I'm not sure what to do about it. Completely > separate API and behavior proposals from implementation proposals? Make > separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo > on the API team which then needs to approve every API change in new NEPs? > Offer to co-write NEPs if someone is willing but doesn't understand how to > go about it? Keep the current structure/process but veto further approvals > until NEP authors get it right? > >> > >> I want to make an exception for merging the current NEP, for which the > plan is to merge it as experimental to try in downstream PRs and get more > experience. That does mean that master will be in an unreleasable state by > the way, which is unusual and it'd be nice to get Chuck's explicit OK for > that. But after that, I think we need a change here. I would like to hear > what everyone thinks is the shape that change should take - any of my above > suggestions, or something else? > >> > >> > >>> > >>> > Finally as a minor point, I know we are mostly (ex-)academics but > this necessity of formal language on NEPs is self-imposed (probably PEPs > are to blame) and not quite helping. It can be a bit more descriptive in my > external opinion. > >>> > >>> TBH, I don't really know how to solve that point, so if you have any > >>> specific suggestions, that's certainly welcome. I understand the > >>> frustration for a reader trying to understand all the details, with > >>> many being only described in NEP-18 [3], but we also strive to avoid > >>> rewriting things that are written elsewhere, which would also > >>> overburden those who are aware of what's being discussed. > >>> > >>> > >>> > I also share Ilhan?s concern (and I mentioned this in a previous NEP > discussion) that NEPs are getting pretty inaccessible. In a sense these are > difficult topics and readers should be expected to have *some* familiarity > with the topics being discussed, but perhaps more effort should be put into > the context/motivation/background of a NEP before accepting it. One way to > ensure this might be to require a final proofreading step by someone who > has not been involved at all in the discussions, like peer review does for > papers. > >> > >> > >> Some variant of this proposal would be my preference. 
> >> > >> Cheers, > >> Ralf > >> > >>> > >>> [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 > >>> [2] > https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance > >>> [3] https://numpy.org/neps/nep-0018-array-function-protocol.html > >>> [4] https://numpy.org/neps/nep-0000.html#nep-workflow > >>> [5] > https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html > >> > >> > >> [6] > https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst > >> [7] > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst > >> [8] > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst > >> > >> > >>> > >>> > >>> > >>> On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias > wrote: > >>> > > >>> > I?ve generally been on the ?let the NumPy devs worry about it? side > of things, but I do agree with Ilhan that `like=` is confusing and > `typeof=` would be a much more appropriate name for that parameter. > >>> > > >>> > I do think library writers are NumPy users and so I wouldn?t really > make that distinction, though. Users writing their own analysis code could > very well be interested in writing code using numpy functions that will > transparently work when the input is a CuPy array or whatever. > >>> > > >>> > I also share Ilhan?s concern (and I mentioned this in a previous NEP > discussion) that NEPs are getting pretty inaccessible. In a sense these are > difficult topics and readers should be expected to have *some* familiarity > with the topics being discussed, but perhaps more effort should be put into > the context/motivation/background of a NEP before accepting it. One way to > ensure this might be to require a final proofreading step by someone who > has not been involved at all in the discussions, like peer review does for > papers. > >>> > > >>> > Food for thought. > >>> > > >>> > Juan. > >>> > > >>> > On 13 Aug 2020, at 9:24 am, Ilhan Polat > wrote: > >>> > > >>> > For what is worth, as a potential consumer in SciPy, it really > doesn't say anything (both in NEP and the PR) about how the regular users > of NumPy will benefit from this. If only and only 3rd parties are going to > benefit from it, I am not sure adding a new keyword to an already confusing > function is the right thing to do. > >>> > > >>> > Let me clarify, > >>> > > >>> > - This is already a very (I mean extremely very) easy keyword name > to confuse with ones_like, zeros_like and by its nature any other > interpretation. It is not signalling anything about the functionality that > is being discussed. I would seriously consider reserving such obvious names > for really obvious tasks. Because you would also expect the shape and ndim > would be mimicked by the "like"d argument but it turns out it is acting > more like "typeof=" and not "like=" at all. Because if we follow the > semantics it reads as "make your argument asarray like the other thing" but > it is actually doing, "make your argument an array with the other thing's > type" which might not be an array after all. > >>> > > >>> > - Again, if this is meant for downstream libraries (because that's > what I got out of the PR discussion, cupy, dask, and JAX were the only > examples I could read) then hiding it in another function and writing with > capital letters "this is not meant for numpy users" would be a much more > convenient way to separate the target audience and regular users. 
> numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may > be would be quite clean and to the point with no ambiguous keywords. > >>> > > >>> > I think, arriving to an agreement would be much faster if there is > an executive summary of who this is intended for and what the regular usage > is. Because with no offense, all I see is "dispatch", "_array_function_" > and a lot of technical details of which I am absolutely ignorant. > >>> > > >>> > Finally as a minor point, I know we are mostly (ex-)academics but > this necessity of formal language on NEPs is self-imposed (probably PEPs > are to blame) and not quite helping. It can be a bit more descriptive in my > external opinion. > >> > >> _______________________________________________ > >> NumPy-Discussion mailing list > >> NumPy-Discussion at python.org > >> https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From shoyer at gmail.com Thu Aug 13 15:29:27 2020 From: shoyer at gmail.com (Stephan Hoyer) Date: Thu, 13 Aug 2020 12:29:27 -0700 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: On Thu, Aug 13, 2020 at 5:22 AM Ralf Gommers wrote: > Thanks for raising these concerns Ilhan and Juan, and for answering Peter. > Let me give my perspective as well. > > To start with, this is not specifically about Peter's NEP and PR. NEP 35 > simply follows the pattern set by previous PRs, and given its tight scope > is less difficult to understand than other NEPs on such technical topics. > Peter has done a lot of things right, and is close to the finish line. > > > On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev < > peter at entschev.com> wrote: > >> >> > I think, arriving to an agreement would be much faster if there is an >> executive summary of who this is intended for and what the regular usage >> is. Because with no offense, all I see is "dispatch", "_array_function_" >> and a lot of technical details of which I am absolutely ignorant. >> >> This is what I intended to do in the Usage Guidance [2] section. Could >> you elaborate on what more information you'd want to see there? Or is >> it just a matter of reorganizing the NEP a bit to try and summarize >> such things right at the top? >> > > We adapted the NEP template [6] several times last year to try and improve > this. And specified in there as well that NEP content set to the mailing > list should only contain the sections: Abstract, Motivation and Scope, > Usage and Impact, and Backwards compatibility. This to ensure we fully > understand the "why" and "what" before the "how". Unfortunately that > template and procedure hasn't been exercised much yet, only in NEP 38 [7] > and partially in NEP 41 [8]. 
> > If we have long-time maintainers of SciPy (Ilhan and myself), scikit-image > (Juan) and CuPy (Leo, on the PR review) all saying they don't understand > the goals, relevance, target audience, or how they're supposed to use a new > feature, that indicates that the people doing the writing and having the > discussion are doing something wrong at a very fundamental level. > > At this point I'm pretty disappointed in and tired of how we write and > discuss NEPs on technical topics like dispatching, dtypes and the like. > People literally refuse to write down concrete motivations, goals and > non-goals, code that's problematic now and will be better/working post-NEP > and usage examples before launching into extensive discussion of the gory > details of the internals. I'm not sure what to do about it. Completely > separate API and behavior proposals from implementation proposals? Make > separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo > on the API team which then needs to approve every API change in new NEPs? > Offer to co-write NEPs if someone is willing but doesn't understand how to > go about it? Keep the current structure/process but veto further approvals > until NEP authors get it right? > I think the NEP template is great, and we should try to be more diligent about following it! My own NEP 37 (__array_module__) is probably a good example of poor presentation due to not following the template structure. It goes pretty deep into low-level motivation and some implementation details before usage examples. Speaking just for myself, I would have appreciated a friendly nudge to use the template. Certainly I think it would be fine to require using the template for newly submitted NEPs. I did not remember about it when I started drafting NEP 37, and it definitely would have helped. I may still try to do a revision at some point to use the template structure. > I want to make an exception for merging the current NEP, for which the > plan is to merge it as experimental to try in downstream PRs and get more > experience. That does mean that master will be in an unreleasable state by > the way, which is unusual and it'd be nice to get Chuck's explicit OK for > that. But after that, I think we need a change here. I would like to hear > what everyone thinks is the shape that change should take - any of my above > suggestions, or something else? > > > >> > Finally as a minor point, I know we are mostly (ex-)academics but this >> necessity of formal language on NEPs is self-imposed (probably PEPs are to >> blame) and not quite helping. It can be a bit more descriptive in my >> external opinion. >> >> TBH, I don't really know how to solve that point, so if you have any >> specific suggestions, that's certainly welcome. I understand the >> frustration for a reader trying to understand all the details, with >> many being only described in NEP-18 [3], but we also strive to avoid >> rewriting things that are written elsewhere, which would also >> overburden those who are aware of what's being discussed. >> >> >> > I also share Ilhan?s concern (and I mentioned this in a previous NEP >> discussion) that NEPs are getting pretty inaccessible. In a sense these are >> difficult topics and readers should be expected to have *some* familiarity >> with the topics being discussed, but perhaps more effort should be put into >> the context/motivation/background of a NEP before accepting it. 
One way to >> ensure this might be to require a final proofreading step by someone who >> has not been involved at all in the discussions, like peer review does for >> papers. >> > > Some variant of this proposal would be my preference. > > Cheers, > Ralf > > >> [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 >> [2] >> https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance >> [3] https://numpy.org/neps/nep-0018-array-function-protocol.html >> [4] https://numpy.org/neps/nep-0000.html#nep-workflow >> [5] >> https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html > > > [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst > [7] > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst > [8] > https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst > > > >> >> >> On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias >> wrote: >> > >> > I?ve generally been on the ?let the NumPy devs worry about it? side of >> things, but I do agree with Ilhan that `like=` is confusing and `typeof=` >> would be a much more appropriate name for that parameter. >> > >> > I do think library writers are NumPy users and so I wouldn?t really >> make that distinction, though. Users writing their own analysis code could >> very well be interested in writing code using numpy functions that will >> transparently work when the input is a CuPy array or whatever. >> > >> > I also share Ilhan?s concern (and I mentioned this in a previous NEP >> discussion) that NEPs are getting pretty inaccessible. In a sense these are >> difficult topics and readers should be expected to have *some* familiarity >> with the topics being discussed, but perhaps more effort should be put into >> the context/motivation/background of a NEP before accepting it. One way to >> ensure this might be to require a final proofreading step by someone who >> has not been involved at all in the discussions, like peer review does for >> papers. >> > >> > Food for thought. >> > >> > Juan. >> > >> > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: >> > >> > For what is worth, as a potential consumer in SciPy, it really doesn't >> say anything (both in NEP and the PR) about how the regular users of NumPy >> will benefit from this. If only and only 3rd parties are going to benefit >> from it, I am not sure adding a new keyword to an already confusing >> function is the right thing to do. >> > >> > Let me clarify, >> > >> > - This is already a very (I mean extremely very) easy keyword name to >> confuse with ones_like, zeros_like and by its nature any other >> interpretation. It is not signalling anything about the functionality that >> is being discussed. I would seriously consider reserving such obvious names >> for really obvious tasks. Because you would also expect the shape and ndim >> would be mimicked by the "like"d argument but it turns out it is acting >> more like "typeof=" and not "like=" at all. Because if we follow the >> semantics it reads as "make your argument asarray like the other thing" but >> it is actually doing, "make your argument an array with the other thing's >> type" which might not be an array after all. 
>> >
>> > - Again, if this is meant for downstream libraries (because that's what
>> I got out of the PR discussion, cupy, dask, and JAX were the only examples
>> I could read) then hiding it in another function and writing with capital
>> letters "this is not meant for numpy users" would be a much more convenient
>> way to separate the target audience and regular users.
>> numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may
>> be would be quite clean and to the point with no ambiguous keywords.
>> >
>> > I think, arriving to an agreement would be much faster if there is an
>> executive summary of who this is intended for and what the regular usage
>> is. Because with no offense, all I see is "dispatch", "_array_function_"
>> and a lot of technical details of which I am absolutely ignorant.
>> >
>> > Finally as a minor point, I know we are mostly (ex-)academics but this
>> necessity of formal language on NEPs is self-imposed (probably PEPs are to
>> blame) and not quite helping. It can be a bit more descriptive in my
>> external opinion.
>>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From asmeurer at gmail.com Thu Aug 13 17:14:42 2020
From: asmeurer at gmail.com (Aaron Meurer)
Date: Thu, 13 Aug 2020 15:14:42 -0600
Subject: [Numpy-discussion] Use of booleans in slices
Message-ID: 

I noticed that np.bool_.__index__() gives a DeprecationWarning

>>> np.bool_(True).__index__()
__main__:1: DeprecationWarning: In future, it will be an error for
'np.bool_' scalars to be interpreted as an index
1

This is good, because booleans don't actually act like integers in
indexing contexts. However, raw Python bools also allow __index__()

>>> True.__index__()
1

A consequence of this is that NumPy slices allow booleans, as long as
they are the Python type (if you use the NumPy bool_ type you get the
deprecation warning).

>>> a = np.arange(10)
>>> a[True:]
array([1, 2, 3, 4, 5, 6, 7, 8, 9])

Should this behavior also be considered deprecated? Presumably
deprecating bool.__index__() in Python is a no-go, but it could be
deprecated in NumPy contexts (in the pure Python collections, booleans
don't have a special indexing meaning anyway).

Interestingly, places that use a shape don't allow booleans (I guess
they don't necessarily use __index__()?)

>>> np.empty((True,))
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: an integer is required

Aaron Meurer

From jni at fastmail.com Thu Aug 13 23:00:50 2020
From: jni at fastmail.com (Juan Nunez-Iglesias)
Date: Fri, 14 Aug 2020 13:00:50 +1000
Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions
In-Reply-To: 
References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary>
 <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net>
 <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com>
Message-ID: 

Hello everyone again!

A few clarifications about my proposal of external peer review:

- Yes, all this work is public and announced on the mailing list. However,
I don't think there's a single person in this discussion or even this whole
ecosystem that does not have a more immediately-pressing and also virtually
infinite to-do list, so it's unreasonable to expect that generally they
would do more than glance at the stuff in the mailing list.
In the peer review analogy, the mailing list is like the arXiv or Biorxiv
stream -- yep, anyone can see the stuff on there and comment, but most
people just don't have the time or attention to grab onto that. The only
reason I stopped to comment here is Sebastian's "Imma merge, YOLO!", which
had me raising my eyebrows real high. Especially for something that would
expand the NumPy API!

- So, my proposal is that there needs to be an *editor* of NEPs who takes
responsibility, once they are themselves satisfied with the NEP, for seeking
out external reviewers and pinging them individually and asking them if they
would be ok to review.

- A good friend who does screenwriting once told me, "don't use all your
proofreaders at once". You want to get feedback, improve things, then
feedback from a *totally independent* new person who can see the document
with fresh eyes.

Obviously, all of the above slows things down. But "alone we go fast,
together we go far". The point of a NEP is to document critical decisions
for the long term health of the project. If the documentation is
insufficient, it defeats the whole purpose. Might as well just implement
stuff and skip the whole NEP process. (Side note: Stephan, I for one would
definitely appreciate an update to existing NEPs if there's obvious ways
they can be improved!)

I do think that NEP templates should be strict, and I don't think that is
incompatible with plain, jargon-free text. The NEP template and guidelines
should specify that, and that the motivation should be understandable by a
casual NumPy user -- the kind described by Ilhan, for whom bare NumPy
actually meets all their needs. Maybe they've also used PyTorch but they've
never really had cause to mix them or write a program that worked with both
kinds of arrays.

Ditto for backwards compatibility -- everyone should be clear when their
existing code is going to be broken. Actually NEP18 broke so much of my
code, but its Backward compatibility section basically says all good!
https://numpy.org/neps/nep-0018-array-function-protocol.html#backward-compatibility

Anywho, as always, none of this is criticism to work done -- I thank you
all, and am eternally grateful for all the hard work everyone is doing to
keep the ecosystem from fragmenting. I'm just hoping that this discussion
can improve the process going forward! And, yes, apologies to Peter, I know
from repeated personal experience how frustrating it can be to have
last-minute drive-by objections after months of consensus building! But I
think in the end every time that happened the end result was better -- I
hope the same is true here!

And yes, I'll reiterate Ralf's point: my concerns are about the NEP process
itself rather than this one. I'll summarise my proposal:

- strict NEP template. NEPs with missing sections will not be accepted.
- sections Abstract, Motivation, and Backwards Compatibility should be
understandable at a high level by casual users with ~zero background on the
topic
- enforce the above with at least two independent rounds of coordinated peer
review.

Thank you,

Juan.

> On 14 Aug 2020, at 5:29 am, Stephan Hoyer wrote:
>
> On Thu, Aug 13, 2020 at 5:22 AM Ralf Gommers > wrote:
> Thanks for raising these concerns Ilhan and Juan, and for answering Peter. Let me give my perspective as well.
>
> To start with, this is not specifically about Peter's NEP and PR. NEP 35 simply follows the pattern set by previous PRs, and given its tight scope is less difficult to understand than other NEPs on such technical topics.
Peter has done a lot of things right, and is close to the finish line. > > > On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev > wrote: > > > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. > > This is what I intended to do in the Usage Guidance [2] section. Could > you elaborate on what more information you'd want to see there? Or is > it just a matter of reorganizing the NEP a bit to try and summarize > such things right at the top? > > We adapted the NEP template [6] several times last year to try and improve this. And specified in there as well that NEP content set to the mailing list should only contain the sections: Abstract, Motivation and Scope, Usage and Impact, and Backwards compatibility. This to ensure we fully understand the "why" and "what" before the "how". Unfortunately that template and procedure hasn't been exercised much yet, only in NEP 38 [7] and partially in NEP 41 [8]. > > If we have long-time maintainers of SciPy (Ilhan and myself), scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't understand the goals, relevance, target audience, or how they're supposed to use a new feature, that indicates that the people doing the writing and having the discussion are doing something wrong at a very fundamental level. > > At this point I'm pretty disappointed in and tired of how we write and discuss NEPs on technical topics like dispatching, dtypes and the like. People literally refuse to write down concrete motivations, goals and non-goals, code that's problematic now and will be better/working post-NEP and usage examples before launching into extensive discussion of the gory details of the internals. I'm not sure what to do about it. Completely separate API and behavior proposals from implementation proposals? Make separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo on the API team which then needs to approve every API change in new NEPs? Offer to co-write NEPs if someone is willing but doesn't understand how to go about it? Keep the current structure/process but veto further approvals until NEP authors get it right? > > I think the NEP template is great, and we should try to be more diligent about following it! > > My own NEP 37 (__array_module__) is probably a good example of poor presentation due to not following the template structure. It goes pretty deep into low-level motivation and some implementation details before usage examples. > > Speaking just for myself, I would have appreciated a friendly nudge to use the template. Certainly I think it would be fine to require using the template for newly submitted NEPs. I did not remember about it when I started drafting NEP 37, and it definitely would have helped. I may still try to do a revision at some point to use the template structure. > > I want to make an exception for merging the current NEP, for which the plan is to merge it as experimental to try in downstream PRs and get more experience. That does mean that master will be in an unreleasable state by the way, which is unusual and it'd be nice to get Chuck's explicit OK for that. But after that, I think we need a change here. I would like to hear what everyone thinks is the shape that change should take - any of my above suggestions, or something else? 
> > > > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. > > TBH, I don't really know how to solve that point, so if you have any > specific suggestions, that's certainly welcome. I understand the > frustration for a reader trying to understand all the details, with > many being only described in NEP-18 [3], but we also strive to avoid > rewriting things that are written elsewhere, which would also > overburden those who are aware of what's being discussed. > > > > I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. > > Some variant of this proposal would be my preference. > > Cheers, > Ralf > > > [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 > [2] https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance > [3] https://numpy.org/neps/nep-0018-array-function-protocol.html > [4] https://numpy.org/neps/nep-0000.html#nep-workflow > [5] https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html > > [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst > [7] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst > [8] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst > > > > > > On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias > wrote: > > > > I?ve generally been on the ?let the NumPy devs worry about it? side of things, but I do agree with Ilhan that `like=` is confusing and `typeof=` would be a much more appropriate name for that parameter. > > > > I do think library writers are NumPy users and so I wouldn?t really make that distinction, though. Users writing their own analysis code could very well be interested in writing code using numpy functions that will transparently work when the input is a CuPy array or whatever. > > > > I also share Ilhan?s concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers. > > > > Food for thought. > > > > Juan. > > > > On 13 Aug 2020, at 9:24 am, Ilhan Polat > wrote: > > > > For what is worth, as a potential consumer in SciPy, it really doesn't say anything (both in NEP and the PR) about how the regular users of NumPy will benefit from this. If only and only 3rd parties are going to benefit from it, I am not sure adding a new keyword to an already confusing function is the right thing to do. 
> > > > Let me clarify, > > > > - This is already a very (I mean extremely very) easy keyword name to confuse with ones_like, zeros_like and by its nature any other interpretation. It is not signalling anything about the functionality that is being discussed. I would seriously consider reserving such obvious names for really obvious tasks. Because you would also expect the shape and ndim would be mimicked by the "like"d argument but it turns out it is acting more like "typeof=" and not "like=" at all. Because if we follow the semantics it reads as "make your argument asarray like the other thing" but it is actually doing, "make your argument an array with the other thing's type" which might not be an array after all. > > > > - Again, if this is meant for downstream libraries (because that's what I got out of the PR discussion, cupy, dask, and JAX were the only examples I could read) then hiding it in another function and writing with capital letters "this is not meant for numpy users" would be a much more convenient way to separate the target audience and regular users. numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may be would be quite clean and to the point with no ambiguous keywords. > > > > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. > > > > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From peter at entschev.com Fri Aug 14 07:21:00 2020 From: peter at entschev.com (Peter Andreas Entschev) Date: Fri, 14 Aug 2020 13:21:00 +0200 Subject: [Numpy-discussion] NEP Procedure Discussion Message-ID: Hi all, During the discussion about NEP-35, there have been lots of discussions around the NEP process itself. In the interest of allowing people who are mostly interested in this discussion and to avoid drifting so much off-topic in that thread, I'm starting this new thread to discuss the NEP procedure. A few questions that have been raised so far: - Is the NEP Template [1] a guideline to be strictly followed or a suggestion for authors? - Who should decide when a NEP is sufficiently clear? - Should a NEP PR be merged at all until it's sufficiently clear or should it only be merged even in Draft state only after it's sufficiently clear? - What parts of the NEP are necessary to be clear for everyone? Just Abstract? Motivation and Scope? Everything, including the real technical details of implementation? - Would it be possible to have proof-readers -- preferably people who are not at all involved in the NEP's topic? Please feel free to comment on that and add any major points I might have missed. 
Best, Peter [1] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst From peter at entschev.com Fri Aug 14 07:21:17 2020 From: peter at entschev.com (Peter Andreas Entschev) Date: Fri, 14 Aug 2020 13:21:17 +0200 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: Hi all, This thread has IMO drifted very far from its original purpose, due to that I decided to start a new thread specifically for the general NEP procedure discussed, please check your mail for "NEP Procedure Discussion" subject. On the topic of this thread, I'll try to rewrite NEP-35 to make it more accessible and ping back here once I have a PR for that. Is there anything else that's pressing here? If there is and I missed/forgot about it, please let me know. Best, Peter On Fri, Aug 14, 2020 at 5:00 AM Juan Nunez-Iglesias wrote: > Hello everyone again! > > A few clarifications about my proposal of external peer review: > > - Yes, all this work is public and announced on the mailing list. However, > I don?t think there?s a single person in this discussion or even this whole > ecosystem that does not have a more immediately-pressing and also virtually > infinite to-do list, so it?s unreasonable to expect that generally they > would do more than glance at the stuff in the mailing list. In the peer > review analogy, the mailing list is like the arXiv or Biorxiv stream ? yep, > anyone can see the stuff on there and comment, but most people just don?t > have the time or attention to grab onto that. The only reason I stopped to > comment here is Sebastian?s ?Imma merge, YOLO!?, which had me raising my > eyebrows real high. ? Especially for something that would expand the NumPy > API! > > - So, my proposal is that there needs to be an *editor* of NEPs who takes > responsibility, once they are themselves satisfied with the NEP, for > seeking out external reviewers and pinging them individually and asking > them if they would be ok to review. > > - A good friend who does screenwriting once told me, ?don?t use all your > proofreaders at once?. You want to get feedback, improve things, then > feedback from a *totally independent* new person who can see the document > with fresh eyes. > > Obviously, all of the above slows things down. But ?alone we go fast, > together we go far?. The point of a NEP is to document critical decisions > for the long term health of the project. If the documentation is > insufficient, it defeats the whole purpose. Might as well just implement > stuff and skip the whole NEP process. (Side note: Stephan, I for one would > definitely appreciate an update to existing NEPs if there?s obvious ways > they can be improved!) > > I do think that NEP templates should be strict, and I don?t think that is > incompatible with plain, jargon-free text. The NEP template and guidelines > should specify that, and that the motivation should be understandable by a > casual NumPy user ? the kind described by Ilhan, for whom bare NumPy > actually meets all their needs. Maybe they?ve also used PyTorch but they?ve > never really had cause to mix them or write a program that worked with both > kinds of arrays. > > Ditto for backwards compatibility ? everyone should be clear when their > existing code is going to be broken. 
Actually NEP18 broke so much of my > code, but its Backward compatibility section basically says all good! > https://numpy.org/neps/nep-0018-array-function-protocol.html#backward-compatibility > > > Anywho, as always, none of this is criticism to work done ? I thank you > all, and am eternally grateful for all the hard work everyone is doing to > keep the ecosystem from fragmenting. I?m just hoping that this discussion > can improve the process going forward! > > And, yes, apologies to Peter, I know from repeated personal experience how > frustrating it can be to have last-minute drive-by objections after months > of consensus building! But I think in the end every time that happened the > end result was better ? I hope the same is true here! And yes, I?ll > reiterate Ralf?s point: my concerns are about the NEP process itself rather > than this one. I?ll summarise my proposal: > > - strict NEP template. NEPs with missing sections will not be accepted. > - sections Abstract, Motivation, and Backwards Compatibility should be > understandable at a high level by casual users with ~zero background on the > topic > - enforce the above with at least two independent rounds of coordinated > peer review. > > Thank you, > > Juan. > > On 14 Aug 2020, at 5:29 am, Stephan Hoyer wrote: > > On Thu, Aug 13, 2020 at 5:22 AM Ralf Gommers > wrote: > >> Thanks for raising these concerns Ilhan and Juan, and for answering >> Peter. Let me give my perspective as well. >> >> To start with, this is not specifically about Peter's NEP and PR. NEP 35 >> simply follows the pattern set by previous PRs, and given its tight scope >> is less difficult to understand than other NEPs on such technical topics. >> Peter has done a lot of things right, and is close to the finish line. >> >> >> On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev < >> peter at entschev.com> wrote: >> >>> >>> > I think, arriving to an agreement would be much faster if there is an >>> executive summary of who this is intended for and what the regular usage >>> is. Because with no offense, all I see is "dispatch", "_array_function_" >>> and a lot of technical details of which I am absolutely ignorant. >>> >>> This is what I intended to do in the Usage Guidance [2] section. Could >>> you elaborate on what more information you'd want to see there? Or is >>> it just a matter of reorganizing the NEP a bit to try and summarize >>> such things right at the top? >>> >> >> We adapted the NEP template [6] several times last year to try and >> improve this. And specified in there as well that NEP content set to the >> mailing list should only contain the sections: Abstract, Motivation and >> Scope, Usage and Impact, and Backwards compatibility. This to ensure we >> fully understand the "why" and "what" before the "how". Unfortunately that >> template and procedure hasn't been exercised much yet, only in NEP 38 [7] >> and partially in NEP 41 [8]. >> >> If we have long-time maintainers of SciPy (Ilhan and myself), >> scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't >> understand the goals, relevance, target audience, or how they're supposed >> to use a new feature, that indicates that the people doing the writing and >> having the discussion are doing something wrong at a very fundamental level. >> >> >> At this point I'm pretty disappointed in and tired of how we write and >> discuss NEPs on technical topics like dispatching, dtypes and the like. 
>> People literally refuse to write down concrete motivations, goals and >> non-goals, code that's problematic now and will be better/working post-NEP >> and usage examples before launching into extensive discussion of the gory >> details of the internals. I'm not sure what to do about it. Completely >> separate API and behavior proposals from implementation proposals? Make >> separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo >> on the API team which then needs to approve every API change in new NEPs? >> Offer to co-write NEPs if someone is willing but doesn't understand how to >> go about it? Keep the current structure/process but veto further approvals >> until NEP authors get it right? >> > > I think the NEP template is great, and we should try to be more diligent > about following it! > > My own NEP 37 (__array_module__) is probably a good example of poor > presentation due to not following the template structure. It goes pretty > deep into low-level motivation and some implementation details before usage > examples. > > Speaking just for myself, I would have appreciated a friendly nudge to use > the template. Certainly I think it would be fine to require using the > template for newly submitted NEPs. I did not remember about it when I > started drafting NEP 37, and it definitely would have helped. I may still > try to do a revision at some point to use the template structure. > > >> I want to make an exception for merging the current NEP, for which the >> plan is to merge it as experimental to try in downstream PRs and get more >> experience. That does mean that master will be in an unreleasable state by >> the way, which is unusual and it'd be nice to get Chuck's explicit OK for >> that. But after that, I think we need a change here. I would like to hear >> what everyone thinks is the shape that change should take - any of my above >> suggestions, or something else? >> >> >> >>> > Finally as a minor point, I know we are mostly (ex-)academics but this >>> necessity of formal language on NEPs is self-imposed (probably PEPs are to >>> blame) and not quite helping. It can be a bit more descriptive in my >>> external opinion. >>> >>> TBH, I don't really know how to solve that point, so if you have any >>> specific suggestions, that's certainly welcome. I understand the >>> frustration for a reader trying to understand all the details, with >>> many being only described in NEP-18 [3], but we also strive to avoid >>> rewriting things that are written elsewhere, which would also >>> overburden those who are aware of what's being discussed. >>> >>> >>> > I also share Ilhan?s concern (and I mentioned this in a previous NEP >>> discussion) that NEPs are getting pretty inaccessible. In a sense these are >>> difficult topics and readers should be expected to have *some* familiarity >>> with the topics being discussed, but perhaps more effort should be put into >>> the context/motivation/background of a NEP before accepting it. One way to >>> ensure this might be to require a final proofreading step by someone who >>> has not been involved at all in the discussions, like peer review does for >>> papers. >>> >> >> Some variant of this proposal would be my preference. 
>> >> Cheers, >> Ralf >> >> >>> [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 >>> [2] >>> https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance >>> [3] https://numpy.org/neps/nep-0018-array-function-protocol.html >>> [4] https://numpy.org/neps/nep-0000.html#nep-workflow >>> [5] >>> https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html >> >> >> [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst >> [7] >> https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst >> [8] >> https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst >> >> >> >>> >>> >>> On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias >>> wrote: >>> > >>> > I?ve generally been on the ?let the NumPy devs worry about it? side of >>> things, but I do agree with Ilhan that `like=` is confusing and `typeof=` >>> would be a much more appropriate name for that parameter. >>> > >>> > I do think library writers are NumPy users and so I wouldn?t really >>> make that distinction, though. Users writing their own analysis code could >>> very well be interested in writing code using numpy functions that will >>> transparently work when the input is a CuPy array or whatever. >>> > >>> > I also share Ilhan?s concern (and I mentioned this in a previous NEP >>> discussion) that NEPs are getting pretty inaccessible. In a sense these are >>> difficult topics and readers should be expected to have *some* familiarity >>> with the topics being discussed, but perhaps more effort should be put into >>> the context/motivation/background of a NEP before accepting it. One way to >>> ensure this might be to require a final proofreading step by someone who >>> has not been involved at all in the discussions, like peer review does for >>> papers. >>> > >>> > Food for thought. >>> > >>> > Juan. >>> > >>> > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: >>> > >>> > For what is worth, as a potential consumer in SciPy, it really doesn't >>> say anything (both in NEP and the PR) about how the regular users of NumPy >>> will benefit from this. If only and only 3rd parties are going to benefit >>> from it, I am not sure adding a new keyword to an already confusing >>> function is the right thing to do. >>> > >>> > Let me clarify, >>> > >>> > - This is already a very (I mean extremely very) easy keyword name to >>> confuse with ones_like, zeros_like and by its nature any other >>> interpretation. It is not signalling anything about the functionality that >>> is being discussed. I would seriously consider reserving such obvious names >>> for really obvious tasks. Because you would also expect the shape and ndim >>> would be mimicked by the "like"d argument but it turns out it is acting >>> more like "typeof=" and not "like=" at all. Because if we follow the >>> semantics it reads as "make your argument asarray like the other thing" but >>> it is actually doing, "make your argument an array with the other thing's >>> type" which might not be an array after all. >>> > >>> > - Again, if this is meant for downstream libraries (because that's >>> what I got out of the PR discussion, cupy, dask, and JAX were the only >>> examples I could read) then hiding it in another function and writing with >>> capital letters "this is not meant for numpy users" would be a much more >>> convenient way to separate the target audience and regular users. 
>>> numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may >>> be would be quite clean and to the point with no ambiguous keywords. >>> > >>> > I think, arriving to an agreement would be much faster if there is an >>> executive summary of who this is intended for and what the regular usage >>> is. Because with no offense, all I see is "dispatch", "_array_function_" >>> and a lot of technical details of which I am absolutely ignorant. >>> > >>> > Finally as a minor point, I know we are mostly (ex-)academics but this >>> necessity of formal language on NEPs is self-imposed (probably PEPs are to >>> blame) and not quite helping. It can be a bit more descriptive in my >>> external opinion. >>> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ilhanpolat at gmail.com Fri Aug 14 08:35:53 2020 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Fri, 14 Aug 2020 14:35:53 +0200 Subject: [Numpy-discussion] NEP Procedure Discussion In-Reply-To: References: Message-ID: Also, not to be a complete slacker, I'd like to add to this list; - How can I help as an external lib maintainer? - Do you even want us to get involved before the final draft? Or wait until internal discussion finishes? On Fri, Aug 14, 2020 at 1:23 PM Peter Andreas Entschev wrote: > Hi all, > > During the discussion about NEP-35, there have been lots of > discussions around the NEP process itself. In the interest of allowing > people who are mostly interested in this discussion and to avoid > drifting so much off-topic in that thread, I'm starting this new > thread to discuss the NEP procedure. > > A few questions that have been raised so far: > > - Is the NEP Template [1] a guideline to be strictly followed or a > suggestion for authors? > - Who should decide when a NEP is sufficiently clear? > - Should a NEP PR be merged at all until it's sufficiently clear or > should it only be merged even in Draft state only after it's > sufficiently clear? > - What parts of the NEP are necessary to be clear for everyone? Just > Abstract? Motivation and Scope? Everything, including the real > technical details of implementation? > - Would it be possible to have proof-readers -- preferably people who > are not at all involved in the NEP's topic? > > Please feel free to comment on that and add any major points I might > have missed. > > Best, > Peter > > [1] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: 

From adrin.jalali at gmail.com  Fri Aug 14 10:04:36 2020
From: adrin.jalali at gmail.com (Adrin)
Date: Fri, 14 Aug 2020 16:04:36 +0200
Subject: [Numpy-discussion] NEP Procedure Discussion
In-Reply-To: 
References: 
Message-ID: 

Somewhat relevant, this is the discussion around the same topic we've
been having in scikit-learn:
https://github.com/scikit-learn/enhancement_proposals/pull/30

On Fri, Aug 14, 2020 at 2:36 PM Ilhan Polat wrote:

> Also, not to be a complete slacker, I'd like to add to this list;
>
> - How can I help as an external lib maintainer?
> - Do you even want us to get involved before the final draft? Or wait
> until internal discussion finishes?
>
> On Fri, Aug 14, 2020 at 1:23 PM Peter Andreas Entschev wrote:
>
>> Hi all,
>>
>> During the discussion about NEP-35, there have been lots of
>> discussions around the NEP process itself. In the interest of allowing
>> people who are mostly interested in this discussion and to avoid
>> drifting so much off-topic in that thread, I'm starting this new
>> thread to discuss the NEP procedure.
>>
>> A few questions that have been raised so far:
>>
>> - Is the NEP Template [1] a guideline to be strictly followed or a
>> suggestion for authors?
>> - Who should decide when a NEP is sufficiently clear?
>> - Should a NEP PR be merged at all until it's sufficiently clear or
>> should it only be merged even in Draft state only after it's
>> sufficiently clear?
>> - What parts of the NEP are necessary to be clear for everyone? Just
>> Abstract? Motivation and Scope? Everything, including the real
>> technical details of implementation?
>> - Would it be possible to have proof-readers -- preferably people who
>> are not at all involved in the NEP's topic?
>>
>> Please feel free to comment on that and add any major points I might
>> have missed.
>>
>> Best,
>> Peter
>>
>> [1] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at python.org
>> https://mail.python.org/mailman/listinfo/numpy-discussion
>>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From sebastian at sipsolutions.net  Fri Aug 14 10:09:39 2020
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Fri, 14 Aug 2020 09:09:39 -0500
Subject: [Numpy-discussion] Use of booleans in slices
In-Reply-To: 
References: 
Message-ID: <733a94eb028de2cd1a259a4a6e32252c90719e60.camel@sipsolutions.net>

This is because slicing with a boolean has no alternative meaning I can
think of that it could be confused with [1]. NumPy even used to reject
it, but there seems to be no reason to add maintenance/code complexity
(i.e. duplicating checks Python already provides) just to reject bools.

There used to be a reason to use `__index__()` in slices, because Python
did not support it. But now Python has caught up.

- Sebastian

[1] I have never seen anyone index with a bool, and I have no conception
of what `arr[masked:not_masked]` would mean. There is not a small step
from `arr[True]` to `arr[True:]`.
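To illustrate the point, a small sketch (assuming NumPy 1.19-era
behaviour, where `np.bool_.__index__()` is deprecated; the `as_index`
helper is purely illustrative and not part of NumPy):

import operator
import warnings

import numpy as np

a = np.arange(10)

# A plain Python bool implements __index__(), so it is silently accepted
# as a slice bound: a[True:] behaves exactly like a[1:].
print(True.__index__())   # 1
print(a[True:])           # [1 2 3 4 5 6 7 8 9]

# A NumPy bool goes through np.bool_.__index__() instead, which should
# emit the DeprecationWarning quoted below.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    a[np.True_:]
print([str(w.message) for w in caught])

# A defensive helper for code that wants to reject bools explicitly
# before treating a value as an integer index.
def as_index(value):
    if isinstance(value, (bool, np.bool_)):
        raise TypeError("booleans are not valid indices or slice bounds here")
    return operator.index(value)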
On Thu, 2020-08-13 at 15:14 -0600, Aaron Meurer wrote: > I noticed that np.bool_.__index__() gives a DeprecationWarning > > > > > np.bool_(True).__index__() > __main__:1: DeprecationWarning: In future, it will be an error for > 'np.bool_' scalars to be interpreted as an index > 1 > > This is good, because booleans don't actually act like integers in > indexing contexts. However, raw Python bools also allow __index__() > > > > > True.__index__() > 1 > > A consequence of this is that NumPy slices allow booleans, as long as > they are the Python type (if you use the NumPy bool_ type you get the > deprecation warning). > > > > > a = np.arange(10) > > > > a[True:] > array([1, 2, 3, 4, 5, 6, 7, 8, 9]) > > Should this behavior also be considered deprecated? Presumably > deprecating bool.__index__() in Python is a no-go, but it could be > deprecated in NumPy contexts (in the pure Python collections, > booleans > don't have a special indexing meaning anyway). > > Interestingly, places that use a shape don't allow booleans (I guess > they don't necessarily use __index__()?) > > > > > np.empty((True,)) > Traceback (most recent call last): > File "", line 1, in > TypeError: an integer is required > > Aaron Meurer > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From melissawm at gmail.com Fri Aug 14 17:20:04 2020 From: melissawm at gmail.com (=?UTF-8?Q?Melissa_Mendon=C3=A7a?=) Date: Fri, 14 Aug 2020 18:20:04 -0300 Subject: [Numpy-discussion] Documentation Team meeting - Monday August 17 In-Reply-To: References: Message-ID: Hi all! This is a reminder that our next Documentation Team meeting will be on *Monday, August 17* at 3PM UTC**. If you wish to join on Zoom, you need to use this link https://zoom.us/j/420005230 Here's the permanent hackmd document with the meeting notes (still being updated in the next few days!): https://hackmd.io/oB_boakvRqKR-_2jRV-Qjg Hope to see you around! ** You can click this link to get the correct time at your timezone: https://www.timeanddate.com/worldclock/fixedtime.html?msg=NumPy+Documentation+Team+Meeting&iso=20200817T15&p1=1440&ah=1 *** You can add the NumPy community calendar to your google calendar by clicking this link: https://calendar.google.com/calendar/r?cid=YmVya2VsZXkuZWR1X2lla2dwaWdtMjMyamJobGRzZmIyYzJqODFjQGdyb3VwLmNhbGVuZGFyLmdvb2dsZS5jb20 - Melissa -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Aug 16 07:41:08 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 16 Aug 2020 12:41:08 +0100 Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions In-Reply-To: References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com> Message-ID: On Fri, Aug 14, 2020 at 12:23 PM Peter Andreas Entschev wrote: > Hi all, > > This thread has IMO drifted very far from its original purpose, due to > that I decided to start a new thread specifically for the general NEP > procedure discussed, please check your mail for "NEP Procedure Discussion" > subject. > Thanks Peter. 
For future reference: better to just edit the thread subject, but not start over completely - people want to reply to previous content. I will copy over comments I'd like to reply to to the other thread by hand now. > On the topic of this thread, I'll try to rewrite NEP-35 to make it more > accessible and ping back here once I have a PR for that. > Thanks! Cheers, Ralf Is there anything else that's pressing here? If there is and I > missed/forgot about it, please let me know. > > Best, > Peter > > On Fri, Aug 14, 2020 at 5:00 AM Juan Nunez-Iglesias > wrote: > >> Hello everyone again! >> >> A few clarifications about my proposal of external peer review: >> >> - Yes, all this work is public and announced on the mailing list. >> However, I don?t think there?s a single person in this discussion or even >> this whole ecosystem that does not have a more immediately-pressing and >> also virtually infinite to-do list, so it?s unreasonable to expect that >> generally they would do more than glance at the stuff in the mailing list. >> In the peer review analogy, the mailing list is like the arXiv or Biorxiv >> stream ? yep, anyone can see the stuff on there and comment, but most >> people just don?t have the time or attention to grab onto that. The only >> reason I stopped to comment here is Sebastian?s ?Imma merge, YOLO!?, which >> had me raising my eyebrows real high. ? Especially for something that >> would expand the NumPy API! >> >> - So, my proposal is that there needs to be an *editor* of NEPs who takes >> responsibility, once they are themselves satisfied with the NEP, for >> seeking out external reviewers and pinging them individually and asking >> them if they would be ok to review. >> >> - A good friend who does screenwriting once told me, ?don?t use all your >> proofreaders at once?. You want to get feedback, improve things, then >> feedback from a *totally independent* new person who can see the document >> with fresh eyes. >> >> Obviously, all of the above slows things down. But ?alone we go fast, >> together we go far?. The point of a NEP is to document critical decisions >> for the long term health of the project. If the documentation is >> insufficient, it defeats the whole purpose. Might as well just implement >> stuff and skip the whole NEP process. (Side note: Stephan, I for one would >> definitely appreciate an update to existing NEPs if there?s obvious ways >> they can be improved!) >> >> I do think that NEP templates should be strict, and I don?t think that is >> incompatible with plain, jargon-free text. The NEP template and guidelines >> should specify that, and that the motivation should be understandable by a >> casual NumPy user ? the kind described by Ilhan, for whom bare NumPy >> actually meets all their needs. Maybe they?ve also used PyTorch but they?ve >> never really had cause to mix them or write a program that worked with both >> kinds of arrays. >> >> Ditto for backwards compatibility ? everyone should be clear when their >> existing code is going to be broken. Actually NEP18 broke so much of my >> code, but its Backward compatibility section basically says all good! >> https://numpy.org/neps/nep-0018-array-function-protocol.html#backward-compatibility >> >> >> Anywho, as always, none of this is criticism to work done ? I thank you >> all, and am eternally grateful for all the hard work everyone is doing to >> keep the ecosystem from fragmenting. I?m just hoping that this discussion >> can improve the process going forward! 
>> >> And, yes, apologies to Peter, I know from repeated personal experience >> how frustrating it can be to have last-minute drive-by objections after >> months of consensus building! But I think in the end every time that >> happened the end result was better ? I hope the same is true here! And yes, >> I?ll reiterate Ralf?s point: my concerns are about the NEP process itself >> rather than this one. I?ll summarise my proposal: >> >> - strict NEP template. NEPs with missing sections will not be accepted. >> - sections Abstract, Motivation, and Backwards Compatibility should be >> understandable at a high level by casual users with ~zero background on the >> topic >> - enforce the above with at least two independent rounds of coordinated >> peer review. >> >> Thank you, >> >> Juan. >> >> On 14 Aug 2020, at 5:29 am, Stephan Hoyer wrote: >> >> On Thu, Aug 13, 2020 at 5:22 AM Ralf Gommers >> wrote: >> >>> Thanks for raising these concerns Ilhan and Juan, and for answering >>> Peter. Let me give my perspective as well. >>> >>> To start with, this is not specifically about Peter's NEP and PR. NEP 35 >>> simply follows the pattern set by previous PRs, and given its tight scope >>> is less difficult to understand than other NEPs on such technical topics. >>> Peter has done a lot of things right, and is close to the finish line. >>> >>> >>> On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev < >>> peter at entschev.com> wrote: >>> >>>> >>>> > I think, arriving to an agreement would be much faster if there is an >>>> executive summary of who this is intended for and what the regular usage >>>> is. Because with no offense, all I see is "dispatch", "_array_function_" >>>> and a lot of technical details of which I am absolutely ignorant. >>>> >>>> This is what I intended to do in the Usage Guidance [2] section. Could >>>> you elaborate on what more information you'd want to see there? Or is >>>> it just a matter of reorganizing the NEP a bit to try and summarize >>>> such things right at the top? >>>> >>> >>> We adapted the NEP template [6] several times last year to try and >>> improve this. And specified in there as well that NEP content set to the >>> mailing list should only contain the sections: Abstract, Motivation and >>> Scope, Usage and Impact, and Backwards compatibility. This to ensure we >>> fully understand the "why" and "what" before the "how". Unfortunately that >>> template and procedure hasn't been exercised much yet, only in NEP 38 [7] >>> and partially in NEP 41 [8]. >>> >>> If we have long-time maintainers of SciPy (Ilhan and myself), >>> scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't >>> understand the goals, relevance, target audience, or how they're supposed >>> to use a new feature, that indicates that the people doing the writing and >>> having the discussion are doing something wrong at a very fundamental level. >>> >>> >>> At this point I'm pretty disappointed in and tired of how we write and >>> discuss NEPs on technical topics like dispatching, dtypes and the like. >>> People literally refuse to write down concrete motivations, goals and >>> non-goals, code that's problematic now and will be better/working post-NEP >>> and usage examples before launching into extensive discussion of the gory >>> details of the internals. I'm not sure what to do about it. Completely >>> separate API and behavior proposals from implementation proposals? 
Make >>> separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo >>> on the API team which then needs to approve every API change in new NEPs? >>> Offer to co-write NEPs if someone is willing but doesn't understand how to >>> go about it? Keep the current structure/process but veto further approvals >>> until NEP authors get it right? >>> >> >> I think the NEP template is great, and we should try to be more diligent >> about following it! >> >> My own NEP 37 (__array_module__) is probably a good example of poor >> presentation due to not following the template structure. It goes pretty >> deep into low-level motivation and some implementation details before usage >> examples. >> >> Speaking just for myself, I would have appreciated a friendly nudge to >> use the template. Certainly I think it would be fine to require using the >> template for newly submitted NEPs. I did not remember about it when I >> started drafting NEP 37, and it definitely would have helped. I may still >> try to do a revision at some point to use the template structure. >> >> >>> I want to make an exception for merging the current NEP, for which the >>> plan is to merge it as experimental to try in downstream PRs and get more >>> experience. That does mean that master will be in an unreleasable state by >>> the way, which is unusual and it'd be nice to get Chuck's explicit OK for >>> that. But after that, I think we need a change here. I would like to hear >>> what everyone thinks is the shape that change should take - any of my above >>> suggestions, or something else? >>> >>> >>> >>>> > Finally as a minor point, I know we are mostly (ex-)academics but >>>> this necessity of formal language on NEPs is self-imposed (probably PEPs >>>> are to blame) and not quite helping. It can be a bit more descriptive in my >>>> external opinion. >>>> >>>> TBH, I don't really know how to solve that point, so if you have any >>>> specific suggestions, that's certainly welcome. I understand the >>>> frustration for a reader trying to understand all the details, with >>>> many being only described in NEP-18 [3], but we also strive to avoid >>>> rewriting things that are written elsewhere, which would also >>>> overburden those who are aware of what's being discussed. >>>> >>>> >>>> > I also share Ilhan?s concern (and I mentioned this in a previous NEP >>>> discussion) that NEPs are getting pretty inaccessible. In a sense these are >>>> difficult topics and readers should be expected to have *some* familiarity >>>> with the topics being discussed, but perhaps more effort should be put into >>>> the context/motivation/background of a NEP before accepting it. One way to >>>> ensure this might be to require a final proofreading step by someone who >>>> has not been involved at all in the discussions, like peer review does for >>>> papers. >>>> >>> >>> Some variant of this proposal would be my preference. 
>>> >>> Cheers, >>> Ralf >>> >>> >>>> [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572 >>>> [2] >>>> https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance >>>> [3] https://numpy.org/neps/nep-0018-array-function-protocol.html >>>> [4] https://numpy.org/neps/nep-0000.html#nep-workflow >>>> [5] >>>> https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html >>> >>> >>> [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst >>> [7] >>> https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst >>> [8] >>> https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst >>> >>> >>> >>>> >>>> >>>> On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias >>>> wrote: >>>> > >>>> > I?ve generally been on the ?let the NumPy devs worry about it? side >>>> of things, but I do agree with Ilhan that `like=` is confusing and >>>> `typeof=` would be a much more appropriate name for that parameter. >>>> > >>>> > I do think library writers are NumPy users and so I wouldn?t really >>>> make that distinction, though. Users writing their own analysis code could >>>> very well be interested in writing code using numpy functions that will >>>> transparently work when the input is a CuPy array or whatever. >>>> > >>>> > I also share Ilhan?s concern (and I mentioned this in a previous NEP >>>> discussion) that NEPs are getting pretty inaccessible. In a sense these are >>>> difficult topics and readers should be expected to have *some* familiarity >>>> with the topics being discussed, but perhaps more effort should be put into >>>> the context/motivation/background of a NEP before accepting it. One way to >>>> ensure this might be to require a final proofreading step by someone who >>>> has not been involved at all in the discussions, like peer review does for >>>> papers. >>>> > >>>> > Food for thought. >>>> > >>>> > Juan. >>>> > >>>> > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote: >>>> > >>>> > For what is worth, as a potential consumer in SciPy, it really >>>> doesn't say anything (both in NEP and the PR) about how the regular users >>>> of NumPy will benefit from this. If only and only 3rd parties are going to >>>> benefit from it, I am not sure adding a new keyword to an already confusing >>>> function is the right thing to do. >>>> > >>>> > Let me clarify, >>>> > >>>> > - This is already a very (I mean extremely very) easy keyword name to >>>> confuse with ones_like, zeros_like and by its nature any other >>>> interpretation. It is not signalling anything about the functionality that >>>> is being discussed. I would seriously consider reserving such obvious names >>>> for really obvious tasks. Because you would also expect the shape and ndim >>>> would be mimicked by the "like"d argument but it turns out it is acting >>>> more like "typeof=" and not "like=" at all. Because if we follow the >>>> semantics it reads as "make your argument asarray like the other thing" but >>>> it is actually doing, "make your argument an array with the other thing's >>>> type" which might not be an array after all. 
>>>> > >>>> > - Again, if this is meant for downstream libraries (because that's >>>> what I got out of the PR discussion, cupy, dask, and JAX were the only >>>> examples I could read) then hiding it in another function and writing with >>>> capital letters "this is not meant for numpy users" would be a much more >>>> convenient way to separate the target audience and regular users. >>>> numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may >>>> be would be quite clean and to the point with no ambiguous keywords. >>>> > >>>> > I think, arriving to an agreement would be much faster if there is an >>>> executive summary of who this is intended for and what the regular usage >>>> is. Because with no offense, all I see is "dispatch", "_array_function_" >>>> and a lot of technical details of which I am absolutely ignorant. >>>> > >>>> > Finally as a minor point, I know we are mostly (ex-)academics but >>>> this necessity of formal language on NEPs is self-imposed (probably PEPs >>>> are to blame) and not quite helping. It can be a bit more descriptive in my >>>> external opinion. >>>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >>> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Aug 16 08:12:23 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 16 Aug 2020 13:12:23 +0100 Subject: [Numpy-discussion] NEP Procedure Discussion In-Reply-To: References: Message-ID: On Fri, Aug 14, 2020 at 1:36 PM Ilhan Polat wrote: > Also, not to be a complete slacker, I'd like to add to this list; > > - How can I help as an external lib maintainer? > - Do you even want us to get involved before the final draft? Or wait > until internal discussion finishes? > Yes, before. The internal discussion (at least of the type that's now dominant) should come after, and in a way is less important. A NEP talks to different audiences, depending on the topic. It should start by talking to the people who are impacted by the end result of the particular proposal. Most of the time, these are end users and downstream library authors. Sometimes, for example the umath-multiarray merger, that's NumPy developers. But that's a small minority of cases. Part of the issue here is that we don't have explicit roles, like a company that develops software products would have. We do have them, they're hidden though. Typically one would have: - Customers - A product manager (technical background, marketing/sales role) - Engineering manager - Software architects (sometimes multiple layers, e.g. system architect and component architects) - Domain specialists People in these roles have conversations at different levels, and the feedback travelling up and down that chain improves the product so it's both fit for purpose and of high technical quality. 
The current NEP conversations are equivalent to domain specialists and software architects talking about implementation while assuming to fully understand customer needs. And then when a customer asks a question, telling them "we understand your problem, and see our code does multiple dispatch and is fast C code, so it will solve the problem - please wait 6 months and then you can buy the new version of our product". That's maybe exaggerated, but not by much. Especially for already established products (like NumPy) that are being improved, getting the customer's problem statement and constraints clear has to come first. There will be some iteration, e.g. once there's a prototype new constraints or additional benefits will be discovered, and that refines the outcomes. > On Fri, Aug 14, 2020 at 1:23 PM Peter Andreas Entschev > wrote: > >> Hi all, >> >> During the discussion about NEP-35, there have been lots of >> discussions around the NEP process itself. In the interest of allowing >> people who are mostly interested in this discussion and to avoid >> drifting so much off-topic in that thread, I'm starting this new >> thread to discuss the NEP procedure. >> >> A few questions that have been raised so far: >> >> - Is the NEP Template [1] a guideline to be strictly followed or a >> suggestion for authors? >> > I agree with Juan, who said "strict NEP template. NEPs with missing sections will not be accepted". - Who should decide when a NEP is sufficiently clear? >> > Juan said: "So, my proposal is that there needs to be an *editor* of NEPs who takes responsibility, once they are themselves satisfied with the NEP, for seeking out external reviewers and pinging them individually and asking them if they would be ok to review." I quite like that too. It would be great to have a pool of NEP editors, because relying on a single editor for everything would be too much for that person. This may be a place where interested downstream library authors like Ilhan and Juan can be really helpful. > - Should a NEP PR be merged at all until it's sufficiently clear or >> should it only be merged even in Draft state only after it's >> sufficiently clear? >> > I propose: merging as Draft once the sections up to Backwards Compatibility are clear enough, while implementation can still be rough but at least outlines the direction. > - What parts of the NEP are necessary to be clear for everyone? Just >> Abstract? Motivation and Scope? Everything, including the real >> technical details of implementation? >> - Would it be possible to have proof-readers -- preferably people who >> are not at all involved in the NEP's topic? >> > Juan said "enforce the above with at least two independent rounds of coordinated peer review". I'm not sure two rounds are necessary for every single NEP (e.g. the website redesign one was pretty straightforward), but for complex technical NEPs it does seem like a good idea. In many cases, doing one round of review with 1-2 people *before* submitting as a PR could be very beneficial. Cheers, Ralf >> [1] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst >> >> _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: 

From peter at entschev.com  Mon Aug 17 15:55:30 2020
From: peter at entschev.com (Peter Andreas Entschev)
Date: Mon, 17 Aug 2020 21:55:30 +0200
Subject: [Numpy-discussion] Experimental `like=` attribute for array creation functions
In-Reply-To: 
References: <7ca98625-53ea-47cd-a027-d9c902742fed@Canary> <9d9ad7a26241564ec3f14866accfe840b226e1dc.camel@sipsolutions.net> <96330BE4-1CA2-4451-8FE5-357CFA7E4EDC@fastmail.com>
Message-ID: 

As discussed, I've opened a PR
https://github.com/numpy/numpy/pull/17093 attempting to clarify some of
the writing and to follow the NEP Template. As suggested in the
template, please find below the top part of NEP-35 (up to and including
the Backward Compatibility section). Please feel free to comment and
suggest improvements or point out what may still be unclear; personally
I would prefer comments directly on the PR if possible.

============================================================
NEP 35 - Array Creation Dispatching With __array_function__
============================================================

:Author: Peter Andreas Entschev
:Status: Draft
:Type: Standards Track
:Created: 2019-10-15
:Updated: 2020-08-17
:Resolution:

Abstract
--------

We propose the introduction of a new keyword argument ``like=`` to all
array creation functions. This argument permits the creation of an array
based on a non-NumPy reference array passed via that argument, resulting
in an array defined by the downstream library that implements the
reference array's type and the ``__array_function__`` protocol. With
this we address one of that protocol's shortcomings, as described by
NEP 18 [1]_.

Motivation and Scope
--------------------

Many libraries implement the NumPy API, such as Dask for graph
computing, CuPy for GPGPU computing, xarray for N-D labeled arrays, etc.
All the libraries mentioned have yet another thing in common: they have
also adopted the ``__array_function__`` protocol. The protocol defines a
mechanism allowing a user to directly use the NumPy API as a dispatcher
based on the input array type. In essence, dispatching means users are
able to pass a downstream array, such as a Dask array, directly to one
of NumPy's compute functions, and NumPy will be able to automatically
recognize that and send the work back to Dask's implementation of that
function, which will define the return value. For example:

.. code:: python

    import numpy as np
    import dask.array

    x = dask.array.arange(5)    # Creates dask.array
    np.sum(x)                   # Returns dask.array

Note above how we called Dask's implementation of ``sum`` via the NumPy
namespace by calling ``np.sum``, and the same would apply if we had a
CuPy array or any other array from a library that adopts
``__array_function__``. This allows writing code that is agnostic to the
implementation library, thus users can write their code once and still
be able to use different array implementations according to their needs.

Unfortunately, ``__array_function__`` has limitations, one of them being
array creation functions. In the example above, NumPy was able to call
Dask's implementation because the input array was a Dask array. The same
is not true for array creation functions: in that example, the input of
``arange`` is simply the integer ``5``, which provides no information
about the array type that should result. That is where a reference array
passed via the ``like=`` argument proposed here can help, as it provides
NumPy with the information required to create the expected type of
array.
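To make that gap concrete, below is a minimal sketch of the intended
behaviour. It assumes Dask is installed and a NumPy build that already
includes the experimental ``like=`` keyword; apart from that, only
``np.arange``, ``np.sum`` and ``dask.array.arange`` from the example
above are used:

.. code:: python

    import numpy as np
    import dask.array as da

    d = da.arange(5)        # A Dask array

    # Dispatching already works when an array is an input:
    np.sum(d)               # Returns a Dask array via __array_function__

    # Array creation has no array input to dispatch on:
    np.arange(5)            # Always returns a plain np.ndarray

    # With the proposed keyword, the reference array selects the
    # implementation; `d` is only consulted for its type, never modified:
    np.arange(5, like=d)    # Expected to return a Dask array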
The new ``like=`` keyword proposed is solely intended to identify the
downstream library to dispatch to, and the object passed to it is used
only as a reference, meaning that no modifications, copies or processing
will be performed on that object. We expect that this functionality will
be mostly useful to library developers, allowing them to create new
arrays for internal usage based on arrays passed by the user, preventing
unnecessary creation of NumPy arrays that would ultimately lead to an
additional conversion into a downstream array type.

Support for Python 2.7 has been dropped since NumPy 1.17, therefore we
make use of the keyword-only argument standard described in PEP-3102
[2]_ to implement ``like=``, thus preventing it from being passed by
position.

.. _neps.like-kwarg.usage-and-impact:

Usage and Impact
----------------

To understand the intended use for ``like=``, and before we move to more
complex cases, consider the following illustrative example consisting
only of NumPy and CuPy arrays:

.. code:: python

    import numpy as np
    import cupy

    def my_pad(arr, padding):
        padding = np.array(padding, like=arr)
        return np.concatenate((padding, arr, padding))

    my_pad(np.arange(5), [-1, -1])     # Returns np.ndarray
    my_pad(cupy.arange(5), [-1, -1])   # Returns cupy.core.core.ndarray

Note in the ``my_pad`` function above how ``arr`` is used as a reference
to dictate what array type the padding should have, before concatenating
the arrays to produce the result. On the other hand, if ``like=`` wasn't
used, the NumPy case would still work, but CuPy wouldn't allow this kind
of automatic conversion, ultimately raising a
``TypeError: Only cupy arrays can be concatenated`` exception.

Now we should look at how a library like Dask could benefit from
``like=``. Before we understand that, it's important to understand a bit
about Dask basics and how it ensures correctness with
``__array_function__``. Note that Dask can compute different sorts of
objects, like dataframes, bags and arrays; here we will focus strictly
on arrays, which are the objects we can use ``__array_function__`` with.

Dask uses a graph computing model, meaning it breaks a large problem
down into many smaller problems and merges their results to reach the
final result. To break the problem down into smaller ones, Dask also
breaks arrays into smaller arrays that it calls "chunks". A Dask array
can thus consist of one or more chunks and they may be of different
types. However, in the context of ``__array_function__``, Dask only
allows chunks of the same type; for example, a Dask array can be formed
of several NumPy arrays or several CuPy arrays, but not a mix of both.

To avoid mismatched types during compute, Dask keeps an attribute
``_meta`` as part of its array throughout computation. This attribute is
used both to predict the output type at graph creation time and to
create any intermediary arrays that are necessary within some function's
computation. Going back to our previous example, we can use ``_meta``
information to identify what kind of array we would use for padding, as
seen below:
.. code:: python

    import numpy as np
    import cupy
    import dask.array as da
    from dask.array.utils import meta_from_array

    def my_pad(arr, padding):
        padding = np.array(padding, like=meta_from_array(arr))
        return np.concatenate((padding, arr, padding))

    # Returns dask.array
    my_pad(da.arange(5), [-1, -1])

    # Returns dask.array
    my_pad(da.from_array(cupy.arange(5)), [-1, -1])

Note how ``chunktype`` in the return value above changes from
``numpy.ndarray`` in the first ``my_pad`` call to ``cupy.ndarray`` in
the second. To enable proper identification of the array type we use
Dask's utility function ``meta_from_array``, which was introduced as
part of the work to support ``__array_function__``, allowing Dask to
handle ``_meta`` appropriately. That function is primarily targeted at
the library's internal usage to ensure chunks are created with correct
types. Without the ``like=`` argument, it would be impossible to ensure
``my_pad`` creates a padding array with a type matching that of the
input array, which would cause a ``TypeError`` exception to be raised by
CuPy, just as discussed above for the plain CuPy case.

Backward Compatibility
----------------------

This proposal does not raise any backward compatibility issues within
NumPy, given that it only introduces a new keyword argument to existing
array creation functions, with a default ``None`` value, thus not
changing current behavior.

On Sun, Aug 16, 2020 at 1:41 PM Ralf Gommers wrote:
>
> On Fri, Aug 14, 2020 at 12:23 PM Peter Andreas Entschev wrote:
>>
>> Hi all,
>>
>> This thread has IMO drifted very far from its original purpose, due to
>> that I decided to start a new thread specifically for the general NEP
>> procedure discussed, please check your mail for "NEP Procedure
>> Discussion" subject.
>
> Thanks Peter. For future reference: better to just edit the thread
> subject, but not start over completely - people want to reply to
> previous content. I will copy over comments I'd like to reply to to the
> other thread by hand now.
>
>> On the topic of this thread, I'll try to rewrite NEP-35 to make it
>> more accessible and ping back here once I have a PR for that.
>
> Thanks!
>
> Cheers,
> Ralf
>
>> Is there anything else that's pressing here? If there is and I
>> missed/forgot about it, please let me know.
>>
>> Best,
>> Peter
>>
>> On Fri, Aug 14, 2020 at 5:00 AM Juan Nunez-Iglesias wrote:
>>>
>>> Hello everyone again!
>>>
>>> A few clarifications about my proposal of external peer review:
>>>
>>> - Yes, all this work is public and announced on the mailing list.
>>> However, I don't think there's a single person in this discussion or
>>> even this whole ecosystem that does not have a more
>>> immediately-pressing and also virtually infinite to-do list, so it's
>>> unreasonable to expect that generally they would do more than glance
>>> at the stuff in the mailing list. In the peer review analogy, the
>>> mailing list is like the arXiv or Biorxiv stream - yep, anyone can
>>> see the stuff on there and comment, but most people just don't have
>>> the time or attention to grab onto that. The only reason I stopped to
>>> comment here is Sebastian's "Imma merge, YOLO!", which had me raising
>>> my eyebrows real high. Especially for something that would expand the
>>> NumPy API!
>>>
>>> - So, my proposal is that there needs to be an *editor* of NEPs who
>>> takes responsibility, once they are themselves satisfied with the
>>> NEP, for seeking out external reviewers and pinging them individually
>>> and asking them if they would be ok to review.
On Sun, Aug 16, 2020 at 1:41 PM Ralf Gommers wrote:
>
> On Fri, Aug 14, 2020 at 12:23 PM Peter Andreas Entschev wrote:
>>
>> Hi all,
>>
>> This thread has IMO drifted very far from its original purpose, due to that I decided to start a new thread specifically for the general NEP procedure discussed, please check your mail for "NEP Procedure Discussion" subject.
>
> Thanks Peter. For future reference: better to just edit the thread subject, but not start over completely - people want to reply to previous content. I will copy over comments I'd like to reply to to the other thread by hand now.
>
>> On the topic of this thread, I'll try to rewrite NEP-35 to make it more accessible and ping back here once I have a PR for that.
>
> Thanks!
>
> Cheers,
> Ralf
>
>> Is there anything else that's pressing here? If there is and I missed/forgot about it, please let me know.
>>
>> Best,
>> Peter
>>
>> On Fri, Aug 14, 2020 at 5:00 AM Juan Nunez-Iglesias wrote:
>>>
>>> Hello everyone again!
>>>
>>> A few clarifications about my proposal of external peer review:
>>>
>>> - Yes, all this work is public and announced on the mailing list. However, I don't think there's a single person in this discussion or even this whole ecosystem that does not have a more immediately-pressing and also virtually infinite to-do list, so it's unreasonable to expect that generally they would do more than glance at the stuff in the mailing list. In the peer review analogy, the mailing list is like the arXiv or Biorxiv stream - yep, anyone can see the stuff on there and comment, but most people just don't have the time or attention to grab onto that. The only reason I stopped to comment here is Sebastian's "Imma merge, YOLO!", which had me raising my eyebrows real high. Especially for something that would expand the NumPy API!
>>>
>>> - So, my proposal is that there needs to be an *editor* of NEPs who takes responsibility, once they are themselves satisfied with the NEP, for seeking out external reviewers and pinging them individually and asking them if they would be ok to review.
>>>
>>> - A good friend who does screenwriting once told me, "don't use all your proofreaders at once". You want to get feedback, improve things, then feedback from a *totally independent* new person who can see the document with fresh eyes.
>>>
>>> Obviously, all of the above slows things down. But "alone we go fast, together we go far". The point of a NEP is to document critical decisions for the long term health of the project. If the documentation is insufficient, it defeats the whole purpose. Might as well just implement stuff and skip the whole NEP process. (Side note: Stephan, I for one would definitely appreciate an update to existing NEPs if there's obvious ways they can be improved!)
>>>
>>> I do think that NEP templates should be strict, and I don't think that is incompatible with plain, jargon-free text. The NEP template and guidelines should specify that, and that the motivation should be understandable by a casual NumPy user - the kind described by Ilhan, for whom bare NumPy actually meets all their needs. Maybe they've also used PyTorch but they've never really had cause to mix them or write a program that worked with both kinds of arrays.
>>>
>>> Ditto for backwards compatibility - everyone should be clear when their existing code is going to be broken. Actually NEP18 broke so much of my code, but its Backward compatibility section basically says all good! https://numpy.org/neps/nep-0018-array-function-protocol.html#backward-compatibility
>>>
>>> Anywho, as always, none of this is criticism to work done - I thank you all, and am eternally grateful for all the hard work everyone is doing to keep the ecosystem from fragmenting. I'm just hoping that this discussion can improve the process going forward!
>>>
>>> And, yes, apologies to Peter, I know from repeated personal experience how frustrating it can be to have last-minute drive-by objections after months of consensus building! But I think in the end every time that happened the end result was better - I hope the same is true here! And yes, I'll reiterate Ralf's point: my concerns are about the NEP process itself rather than this one. I'll summarise my proposal:
>>>
>>> - strict NEP template. NEPs with missing sections will not be accepted.
>>> - sections Abstract, Motivation, and Backwards Compatibility should be understandable at a high level by casual users with ~zero background on the topic
>>> - enforce the above with at least two independent rounds of coordinated peer review.
>>>
>>> Thank you,
>>>
>>> Juan.
>>>
>>> On 14 Aug 2020, at 5:29 am, Stephan Hoyer wrote:
>>>
>>> On Thu, Aug 13, 2020 at 5:22 AM Ralf Gommers wrote:
>>>>
>>>> Thanks for raising these concerns Ilhan and Juan, and for answering Peter. Let me give my perspective as well.
>>>>
>>>> To start with, this is not specifically about Peter's NEP and PR. NEP 35 simply follows the pattern set by previous PRs, and given its tight scope is less difficult to understand than other NEPs on such technical topics. Peter has done a lot of things right, and is close to the finish line.
>>>>
>>>> On Thu, Aug 13, 2020 at 12:02 PM Peter Andreas Entschev wrote:
>>>>>
>>>>> > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant.
>>>>>
>>>>> This is what I intended to do in the Usage Guidance [2] section.
Could >>>>> you elaborate on what more information you'd want to see there? Or is >>>>> it just a matter of reorganizing the NEP a bit to try and summarize >>>>> such things right at the top? >>>> >>>> >>>> We adapted the NEP template [6] several times last year to try and improve this. And specified in there as well that NEP content set to the mailing list should only contain the sections: Abstract, Motivation and Scope, Usage and Impact, and Backwards compatibility. This to ensure we fully understand the "why" and "what" before the "how". Unfortunately that template and procedure hasn't been exercised much yet, only in NEP 38 [7] and partially in NEP 41 [8]. >>>> >>>> If we have long-time maintainers of SciPy (Ilhan and myself), scikit-image (Juan) and CuPy (Leo, on the PR review) all saying they don't understand the goals, relevance, target audience, or how they're supposed to use a new feature, that indicates that the people doing the writing and having the discussion are doing something wrong at a very fundamental level. >>>> >>>> At this point I'm pretty disappointed in and tired of how we write and discuss NEPs on technical topics like dispatching, dtypes and the like. People literally refuse to write down concrete motivations, goals and non-goals, code that's problematic now and will be better/working post-NEP and usage examples before launching into extensive discussion of the gory details of the internals. I'm not sure what to do about it. Completely separate API and behavior proposals from implementation proposals? Make separate "API" and "internals" teams with the likes of Juan, Ilhan and Leo on the API team which then needs to approve every API change in new NEPs? Offer to co-write NEPs if someone is willing but doesn't understand how to go about it? Keep the current structure/process but veto further approvals until NEP authors get it right? >>> >>> >>> I think the NEP template is great, and we should try to be more diligent about following it! >>> >>> My own NEP 37 (__array_module__) is probably a good example of poor presentation due to not following the template structure. It goes pretty deep into low-level motivation and some implementation details before usage examples. >>> >>> Speaking just for myself, I would have appreciated a friendly nudge to use the template. Certainly I think it would be fine to require using the template for newly submitted NEPs. I did not remember about it when I started drafting NEP 37, and it definitely would have helped. I may still try to do a revision at some point to use the template structure. >>> >>>> >>>> I want to make an exception for merging the current NEP, for which the plan is to merge it as experimental to try in downstream PRs and get more experience. That does mean that master will be in an unreleasable state by the way, which is unusual and it'd be nice to get Chuck's explicit OK for that. But after that, I think we need a change here. I would like to hear what everyone thinks is the shape that change should take - any of my above suggestions, or something else? >>>> >>>> >>>>> >>>>> > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. >>>>> >>>>> TBH, I don't really know how to solve that point, so if you have any >>>>> specific suggestions, that's certainly welcome. 
I understand the frustration for a reader trying to understand all the details, with many being only described in NEP-18 [3], but we also strive to avoid rewriting things that are written elsewhere, which would also overburden those who are aware of what's being discussed.
>>>>>
>>>>> > I also share Ilhan's concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers.
>>>>
>>>> Some variant of this proposal would be my preference.
>>>>
>>>> Cheers,
>>>> Ralf
>>>>
>>>>> [1] https://github.com/numpy/numpy/issues/14441#issuecomment-529969572
>>>>> [2] https://numpy.org/neps/nep-0035-array-creation-dispatch-with-array-function.html#usage-guidance
>>>>> [3] https://numpy.org/neps/nep-0018-array-function-protocol.html
>>>>> [4] https://numpy.org/neps/nep-0000.html#nep-workflow
>>>>> [5] https://mail.python.org/pipermail/numpy-discussion/2019-October/080176.html
>>>>
>>>> [6] https://github.com/numpy/numpy/blob/master/doc/neps/nep-template.rst
>>>> [7] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0038-SIMD-optimizations.rst
>>>> [8] https://github.com/numpy/numpy/blob/master/doc/neps/nep-0041-improved-dtype-support.rst
>>>>
>>>>> On Thu, Aug 13, 2020 at 3:44 AM Juan Nunez-Iglesias wrote:
>>>>> >
>>>>> > I've generally been on the "let the NumPy devs worry about it" side of things, but I do agree with Ilhan that `like=` is confusing and `typeof=` would be a much more appropriate name for that parameter.
>>>>> >
>>>>> > I do think library writers are NumPy users and so I wouldn't really make that distinction, though. Users writing their own analysis code could very well be interested in writing code using numpy functions that will transparently work when the input is a CuPy array or whatever.
>>>>> >
>>>>> > I also share Ilhan's concern (and I mentioned this in a previous NEP discussion) that NEPs are getting pretty inaccessible. In a sense these are difficult topics and readers should be expected to have *some* familiarity with the topics being discussed, but perhaps more effort should be put into the context/motivation/background of a NEP before accepting it. One way to ensure this might be to require a final proofreading step by someone who has not been involved at all in the discussions, like peer review does for papers.
>>>>> >
>>>>> > Food for thought.
>>>>> >
>>>>> > Juan.
>>>>> >
>>>>> > On 13 Aug 2020, at 9:24 am, Ilhan Polat wrote:
>>>>> >
>>>>> > For what is worth, as a potential consumer in SciPy, it really doesn't say anything (both in NEP and the PR) about how the regular users of NumPy will benefit from this. If only and only 3rd parties are going to benefit from it, I am not sure adding a new keyword to an already confusing function is the right thing to do.
>>>>> >
>>>>> > Let me clarify,
>>>>> >
>>>>> > - This is already a very (I mean extremely very) easy keyword name to confuse with ones_like, zeros_like and by its nature any other interpretation. It is not signalling anything about the functionality that is being discussed.
I would seriously consider reserving such obvious names for really obvious tasks. Because you would also expect the shape and ndim would be mimicked by the "like"d argument but it turns out it is acting more like "typeof=" and not "like=" at all. Because if we follow the semantics it reads as "make your argument asarray like the other thing" but it is actually doing, "make your argument an array with the other thing's type" which might not be an array after all. >>>>> > >>>>> > - Again, if this is meant for downstream libraries (because that's what I got out of the PR discussion, cupy, dask, and JAX were the only examples I could read) then hiding it in another function and writing with capital letters "this is not meant for numpy users" would be a much more convenient way to separate the target audience and regular users. numpy.astypedarray([[some data], [...]], type_of=x) or whatever else it may be would be quite clean and to the point with no ambiguous keywords. >>>>> > >>>>> > I think, arriving to an agreement would be much faster if there is an executive summary of who this is intended for and what the regular usage is. Because with no offense, all I see is "dispatch", "_array_function_" and a lot of technical details of which I am absolutely ignorant. >>>>> > >>>>> > Finally as a minor point, I know we are mostly (ex-)academics but this necessity of formal language on NEPs is self-imposed (probably PEPs are to blame) and not quite helping. It can be a bit more descriptive in my external opinion. >>>> >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at python.org >>>> https://mail.python.org/mailman/listinfo/numpy-discussion >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at python.org >>> https://mail.python.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at python.org >> https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From melissawm at gmail.com Mon Aug 17 15:59:40 2020 From: melissawm at gmail.com (=?UTF-8?Q?Melissa_Mendon=C3=A7a?=) Date: Mon, 17 Aug 2020 16:59:40 -0300 Subject: [Numpy-discussion] Announcing 2020 Google Season of Docs Technical Writers for NumPy Message-ID: Hello all, I'm pleased to announce that NumPy was awarded two slots in the Google Season of Docs program (you can see the full results here: https://developers.google.com/season-of-docs/docs/participants). The selected projects are - "NumPy Documentation for Community Education", by Ryan Cooper (Proposal: https://developers.google.com/season-of-docs/docs/participants/project-numpy-cooperrc ) - "High level restructuring and end user focus", by kubedoc (Proposal: https://developers.google.com/season-of-docs/docs/participants/project-numpy-kubedoc ) We appreciate all projects that were submitted and thank all participants for their efforts in putting together their proposals. Also, if you wish to contribute documentation to NumPy on a volunteer basis, you are welcome to do so! 
Cheers,

- Melissa
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From ralf.gommers at gmail.com Mon Aug 17 16:34:53 2020
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Mon, 17 Aug 2020 21:34:53 +0100
Subject: [Numpy-discussion] start of an array (tensor) and dataframe API standardization initiative
Message-ID: 

Hi all,

I'd like to share this announcement blog post about the creation of a consortium for array and dataframe API standardization here: https://data-apis.org/blog/announcing_the_consortium/. It's still in the beginning stages, but starting to take shape. We have participation from one or more maintainers of most array and tensor libraries - NumPy, TensorFlow, PyTorch, MXNet, Dask, JAX, Xarray. Stephan Hoyer, Travis Oliphant and myself have been providing input from a NumPy perspective.

The effort is very much related to some of the interoperability work we've been doing in NumPy (e.g. it could provide an answer to what's described in https://numpy.org/neps/nep-0037-array-module.html#requesting-restricted-subsets-of-numpy-s-api ).

At this point we're looking for feedback from maintainers at a high level (see the blog post for details). Also important: the python-record-api tooling and data in its repo has very granular API usage data, of the kind we could really use when making decisions that impact backwards compatibility.

Cheers,
Ralf
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From sebastian at sipsolutions.net Tue Aug 18 10:01:53 2020
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Tue, 18 Aug 2020 09:01:53 -0500
Subject: [Numpy-discussion] NumPy Community Meeting Wednesday
Message-ID: <3843fd878138cff21a73bfabd7dae3e44158c340.camel@sipsolutions.net>

Hi all,

There will be a NumPy Community meeting Wednesday August 19th at 1pm Pacific Time (20:00 UTC). Everyone is invited and encouraged to join in and edit the work-in-progress meeting topics and notes at: https://hackmd.io/76o-IxCjQX2mOXO_wwkcpg?both

Best wishes

Sebastian
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: This is a digitally signed message part
URL: 

From einstein.edison at gmail.com Tue Aug 18 10:14:38 2020
From: einstein.edison at gmail.com (Hameer Abbasi)
Date: Tue, 18 Aug 2020 16:14:38 +0200
Subject: [Numpy-discussion] [Release] PyData/Sparse 0.11.0
Message-ID: <6a3da074-45c5-428a-9c96-14ba4cca3704@Canary>

Hello,

I'm happy to announce the release of PyData/Sparse 0.11.0, available to download via pip and conda-forge. PyData/Sparse is a library that provides sparse N-dimensional arrays for the PyData ecosystem.

The official website and documentation is available at: https://sparse.pydata.org
The sources and bug tracker: https://github.com/pydata/sparse
The changelog for this release can be viewed at: https://sparse.pydata.org/en/0.11.0/changelog.html

Best Regards,
Hameer Abbasi
--
Sent from Canary (https://canarymail.io)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From asmeurer at gmail.com Wed Aug 19 20:07:05 2020
From: asmeurer at gmail.com (Aaron Meurer)
Date: Wed, 19 Aug 2020 18:07:05 -0600
Subject: [Numpy-discussion] What is up with raw boolean indices (like a[False])?
In-Reply-To: 
References: 
Message-ID: 

> > 3. If you have multiple advanced indexing you get annoying broadcasting
> > of all of these. That is *always* confusing for boolean indices.
> > 0-D should not be too special there...

OK, now that I am learning more about advanced indexing, this statement is confusing to me. It seems that scalar boolean indices do not broadcast. For example:

>>> np.arange(2)[False, np.array([True, False])]
array([], dtype=int64)
>>> np.arange(2)[tuple(np.broadcast_arrays(False, np.array([True, False])))]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed

And indeed, the docs even say, as you noted, "the nonzero equivalence for Boolean arrays does not hold for zero dimensional boolean arrays," which I guess also applies to the broadcasting.

From what I can tell, the logic is that all integer and boolean arrays (and scalar ints) are broadcast together, *except* for boolean scalars. Then the first boolean scalar is replaced with and(all boolean scalars) and the rest are removed from the index. Then that index adds a length 1 axis if it is True and 0 if it is False.

So they don't broadcast, but rather "fake broadcast". I still contend that it would be much more useful if True were a synonym for newaxis and False worked like newaxis but instead added a length 0 axis. Alternately, True and False scalars should behave exactly like all other boolean arrays with no exceptions (i.e., work like np.nonzero(), broadcast, etc.). This would be less useful, but more consistent.

Aaron Meurer
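A minimal sketch of the scalar-boolean behaviour described in the message above, assuming current NumPy semantics (the array values are arbitrary and chosen only for illustration):

>>> import numpy as np
>>> a = np.arange(3)
>>> a[True]                           # scalar True acts like adding a length-1 axis
array([[0, 1, 2]])
>>> a[False].shape                    # scalar False adds a length-0 axis instead
(0, 3)
>>> a[np.array([True, False, True])]  # a boolean *array* selects elements as usual
array([0, 2])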
From sebastian at sipsolutions.net Wed Aug 19 20:55:03 2020
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Wed, 19 Aug 2020 19:55:03 -0500
Subject: [Numpy-discussion] What is up with raw boolean indices (like a[False])?
In-Reply-To: 
References: 
Message-ID: <0c3ec8485726fc06dc60ee0c5d803d75c99b7e90.camel@sipsolutions.net>

On Wed, 2020-08-19 at 18:07 -0600, Aaron Meurer wrote:
> > > 3. If you have multiple advanced indexing you get annoying broadcasting
> > > of all of these. That is *always* confusing for boolean indices.
> > > 0-D should not be too special there...
>
> OK, now that I am learning more about advanced indexing, this
> statement is confusing to me. It seems that scalar boolean indices do
> not broadcast. For example:

Well, broadcasting means you broadcast the *nonzero result* unless I am very confused... There is a reason I dismissed it. We could (and arguably should) just deprecate it. And I have doubts anyone would even notice.

> >>> np.arange(2)[False, np.array([True, False])]
> array([], dtype=int64)
> >>> np.arange(2)[tuple(np.broadcast_arrays(False, np.array([True, False])))]
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
> IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed
>
> And indeed, the docs even say, as you noted, "the nonzero equivalence
> for Boolean arrays does not hold for zero dimensional boolean arrays,"
> which I guess also applies to the broadcasting.

I actually think that probably also holds. Nonzero just behaves weird for 0-D arrays (because it returns a tuple). But since broadcasting the nonzero result is so weird, and since 0-D booleans require some additional logic and don't generalize 100% (code wise), I won't rule out there are differences.

> From what I can tell, the logic is that all integer and boolean
> arrays

Did you try that? Because as I said above, IIRC broadcasting the boolean array without first calling `nonzero` isn't really whats going on. And I don't know how it could be whats going on, since adding dimensions to a boolean index would have much more implications?

- Sebastian

> (and scalar ints) are broadcast together, *except* for boolean
> scalars. Then the first boolean scalar is replaced with and(all
> boolean scalars) and the rest are removed from the index. Then that
> index adds a length 1 axis if it is True and 0 if it is False.
>
> So they don't broadcast, but rather "fake broadcast". I still contend
> that it would be much more useful, if True were a synonym for newaxis
> and False worked like newaxis but instead added a length 0 axis.
> Alternately, True and False scalars should behave exactly like all
> other boolean arrays with no exceptions (i.e., work like np.nonzero(),
> broadcast, etc.). This would be less useful, but more consistent.
>
> Aaron Meurer
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: This is a digitally signed message part
URL: 

From asmeurer at gmail.com Wed Aug 19 21:37:51 2020
From: asmeurer at gmail.com (Aaron Meurer)
Date: Wed, 19 Aug 2020 19:37:51 -0600
Subject: [Numpy-discussion] Why does fancy indexing work like this?
In-Reply-To: 
References: <66ed40caf93a7f24672ce511370ce38176a6eff1.camel@sipsolutions.net> <3f4ba7b2363e17e1d08be90f19aff38ae9860eca.camel@sipsolutions.net> <8773e52959d499e668f06d60d49a068bc6a77d43.camel@sipsolutions.net>
Message-ID: 

These cases don't give any deprecation warnings in NumPy master:

>>> np.arange(0)[np.array([0]), False]
array([], dtype=int64)
>>> np.arange(0).reshape((0, 0))[np.array([0]), np.array([], dtype=int)]
array([], dtype=int64)

Is that intentional?

Aaron Meurer

On Thu, Jul 23, 2020 at 12:18 PM Aaron Meurer wrote:
> > After writing this, I realized that I actually remember the *opposite*
> > discussion occurring before. I think in some of the equality
> > deprecations, we actually raise the new error due to an internal
> > try/except clause. And there was a complaint that its confusing that a
> > non-deprecation-warning is raised when the error will only happen with
> > DeprecationWarnings being set to error.
> >
> > - Sebastian
>
> I noticed that warnings.catch_warnings does the right thing with
> warnings that are raised alongside an exception (although it is a bit
> clunky to use).
>
> Aaron Meurer

From sebastian at sipsolutions.net Wed Aug 19 22:18:16 2020
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Wed, 19 Aug 2020 21:18:16 -0500
Subject: [Numpy-discussion] Why does fancy indexing work like this?
In-Reply-To: 
References: <66ed40caf93a7f24672ce511370ce38176a6eff1.camel@sipsolutions.net> <3f4ba7b2363e17e1d08be90f19aff38ae9860eca.camel@sipsolutions.net> <8773e52959d499e668f06d60d49a068bc6a77d43.camel@sipsolutions.net>
Message-ID: 

On Wed, 2020-08-19 at 19:37 -0600, Aaron Meurer wrote:
> These cases don't give any deprecation warnings in NumPy master:
>
> >>> np.arange(0)[np.array([0]), False]
> array([], dtype=int64)
> >>> np.arange(0).reshape((0, 0))[np.array([0]), np.array([], dtype=int)]
> array([], dtype=int64)
>
> Is that intentional?

I guess it follows from `np.array([[1]])[[], [10]]` also not failing currently. And that was intentional not to deprecate when out-of-bound indices broadcast away. But I am not sure I actually think that was the better choice. My initial choice was that this would be an error as well, and I still slightly prefer it, but don't feel it matters much.

- Sebastian
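A small sketch of the behaviour Sebastian refers to above, assuming current NumPy semantics; the in-bounds contrast case and the exact wording of the error are added here for illustration only:

>>> import numpy as np
>>> # The empty first index broadcasts the pair of indices to size 0,
>>> # so the out-of-bounds value 10 is never actually checked:
>>> np.array([[1]])[[], [10]]
array([], dtype=int64)
>>> # With a non-empty first index the bounds check does fire:
>>> np.array([[1]])[[0], [10]]
Traceback (most recent call last):
  ...
IndexError: index 10 is out of bounds for axis 1 with size 1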
> > Aaron Meurer
> >
> > On Thu, Jul 23, 2020 at 12:18 PM Aaron Meurer wrote:
> > > After writing this, I realized that I actually remember the *opposite*
> > > discussion occurring before. I think in some of the equality
> > > deprecations, we actually raise the new error due to an internal
> > > try/except clause. And there was a complaint that its confusing that a
> > > non-deprecation-warning is raised when the error will only happen with
> > > DeprecationWarnings being set to error.
> > >
> > > - Sebastian
> >
> > I noticed that warnings.catch_warnings does the right thing with
> > warnings that are raised alongside an exception (although it is a bit
> > clunky to use).
> >
> > Aaron Meurer
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: This is a digitally signed message part
URL: 

From ryan.c.cooper at uconn.edu Thu Aug 20 13:10:58 2020
From: ryan.c.cooper at uconn.edu (cooperrc)
Date: Thu, 20 Aug 2020 10:10:58 -0700 (MST)
Subject: [Numpy-discussion] Feature requests/Enhancements for upper-level engineering students
Message-ID: <1597943458736-0.post@n7.nabble.com>

Greetings,

As the Fall semester is fast approaching (10 days away for us at UConn), we are looking for senior design (also called capstone) projects for the 2020-2021 school year. The COVID situation has strengthened the need for remote work. The process here is that students are assigned to projects by late September. Then, they have 6 main deliverables over the course of 2 semesters:

1. Initial Fall Presentation (~Oct)
2. Final Fall Presentation (~Dec)
3. Mid-year report (~Jan)
4. Initial Spring Presentation (~Mar)
5. Final Spring Presentation (~Apr)
6. Final report (~May)

My question to the NumPy community is: Are there any features or enhancements that would be nice to have, but might not have a team dedicated to the idea? I would be happy to advise any projects that people are interested in proposing. I would like to hear what people think would be worthwhile for students to build together.

Some background, these students have all used Python and Matlab for mechanical engineering applications like linear regression, modal analyses, ode integration, and root solving. They learn quickly, but may not be interested in UX/UI design problems.

--
Sent from: http://numpy-discussion.10968.n7.nabble.com/

From asmeurer at gmail.com Thu Aug 20 14:21:50 2020
From: asmeurer at gmail.com (Aaron Meurer)
Date: Thu, 20 Aug 2020 12:21:50 -0600
Subject: [Numpy-discussion] What is up with raw boolean indices (like a[False])?
In-Reply-To: <0c3ec8485726fc06dc60ee0c5d803d75c99b7e90.camel@sipsolutions.net>
References: <0c3ec8485726fc06dc60ee0c5d803d75c99b7e90.camel@sipsolutions.net>
Message-ID: 

You're right. I was confusing the broadcasting logic for boolean arrays.
However, I did find this example

>>> np.arange(10).reshape((2, 5))[np.array([[0, 0, 0, 0, 0]], dtype=np.int64), False]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
IndexError: shape mismatch: indexing arrays could not be broadcast together with shapes (1,5) (0,)

That certainly seems to imply there is some broadcasting being done.

Aaron Meurer

On Wed, Aug 19, 2020 at 6:55 PM Sebastian Berg wrote:
>
> On Wed, 2020-08-19 at 18:07 -0600, Aaron Meurer wrote:
> > > > 3. If you have multiple advanced indexing you get annoying broadcasting
> > > > of all of these. That is *always* confusing for boolean indices.
> > > > 0-D should not be too special there...
> >
> > OK, now that I am learning more about advanced indexing, this
> > statement is confusing to me. It seems that scalar boolean indices do
> > not broadcast. For example:
>
> Well, broadcasting means you broadcast the *nonzero result* unless I am
> very confused... There is a reason I dismissed it. We could (and
> arguably should) just deprecate it. And I have doubts anyone would
> even notice.
>
> > >>> np.arange(2)[False, np.array([True, False])]
> > array([], dtype=int64)
> > >>> np.arange(2)[tuple(np.broadcast_arrays(False, np.array([True, False])))]
> > Traceback (most recent call last):
> >   File "<stdin>", line 1, in <module>
> > IndexError: too many indices for array: array is 1-dimensional, but 2
> > were indexed
> >
> > And indeed, the docs even say, as you noted, "the nonzero equivalence
> > for Boolean arrays does not hold for zero dimensional boolean arrays,"
> > which I guess also applies to the broadcasting.
>
> I actually think that probably also holds. Nonzero just behave weird
> for 0D because arrays (because it returns a tuple).
> But since broadcasting the nonzero result is so weird, and since 0-D
> booleans require some additional logic and don't generalize 100% (code
> wise), I won't rule out there are differences.
>
> > From what I can tell, the logic is that all integer and boolean
> > arrays
>
> Did you try that? Because as I said above, IIRC broadcasting the
> boolean array without first calling `nonzero` isn't really whats going
> on. And I don't know how it could be whats going on, since adding
> dimensions to a boolean index would have much more implications?
>
> - Sebastian
>
> > (and scalar ints) are broadcast together, *except* for boolean
> > scalars. Then the first boolean scalar is replaced with and(all
> > boolean scalars) and the rest are removed from the index. Then that
> > index adds a length 1 axis if it is True and 0 if it is False.
> >
> > So they don't broadcast, but rather "fake broadcast". I still contend
> > that it would be much more useful, if True were a synonym for newaxis
> > and False worked like newaxis but instead added a length 0 axis.
> > Alternately, True and False scalars should behave exactly like all
> > other boolean arrays with no exceptions (i.e., work like
> > np.nonzero(), broadcast, etc.). This would be less useful, but more consistent.
> > > > Aaron Meurer > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From kmckenna at baselinesw.com Thu Aug 20 17:21:57 2020 From: kmckenna at baselinesw.com (KevinBaselinesw) Date: Thu, 20 Aug 2020 14:21:57 -0700 (MST) Subject: [Numpy-discussion] Feature requests/Enhancements for upper-level engineering students In-Reply-To: <1597943458736-0.post@n7.nabble.com> References: <1597943458736-0.post@n7.nabble.com> Message-ID: <1597958517453-0.post@n7.nabble.com> would your team be interested in contributing to my port of Numpy to .NET? https://github.com/Quansight-Labs/numpy.net I have the vast majority of the Numpy core working as a pure .NET library. All of the other libraries that rely on Numpy are not ported. I am sure we could find some good projects for your team to work on. These would be "green field" projects and would likely be great learning opportunities for them. -- Sent from: http://numpy-discussion.10968.n7.nabble.com/ From sebastian at sipsolutions.net Thu Aug 20 17:50:17 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 20 Aug 2020 16:50:17 -0500 Subject: [Numpy-discussion] What is up with raw boolean indices (like a[False])? In-Reply-To: References: <0c3ec8485726fc06dc60ee0c5d803d75c99b7e90.camel@sipsolutions.net> Message-ID: <7fe7ee7bff9b3cc72518513b1005fd2048ea368a.camel@sipsolutions.net> On Thu, 2020-08-20 at 12:21 -0600, Aaron Meurer wrote: > You're right. I was confusing the broadcasting logic for boolean > arrays. > > However, I did find this example > > > > > np.arange(10).reshape((2, 5))[np.array([[0, 0, 0, 0, 0]], > > > > dtype=np.int64), False] > Traceback (most recent call last): > File "", line 1, in > IndexError: shape mismatch: indexing arrays could not be broadcast > together with shapes (1,5) (0,) > > That certainly seems to imply there is some broadcasting being done. Yes, it broadcasts the array after converting it with `nonzero`, i.e. its much the same as: indices = [[0, 0, 0, 0, 0]], *np.nonzero(False) indices = np.broadcast_arrays(*indices) will give the same result (see also `np.ix_` which converts booleans as well for this reason, to give you outer indexing). I was half way through a mock-up/pseudo code, but thought you likely wasn't sure it was ending up clear. It sounds like things are probably falling into place for you (if they are not, let me know what might help you): 1. Convert all boolean indices into a series of integer indices using `np.nonzero(index)` 2. For True/False scalars, that doesn't work, because `np.nonzero()`. `nonzero` gave us an index array (which is good, we obviously want one), but we need to index into `boolean_index.ndim == 0` dimensions! So that won't work, the approach using `nonzero` cannot generalize here, although boolean indices generalize perfectly. The solution to the dilemma is simple: If we have to index one dimension, but should be indexing zero, then we simply add that dimension to the original array (or at least pretend there was an additional dimension). 3. Do normal indexing with the result *including broadcasting*, we forget it was converted. 
The other way to solve it would be to always reshape the original array to combine all axes being indexed by a single boolean index into one axis and then index it using `np.flatnonzero`. (But that would get a different result if you try to broadcast!) In any case, I am not sure I would bother with making sense of this, except for sports! Its pretty much nonsense and I think the time understanding it is probably better spend deprecating it. The only reason I did not Deprecate itt before, is that I tried to do be minimal in the changes when I rewrote advanced indexing (and generalized boolean scalars correctly) long ago. That was likely the right start/choice at the time, since there were much bigger fish to catch, but I do not think anything is holding us back now. Cheers, Sebastian > > Aaron Meurer > > On Wed, Aug 19, 2020 at 6:55 PM Sebastian Berg > wrote: > > On Wed, 2020-08-19 at 18:07 -0600, Aaron Meurer wrote: > > > > > 3. If you have multiple advanced indexing you get annoying > > > > > broadcasting > > > > > of all of these. That is *always* confusing for boolean > > > > > indices. > > > > > 0-D should not be too special there... > > > > > > OK, now that I am learning more about advanced indexing, this > > > statement is confusing to me. It seems that scalar boolean > > > indices do > > > not broadcast. For example: > > > > Well, broadcasting means you broadcast the *nonzero result* unless > > I am > > very confused... There is a reason I dismissed it. We could (and > > arguably should) just deprecate it. And I have doubts anyone would > > even notice. > > > > > > > > np.arange(2)[False, np.array([True, False])] > > > array([], dtype=int64) > > > > > > np.arange(2)[tuple(np.broadcast_arrays(False, > > > > > > np.array([True, > > > > > > False])))] > > > Traceback (most recent call last): > > > File "", line 1, in > > > IndexError: too many indices for array: array is 1-dimensional, > > > but 2 > > > were indexed > > > > > > And indeed, the docs even say, as you noted, "the nonzero > > > equivalence > > > for Boolean arrays does not hold for zero dimensional boolean > > > arrays," > > > which I guess also applies to the broadcasting. > > > > I actually think that probably also holds. Nonzero just behave > > weird > > for 0D because arrays (because it returns a tuple). > > But since broadcasting the nonzero result is so weird, and since 0- > > D > > booleans require some additional logic and don't generalize 100% > > (code > > wise), I won't rule out there are differences. > > > > > From what I can tell, the logic is that all integer and boolean > > > arrays > > > > Did you try that? Because as I said above, IIRC broadcasting the > > boolean array without first calling `nonzero` isn't really whats > > going > > on. And I don't know how it could be whats going on, since adding > > dimensions to a boolean index would have much more implications? > > > > - Sebastian > > > > > > > (and scalar ints) are broadcast together, *except* for boolean > > > scalars. Then the first boolean scalar is replaced with and(all > > > boolean scalars) and the rest are removed from the index. Then > > > that > > > index adds a length 1 axis if it is True and 0 if it is False. > > > > > > So they don't broadcast, but rather "fake broadcast". I still > > > contend > > > that it would be much more useful, if True were a synonym for > > > newaxis > > > and False worked like newaxis but instead added a length 0 axis. 
> > > Alternately, True and False scalars should behave exactly like > > > all > > > other boolean arrays with no exceptions (i.e., work like > > > np.nonzero(), > > > broadcast, etc.). This would be less useful, but more consistent. > > > > > > Aaron Meurer > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From sebastian at sipsolutions.net Thu Aug 20 17:55:40 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 20 Aug 2020 16:55:40 -0500 Subject: [Numpy-discussion] What is up with raw boolean indices (like a[False])? In-Reply-To: <7fe7ee7bff9b3cc72518513b1005fd2048ea368a.camel@sipsolutions.net> References: <0c3ec8485726fc06dc60ee0c5d803d75c99b7e90.camel@sipsolutions.net> <7fe7ee7bff9b3cc72518513b1005fd2048ea368a.camel@sipsolutions.net> Message-ID: <3e3b8ffedf5efbcf2d9f071a2bcedacfe38d7820.camel@sipsolutions.net> On Thu, 2020-08-20 at 16:50 -0500, Sebastian Berg wrote: > On Thu, 2020-08-20 at 12:21 -0600, Aaron Meurer wrote: > > You're right. I was confusing the broadcasting logic for boolean > > arrays. > > > > However, I did find this example > > > > > > > np.arange(10).reshape((2, 5))[np.array([[0, 0, 0, 0, 0]], > > > > > dtype=np.int64), False] > > Traceback (most recent call last): > > File "", line 1, in > > IndexError: shape mismatch: indexing arrays could not be broadcast > > together with shapes (1,5) (0,) > > > > That certainly seems to imply there is some broadcasting being > > done. > > Yes, it broadcasts the array after converting it with `nonzero`, i.e. > its much the same as: > > indices = [[0, 0, 0, 0, 0]], *np.nonzero(False) > indices = np.broadcast_arrays(*indices) > > will give the same result (see also `np.ix_` which converts booleans > as > well for this reason, to give you outer indexing). > I was half way through a mock-up/pseudo code, but thought you likely > wasn't sure it was ending up clear. It sounds like things are > probably > falling into place for you (if they are not, let me know what might > help you): Sorry editing error up there, in short I hope those steps sense to you, note that the broadcasting is basically part of a later "integer only" indexing step, and the `nonzero` part is pre-processing. > > 1. Convert all boolean indices into a series of integer indices using > `np.nonzero(index)` > > 2. For True/False scalars, that doesn't work, because `np.nonzero()`. > > `nonzero` gave us an index array (which is good, we obviously want > > one), but we need to index into `boolean_index.ndim == 0` > dimensions! > So that won't work, the approach using `nonzero` cannot generalize > > here, although boolean indices generalize perfectly. 
> > The solution to the dilemma is simple: If we have to index one > dimension, but should be indexing zero, then we simply add that > dimension to the original array (or at least pretend there was > an additional dimension). > > 3. Do normal indexing with the result *including broadcasting*, > we forget it was converted. > > The other way to solve it would be to always reshape the original > array > to combine all axes being indexed by a single boolean index into one > axis and then index it using `np.flatnonzero`. (But that would get a > different result if you try to broadcast!) > > > In any case, I am not sure I would bother with making sense of this, > except for sports! > Its pretty much nonsense and I think the time understanding it is > probably better spend deprecating it. The only reason I did not > Deprecate itt before, is that I tried to do be minimal in the changes > when I rewrote advanced indexing (and generalized boolean scalars > correctly) long ago. That was likely the right start/choice at the > time, since there were much bigger fish to catch, but I do not think > anything is holding us back now. > > Cheers, > > Sebastian > > > > Aaron Meurer > > > > On Wed, Aug 19, 2020 at 6:55 PM Sebastian Berg > > wrote: > > > On Wed, 2020-08-19 at 18:07 -0600, Aaron Meurer wrote: > > > > > > 3. If you have multiple advanced indexing you get annoying > > > > > > broadcasting > > > > > > of all of these. That is *always* confusing for boolean > > > > > > indices. > > > > > > 0-D should not be too special there... > > > > > > > > OK, now that I am learning more about advanced indexing, this > > > > statement is confusing to me. It seems that scalar boolean > > > > indices do > > > > not broadcast. For example: > > > > > > Well, broadcasting means you broadcast the *nonzero result* > > > unless > > > I am > > > very confused... There is a reason I dismissed it. We could (and > > > arguably should) just deprecate it. And I have doubts anyone > > > would > > > even notice. > > > > > > > > > > np.arange(2)[False, np.array([True, False])] > > > > array([], dtype=int64) > > > > > > > np.arange(2)[tuple(np.broadcast_arrays(False, > > > > > > > np.array([True, > > > > > > > False])))] > > > > Traceback (most recent call last): > > > > File "", line 1, in > > > > IndexError: too many indices for array: array is 1-dimensional, > > > > but 2 > > > > were indexed > > > > > > > > And indeed, the docs even say, as you noted, "the nonzero > > > > equivalence > > > > for Boolean arrays does not hold for zero dimensional boolean > > > > arrays," > > > > which I guess also applies to the broadcasting. > > > > > > I actually think that probably also holds. Nonzero just behave > > > weird > > > for 0D because arrays (because it returns a tuple). > > > But since broadcasting the nonzero result is so weird, and since > > > 0- > > > D > > > booleans require some additional logic and don't generalize 100% > > > (code > > > wise), I won't rule out there are differences. > > > > > > > From what I can tell, the logic is that all integer and boolean > > > > arrays > > > > > > Did you try that? Because as I said above, IIRC broadcasting the > > > boolean array without first calling `nonzero` isn't really whats > > > going > > > on. And I don't know how it could be whats going on, since adding > > > dimensions to a boolean index would have much more implications? > > > > > > - Sebastian > > > > > > > > > > (and scalar ints) are broadcast together, *except* for boolean > > > > scalars. 
Then the first boolean scalar is replaced with and(all > > > > boolean scalars) and the rest are removed from the index. Then > > > > that > > > > index adds a length 1 axis if it is True and 0 if it is False. > > > > > > > > So they don't broadcast, but rather "fake broadcast". I still > > > > contend > > > > that it would be much more useful, if True were a synonym for > > > > newaxis > > > > and False worked like newaxis but instead added a length 0 > > > > axis. > > > > Alternately, True and False scalars should behave exactly like > > > > all > > > > other boolean arrays with no exceptions (i.e., work like > > > > np.nonzero(), > > > > broadcast, etc.). This would be less useful, but more > > > > consistent. > > > > > > > > Aaron Meurer > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at python.org > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From asmeurer at gmail.com Thu Aug 20 18:00:46 2020 From: asmeurer at gmail.com (Aaron Meurer) Date: Thu, 20 Aug 2020 16:00:46 -0600 Subject: [Numpy-discussion] What is up with raw boolean indices (like a[False])? In-Reply-To: <3e3b8ffedf5efbcf2d9f071a2bcedacfe38d7820.camel@sipsolutions.net> References: <0c3ec8485726fc06dc60ee0c5d803d75c99b7e90.camel@sipsolutions.net> <7fe7ee7bff9b3cc72518513b1005fd2048ea368a.camel@sipsolutions.net> <3e3b8ffedf5efbcf2d9f071a2bcedacfe38d7820.camel@sipsolutions.net> Message-ID: Just to be clear, what exactly do you think should be deprecated? Boolean scalar indices in general, or just boolean scalars combined with other arrays, or something else? Aaron Meurer On Thu, Aug 20, 2020 at 3:56 PM Sebastian Berg wrote: > > On Thu, 2020-08-20 at 16:50 -0500, Sebastian Berg wrote: > > On Thu, 2020-08-20 at 12:21 -0600, Aaron Meurer wrote: > > > You're right. I was confusing the broadcasting logic for boolean > > > arrays. > > > > > > However, I did find this example > > > > > > > > > np.arange(10).reshape((2, 5))[np.array([[0, 0, 0, 0, 0]], > > > > > > dtype=np.int64), False] > > > Traceback (most recent call last): > > > File "", line 1, in > > > IndexError: shape mismatch: indexing arrays could not be broadcast > > > together with shapes (1,5) (0,) > > > > > > That certainly seems to imply there is some broadcasting being > > > done. > > > > Yes, it broadcasts the array after converting it with `nonzero`, i.e. > > its much the same as: > > > > indices = [[0, 0, 0, 0, 0]], *np.nonzero(False) > > indices = np.broadcast_arrays(*indices) > > > > will give the same result (see also `np.ix_` which converts booleans > > as > > well for this reason, to give you outer indexing). > > I was half way through a mock-up/pseudo code, but thought you likely > > wasn't sure it was ending up clear. 
It sounds like things are > > probably > > falling into place for you (if they are not, let me know what might > > help you): > > Sorry editing error up there, in short I hope those steps sense to you, > note that the broadcasting is basically part of a later "integer only" > indexing step, and the `nonzero` part is pre-processing. > > > > > 1. Convert all boolean indices into a series of integer indices using > > `np.nonzero(index)` > > > > 2. For True/False scalars, that doesn't work, because `np.nonzero()`. > > > > `nonzero` gave us an index array (which is good, we obviously want > > > > one), but we need to index into `boolean_index.ndim == 0` > > dimensions! > > So that won't work, the approach using `nonzero` cannot generalize > > > > here, although boolean indices generalize perfectly. > > > > The solution to the dilemma is simple: If we have to index one > > dimension, but should be indexing zero, then we simply add that > > dimension to the original array (or at least pretend there was > > an additional dimension). > > > > 3. Do normal indexing with the result *including broadcasting*, > > we forget it was converted. > > > > The other way to solve it would be to always reshape the original > > array > > to combine all axes being indexed by a single boolean index into one > > axis and then index it using `np.flatnonzero`. (But that would get a > > different result if you try to broadcast!) > > > > > > In any case, I am not sure I would bother with making sense of this, > > except for sports! > > Its pretty much nonsense and I think the time understanding it is > > probably better spend deprecating it. The only reason I did not > > Deprecate itt before, is that I tried to do be minimal in the changes > > when I rewrote advanced indexing (and generalized boolean scalars > > correctly) long ago. That was likely the right start/choice at the > > time, since there were much bigger fish to catch, but I do not think > > anything is holding us back now. > > > > Cheers, > > > > Sebastian > > > > > > > Aaron Meurer > > > > > > On Wed, Aug 19, 2020 at 6:55 PM Sebastian Berg > > > wrote: > > > > On Wed, 2020-08-19 at 18:07 -0600, Aaron Meurer wrote: > > > > > > > 3. If you have multiple advanced indexing you get annoying > > > > > > > broadcasting > > > > > > > of all of these. That is *always* confusing for boolean > > > > > > > indices. > > > > > > > 0-D should not be too special there... > > > > > > > > > > OK, now that I am learning more about advanced indexing, this > > > > > statement is confusing to me. It seems that scalar boolean > > > > > indices do > > > > > not broadcast. For example: > > > > > > > > Well, broadcasting means you broadcast the *nonzero result* > > > > unless > > > > I am > > > > very confused... There is a reason I dismissed it. We could (and > > > > arguably should) just deprecate it. And I have doubts anyone > > > > would > > > > even notice. 
> > > > > > > > > > > > np.arange(2)[False, np.array([True, False])] > > > > > array([], dtype=int64) > > > > > > > > np.arange(2)[tuple(np.broadcast_arrays(False, > > > > > > > > np.array([True, > > > > > > > > False])))] > > > > > Traceback (most recent call last): > > > > > File "", line 1, in > > > > > IndexError: too many indices for array: array is 1-dimensional, > > > > > but 2 > > > > > were indexed > > > > > > > > > > And indeed, the docs even say, as you noted, "the nonzero > > > > > equivalence > > > > > for Boolean arrays does not hold for zero dimensional boolean > > > > > arrays," > > > > > which I guess also applies to the broadcasting. > > > > > > > > I actually think that probably also holds. Nonzero just behave > > > > weird > > > > for 0D because arrays (because it returns a tuple). > > > > But since broadcasting the nonzero result is so weird, and since > > > > 0- > > > > D > > > > booleans require some additional logic and don't generalize 100% > > > > (code > > > > wise), I won't rule out there are differences. > > > > > > > > > From what I can tell, the logic is that all integer and boolean > > > > > arrays > > > > > > > > Did you try that? Because as I said above, IIRC broadcasting the > > > > boolean array without first calling `nonzero` isn't really whats > > > > going > > > > on. And I don't know how it could be whats going on, since adding > > > > dimensions to a boolean index would have much more implications? > > > > > > > > - Sebastian > > > > > > > > > > > > > (and scalar ints) are broadcast together, *except* for boolean > > > > > scalars. Then the first boolean scalar is replaced with and(all > > > > > boolean scalars) and the rest are removed from the index. Then > > > > > that > > > > > index adds a length 1 axis if it is True and 0 if it is False. > > > > > > > > > > So they don't broadcast, but rather "fake broadcast". I still > > > > > contend > > > > > that it would be much more useful, if True were a synonym for > > > > > newaxis > > > > > and False worked like newaxis but instead added a length 0 > > > > > axis. > > > > > Alternately, True and False scalars should behave exactly like > > > > > all > > > > > other boolean arrays with no exceptions (i.e., work like > > > > > np.nonzero(), > > > > > broadcast, etc.). This would be less useful, but more > > > > > consistent. 
> > > > > > > > > > Aaron Meurer > > > > > _______________________________________________ > > > > > NumPy-Discussion mailing list > > > > > NumPy-Discussion at python.org > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at python.org > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From sebastian at sipsolutions.net Thu Aug 20 18:37:43 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 20 Aug 2020 17:37:43 -0500 Subject: [Numpy-discussion] What is up with raw boolean indices (like a[False])? In-Reply-To: (sfid-20200821_000224_575232_ABA06109) References: <0c3ec8485726fc06dc60ee0c5d803d75c99b7e90.camel@sipsolutions.net> <7fe7ee7bff9b3cc72518513b1005fd2048ea368a.camel@sipsolutions.net> <3e3b8ffedf5efbcf2d9f071a2bcedacfe38d7820.camel@sipsolutions.net> (sfid-20200821_000224_575232_ABA06109) Message-ID: <5a76b9753c84a324d78cf30d78fc1a3459467c9d.camel@sipsolutions.net> On Thu, 2020-08-20 at 16:00 -0600, Aaron Meurer wrote: > Just to be clear, what exactly do you think should be deprecated? > Boolean scalar indices in general, or just boolean scalars combined > with other arrays, or something else? My angle is that we should allow only: * Any number of integer array indices (ideally only explicitly with `arr.vindex[]`, but we do not have that luxury right now.) * A single boolean index (array or scalar is identical) but no mix of the above (including multiple boolean indices). Because I think they are at least one level more confusing than multiple advanced indices. I admit, I forgot that the broadcasting logic is fine in this case: arr = np.zeros((2, 3)) arr[[True], np.array(3)] where the advanced index is also a scalar index. In that case the result is straight forward, since broadcasting does not affect `np.array(3)`. I am happy to be wrong about that assessment, but I think your opinion on it could likely push us towards just doing a Deprecation. The only use case for "multiple boolean indices" that I could think of was this: arr = np.diag([1, 2, 3, 4]) # 2-d square array indx = arr.diagonal() > 2 # mask for each row and column masked_diagonal = arr[indx, indx] print(repr(masked_diagonal)) # array([3, 4]) and my guess is that the reaction to that code is a: "Wait what?!" That code might seem reasonable, but it only works if you have the exact same number of `True` values in the two indices. And if you have the exact same number but two different arrays, then I fail to reason about the result without doing the `nonzero` step, which I think indicates that there just is no logical concept for it. So, I think we may be better of forcing the few power-user who may have found a use for this type of nugget to use `np.nonzero()` or find another solution. 
- Sebastian > > Aaron Meurer > > On Thu, Aug 20, 2020 at 3:56 PM Sebastian Berg > wrote: > > On Thu, 2020-08-20 at 16:50 -0500, Sebastian Berg wrote: > > > On Thu, 2020-08-20 at 12:21 -0600, Aaron Meurer wrote: > > > > You're right. I was confusing the broadcasting logic for > > > > boolean > > > > arrays. > > > > > > > > However, I did find this example > > > > > > > > > > > np.arange(10).reshape((2, 5))[np.array([[0, 0, 0, 0, 0]], > > > > > > > dtype=np.int64), False] > > > > Traceback (most recent call last): > > > > File "", line 1, in > > > > IndexError: shape mismatch: indexing arrays could not be > > > > broadcast > > > > together with shapes (1,5) (0,) > > > > > > > > That certainly seems to imply there is some broadcasting being > > > > done. > > > > > > Yes, it broadcasts the array after converting it with `nonzero`, > > > i.e. > > > its much the same as: > > > > > > indices = [[0, 0, 0, 0, 0]], *np.nonzero(False) > > > indices = np.broadcast_arrays(*indices) > > > > > > will give the same result (see also `np.ix_` which converts > > > booleans > > > as > > > well for this reason, to give you outer indexing). > > > I was half way through a mock-up/pseudo code, but thought you > > > likely > > > wasn't sure it was ending up clear. It sounds like things are > > > probably > > > falling into place for you (if they are not, let me know what > > > might > > > help you): > > > > Sorry editing error up there, in short I hope those steps sense to > > you, > > note that the broadcasting is basically part of a later "integer > > only" > > indexing step, and the `nonzero` part is pre-processing. > > > > > 1. Convert all boolean indices into a series of integer indices > > > using > > > `np.nonzero(index)` > > > > > > 2. For True/False scalars, that doesn't work, because > > > `np.nonzero()`. > > > > > > `nonzero` gave us an index array (which is good, we obviously > > > want > > > > > > one), but we need to index into `boolean_index.ndim == 0` > > > dimensions! > > > So that won't work, the approach using `nonzero` cannot > > > generalize > > > > > > here, although boolean indices generalize perfectly. > > > > > > The solution to the dilemma is simple: If we have to index one > > > dimension, but should be indexing zero, then we simply add > > > that > > > dimension to the original array (or at least pretend there was > > > an additional dimension). > > > > > > 3. Do normal indexing with the result *including broadcasting*, > > > we forget it was converted. > > > > > > The other way to solve it would be to always reshape the original > > > array > > > to combine all axes being indexed by a single boolean index into > > > one > > > axis and then index it using `np.flatnonzero`. (But that would > > > get a > > > different result if you try to broadcast!) > > > > > > > > > In any case, I am not sure I would bother with making sense of > > > this, > > > except for sports! > > > Its pretty much nonsense and I think the time understanding it is > > > probably better spend deprecating it. The only reason I did not > > > Deprecate itt before, is that I tried to do be minimal in the > > > changes > > > when I rewrote advanced indexing (and generalized boolean scalars > > > correctly) long ago. That was likely the right start/choice at > > > the > > > time, since there were much bigger fish to catch, but I do not > > > think > > > anything is holding us back now. 
> > > > > > Cheers, > > > > > > Sebastian > > > > > > > > > > Aaron Meurer > > > > > > > > On Wed, Aug 19, 2020 at 6:55 PM Sebastian Berg > > > > wrote: > > > > > On Wed, 2020-08-19 at 18:07 -0600, Aaron Meurer wrote: > > > > > > > > 3. If you have multiple advanced indexing you get > > > > > > > > annoying > > > > > > > > broadcasting > > > > > > > > of all of these. That is *always* confusing for > > > > > > > > boolean > > > > > > > > indices. > > > > > > > > 0-D should not be too special there... > > > > > > > > > > > > OK, now that I am learning more about advanced indexing, > > > > > > this > > > > > > statement is confusing to me. It seems that scalar boolean > > > > > > indices do > > > > > > not broadcast. For example: > > > > > > > > > > Well, broadcasting means you broadcast the *nonzero result* > > > > > unless > > > > > I am > > > > > very confused... There is a reason I dismissed it. We could > > > > > (and > > > > > arguably should) just deprecate it. And I have doubts anyone > > > > > would > > > > > even notice. > > > > > > > > > > > > > > np.arange(2)[False, np.array([True, False])] > > > > > > array([], dtype=int64) > > > > > > > > > np.arange(2)[tuple(np.broadcast_arrays(False, > > > > > > > > > np.array([True, > > > > > > > > > False])))] > > > > > > Traceback (most recent call last): > > > > > > File "", line 1, in > > > > > > IndexError: too many indices for array: array is 1- > > > > > > dimensional, > > > > > > but 2 > > > > > > were indexed > > > > > > > > > > > > And indeed, the docs even say, as you noted, "the nonzero > > > > > > equivalence > > > > > > for Boolean arrays does not hold for zero dimensional > > > > > > boolean > > > > > > arrays," > > > > > > which I guess also applies to the broadcasting. > > > > > > > > > > I actually think that probably also holds. Nonzero just > > > > > behave > > > > > weird > > > > > for 0D because arrays (because it returns a tuple). > > > > > But since broadcasting the nonzero result is so weird, and > > > > > since > > > > > 0- > > > > > D > > > > > booleans require some additional logic and don't generalize > > > > > 100% > > > > > (code > > > > > wise), I won't rule out there are differences. > > > > > > > > > > > From what I can tell, the logic is that all integer and > > > > > > boolean > > > > > > arrays > > > > > > > > > > Did you try that? Because as I said above, IIRC broadcasting > > > > > the > > > > > boolean array without first calling `nonzero` isn't really > > > > > whats > > > > > going > > > > > on. And I don't know how it could be whats going on, since > > > > > adding > > > > > dimensions to a boolean index would have much more > > > > > implications? > > > > > > > > > > - Sebastian > > > > > > > > > > > > > > > > (and scalar ints) are broadcast together, *except* for > > > > > > boolean > > > > > > scalars. Then the first boolean scalar is replaced with > > > > > > and(all > > > > > > boolean scalars) and the rest are removed from the index. > > > > > > Then > > > > > > that > > > > > > index adds a length 1 axis if it is True and 0 if it is > > > > > > False. > > > > > > > > > > > > So they don't broadcast, but rather "fake broadcast". I > > > > > > still > > > > > > contend > > > > > > that it would be much more useful, if True were a synonym > > > > > > for > > > > > > newaxis > > > > > > and False worked like newaxis but instead added a length 0 > > > > > > axis. 
> > > > > > Alternately, True and False scalars should behave exactly > > > > > > like > > > > > > all > > > > > > other boolean arrays with no exceptions (i.e., work like > > > > > > np.nonzero(), > > > > > > broadcast, etc.). This would be less useful, but more > > > > > > consistent. > > > > > > > > > > > > Aaron Meurer > > > > > > _______________________________________________ > > > > > > NumPy-Discussion mailing list > > > > > > NumPy-Discussion at python.org > > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > > > > > > > _______________________________________________ > > > > > NumPy-Discussion mailing list > > > > > NumPy-Discussion at python.org > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at python.org > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From asmeurer at gmail.com Thu Aug 20 19:08:39 2020 From: asmeurer at gmail.com (Aaron Meurer) Date: Thu, 20 Aug 2020 17:08:39 -0600 Subject: [Numpy-discussion] What is up with raw boolean indices (like a[False])? In-Reply-To: <5a76b9753c84a324d78cf30d78fc1a3459467c9d.camel@sipsolutions.net> References: <0c3ec8485726fc06dc60ee0c5d803d75c99b7e90.camel@sipsolutions.net> <7fe7ee7bff9b3cc72518513b1005fd2048ea368a.camel@sipsolutions.net> <3e3b8ffedf5efbcf2d9f071a2bcedacfe38d7820.camel@sipsolutions.net> <5a76b9753c84a324d78cf30d78fc1a3459467c9d.camel@sipsolutions.net> Message-ID: On Thu, Aug 20, 2020 at 4:38 PM Sebastian Berg wrote: > > On Thu, 2020-08-20 at 16:00 -0600, Aaron Meurer wrote: > > Just to be clear, what exactly do you think should be deprecated? > > Boolean scalar indices in general, or just boolean scalars combined > > with other arrays, or something else? > > My angle is that we should allow only: > > * Any number of integer array indices (ideally only explicitly > with `arr.vindex[]`, but we do not have that luxury right now.) > > * A single boolean index (array or scalar is identical) > > but no mix of the above (including multiple boolean indices). > > Because I think they are at least one level more confusing than > multiple advanced indices. > > I admit, I forgot that the broadcasting logic is fine in this case: > > arr = np.zeros((2, 3)) > arr[[True], np.array(3)] > > where the advanced index is also a scalar index. In that case the > result is straight forward, since broadcasting does not affect > `np.array(3)`. > > > I am happy to be wrong about that assessment, but I think your opinion > on it could likely push us towards just doing a Deprecation. 
> The only use case for "multiple boolean indices" that I could think of > was this: > > arr = np.diag([1, 2, 3, 4]) # 2-d square array > indx = arr.diagonal() > 2 # mask for each row and column > masked_diagonal = arr[indx, indx] > print(repr(masked_diagonal)) > # array([3, 4]) > > and my guess is that the reaction to that code is a: "Wait what?!" > > That code might seem reasonable, but it only works if you have the > exact same number of `True` values in the two indices. > And if you have the exact same number but two different arrays, then I > fail to reason about the result without doing the `nonzero` step, which > I think indicates that there just is no logical concept for it. > > > So, I think we may be better of forcing the few power-user who may have > found a use for this type of nugget to use `np.nonzero()` or find > another solution. Well I'm cautious because despite implementing the logic for all this, I'm a bit divorced from most use-cases. So I don't have a great feeling for what is currently being used. For example, is it possible to have a situation where you build a mask out of an expression, like a[x > 0] or whatever, where the mask expression could be any number of dimensions depending on the input values? And if so, does the current logic for scalar booleans do the right thing when the number of dimensions happens to be 0. Mixing nonscalar boolean and integer arrays seems fine, as far as the logic is concerned. I'm not really sure if it makes sense semantically. I'll have to think about it more. The thing that has the most odd corner cases in the indexing logic is boolean scalars. It would be nice if you could treat them uniformly with the same logic as other boolean arrays, but they have special cases everywhere. This is in contrast with integer scalars which perfectly match the logic of integer arrays with the shape == (). Maybe I'm just not looking at it from the right angle. I don't know. In ndindex, I've left the "arrays separated by slices, ellipses, or newaxes" case unimplemented. Travis Oliphant told me he thinks it was a mistake and it would be better to not allow it. I've also left boolean scalars mixed with other arrays unimplemented because I don't want to waste more time trying to figure out what is going on in the example I posted earlier (though what you wrote helps). I have nonscalar boolean arrays mixed with integer arrays working just fine, and the logic isn't really any different than it would be if I only supported them separately. Aaron Meurer > > - Sebastian > > > > > > Aaron Meurer > > > > On Thu, Aug 20, 2020 at 3:56 PM Sebastian Berg > > wrote: > > > On Thu, 2020-08-20 at 16:50 -0500, Sebastian Berg wrote: > > > > On Thu, 2020-08-20 at 12:21 -0600, Aaron Meurer wrote: > > > > > You're right. I was confusing the broadcasting logic for > > > > > boolean > > > > > arrays. > > > > > > > > > > However, I did find this example > > > > > > > > > > > > > np.arange(10).reshape((2, 5))[np.array([[0, 0, 0, 0, 0]], > > > > > > > > dtype=np.int64), False] > > > > > Traceback (most recent call last): > > > > > File "", line 1, in > > > > > IndexError: shape mismatch: indexing arrays could not be > > > > > broadcast > > > > > together with shapes (1,5) (0,) > > > > > > > > > > That certainly seems to imply there is some broadcasting being > > > > > done. > > > > > > > > Yes, it broadcasts the array after converting it with `nonzero`, > > > > i.e. 
> > > > its much the same as: > > > > > > > > indices = [[0, 0, 0, 0, 0]], *np.nonzero(False) > > > > indices = np.broadcast_arrays(*indices) > > > > > > > > will give the same result (see also `np.ix_` which converts > > > > booleans > > > > as > > > > well for this reason, to give you outer indexing). > > > > I was half way through a mock-up/pseudo code, but thought you > > > > likely > > > > wasn't sure it was ending up clear. It sounds like things are > > > > probably > > > > falling into place for you (if they are not, let me know what > > > > might > > > > help you): > > > > > > Sorry editing error up there, in short I hope those steps sense to > > > you, > > > note that the broadcasting is basically part of a later "integer > > > only" > > > indexing step, and the `nonzero` part is pre-processing. > > > > > > > 1. Convert all boolean indices into a series of integer indices > > > > using > > > > `np.nonzero(index)` > > > > > > > > 2. For True/False scalars, that doesn't work, because > > > > `np.nonzero()`. > > > > > > > > `nonzero` gave us an index array (which is good, we obviously > > > > want > > > > > > > > one), but we need to index into `boolean_index.ndim == 0` > > > > dimensions! > > > > So that won't work, the approach using `nonzero` cannot > > > > generalize > > > > > > > > here, although boolean indices generalize perfectly. > > > > > > > > The solution to the dilemma is simple: If we have to index one > > > > dimension, but should be indexing zero, then we simply add > > > > that > > > > dimension to the original array (or at least pretend there was > > > > an additional dimension). > > > > > > > > 3. Do normal indexing with the result *including broadcasting*, > > > > we forget it was converted. > > > > > > > > The other way to solve it would be to always reshape the original > > > > array > > > > to combine all axes being indexed by a single boolean index into > > > > one > > > > axis and then index it using `np.flatnonzero`. (But that would > > > > get a > > > > different result if you try to broadcast!) > > > > > > > > > > > > In any case, I am not sure I would bother with making sense of > > > > this, > > > > except for sports! > > > > Its pretty much nonsense and I think the time understanding it is > > > > probably better spend deprecating it. The only reason I did not > > > > Deprecate itt before, is that I tried to do be minimal in the > > > > changes > > > > when I rewrote advanced indexing (and generalized boolean scalars > > > > correctly) long ago. That was likely the right start/choice at > > > > the > > > > time, since there were much bigger fish to catch, but I do not > > > > think > > > > anything is holding us back now. > > > > > > > > Cheers, > > > > > > > > Sebastian > > > > > > > > > > > > > Aaron Meurer > > > > > > > > > > On Wed, Aug 19, 2020 at 6:55 PM Sebastian Berg > > > > > wrote: > > > > > > On Wed, 2020-08-19 at 18:07 -0600, Aaron Meurer wrote: > > > > > > > > > 3. If you have multiple advanced indexing you get > > > > > > > > > annoying > > > > > > > > > broadcasting > > > > > > > > > of all of these. That is *always* confusing for > > > > > > > > > boolean > > > > > > > > > indices. > > > > > > > > > 0-D should not be too special there... > > > > > > > > > > > > > > OK, now that I am learning more about advanced indexing, > > > > > > > this > > > > > > > statement is confusing to me. It seems that scalar boolean > > > > > > > indices do > > > > > > > not broadcast. 
For example: > > > > > > > > > > > > Well, broadcasting means you broadcast the *nonzero result* > > > > > > unless > > > > > > I am > > > > > > very confused... There is a reason I dismissed it. We could > > > > > > (and > > > > > > arguably should) just deprecate it. And I have doubts anyone > > > > > > would > > > > > > even notice. > > > > > > > > > > > > > > > > np.arange(2)[False, np.array([True, False])] > > > > > > > array([], dtype=int64) > > > > > > > > > > np.arange(2)[tuple(np.broadcast_arrays(False, > > > > > > > > > > np.array([True, > > > > > > > > > > False])))] > > > > > > > Traceback (most recent call last): > > > > > > > File "", line 1, in > > > > > > > IndexError: too many indices for array: array is 1- > > > > > > > dimensional, > > > > > > > but 2 > > > > > > > were indexed > > > > > > > > > > > > > > And indeed, the docs even say, as you noted, "the nonzero > > > > > > > equivalence > > > > > > > for Boolean arrays does not hold for zero dimensional > > > > > > > boolean > > > > > > > arrays," > > > > > > > which I guess also applies to the broadcasting. > > > > > > > > > > > > I actually think that probably also holds. Nonzero just > > > > > > behave > > > > > > weird > > > > > > for 0D because arrays (because it returns a tuple). > > > > > > But since broadcasting the nonzero result is so weird, and > > > > > > since > > > > > > 0- > > > > > > D > > > > > > booleans require some additional logic and don't generalize > > > > > > 100% > > > > > > (code > > > > > > wise), I won't rule out there are differences. > > > > > > > > > > > > > From what I can tell, the logic is that all integer and > > > > > > > boolean > > > > > > > arrays > > > > > > > > > > > > Did you try that? Because as I said above, IIRC broadcasting > > > > > > the > > > > > > boolean array without first calling `nonzero` isn't really > > > > > > whats > > > > > > going > > > > > > on. And I don't know how it could be whats going on, since > > > > > > adding > > > > > > dimensions to a boolean index would have much more > > > > > > implications? > > > > > > > > > > > > - Sebastian > > > > > > > > > > > > > > > > > > > (and scalar ints) are broadcast together, *except* for > > > > > > > boolean > > > > > > > scalars. Then the first boolean scalar is replaced with > > > > > > > and(all > > > > > > > boolean scalars) and the rest are removed from the index. > > > > > > > Then > > > > > > > that > > > > > > > index adds a length 1 axis if it is True and 0 if it is > > > > > > > False. > > > > > > > > > > > > > > So they don't broadcast, but rather "fake broadcast". I > > > > > > > still > > > > > > > contend > > > > > > > that it would be much more useful, if True were a synonym > > > > > > > for > > > > > > > newaxis > > > > > > > and False worked like newaxis but instead added a length 0 > > > > > > > axis. > > > > > > > Alternately, True and False scalars should behave exactly > > > > > > > like > > > > > > > all > > > > > > > other boolean arrays with no exceptions (i.e., work like > > > > > > > np.nonzero(), > > > > > > > broadcast, etc.). This would be less useful, but more > > > > > > > consistent. 
> > > > > > > > > > > > > > Aaron Meurer > > > > > > > _______________________________________________ > > > > > > > NumPy-Discussion mailing list > > > > > > > NumPy-Discussion at python.org > > > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > > > > > > > > > > _______________________________________________ > > > > > > NumPy-Discussion mailing list > > > > > > NumPy-Discussion at python.org > > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > _______________________________________________ > > > > > NumPy-Discussion mailing list > > > > > NumPy-Discussion at python.org > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at python.org > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From sebastian at sipsolutions.net Thu Aug 20 20:17:32 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 20 Aug 2020 19:17:32 -0500 Subject: [Numpy-discussion] What is up with raw boolean indices (like a[False])? In-Reply-To: References: <0c3ec8485726fc06dc60ee0c5d803d75c99b7e90.camel@sipsolutions.net> <7fe7ee7bff9b3cc72518513b1005fd2048ea368a.camel@sipsolutions.net> <3e3b8ffedf5efbcf2d9f071a2bcedacfe38d7820.camel@sipsolutions.net> <5a76b9753c84a324d78cf30d78fc1a3459467c9d.camel@sipsolutions.net> Message-ID: On Thu, 2020-08-20 at 17:08 -0600, Aaron Meurer wrote: > On Thu, Aug 20, 2020 at 4:38 PM Sebastian Berg > wrote: > > On Thu, 2020-08-20 at 16:00 -0600, Aaron Meurer wrote: > > > Just to be clear, what exactly do you think should be deprecated? > > > Boolean scalar indices in general, or just boolean scalars > > > combined > > > with other arrays, or something else? > > > > My angle is that we should allow only: > > > > * Any number of integer array indices (ideally only explicitly > > with `arr.vindex[]`, but we do not have that luxury right now.) > > > > * A single boolean index (array or scalar is identical) > > > > but no mix of the above (including multiple boolean indices). > > > > Because I think they are at least one level more confusing than > > multiple advanced indices. > > > > I admit, I forgot that the broadcasting logic is fine in this case: > > > > arr = np.zeros((2, 3)) > > arr[[True], np.array(3)] > > > > where the advanced index is also a scalar index. In that case the > > result is straight forward, since broadcasting does not affect > > `np.array(3)`. > > > > > > I am happy to be wrong about that assessment, but I think your > > opinion > > on it could likely push us towards just doing a Deprecation. 
> > The only use case for "multiple boolean indices" that I could think > > of > > was this: > > > > arr = np.diag([1, 2, 3, 4]) # 2-d square array > > indx = arr.diagonal() > 2 # mask for each row and column > > masked_diagonal = arr[indx, indx] > > print(repr(masked_diagonal)) > > # array([3, 4]) > > > > and my guess is that the reaction to that code is a: "Wait what?!" > > > > That code might seem reasonable, but it only works if you have the > > exact same number of `True` values in the two indices. > > And if you have the exact same number but two different arrays, > > then I > > fail to reason about the result without doing the `nonzero` step, > > which > > I think indicates that there just is no logical concept for it. > > > > > > So, I think we may be better of forcing the few power-user who may > > have > > found a use for this type of nugget to use `np.nonzero()` or find > > another solution. > > Well I'm cautious because despite implementing the logic for all > this, > I'm a bit divorced from most use-cases. So I don't have a great > feeling for what is currently being used. For example, is it possible > to have a situation where you build a mask out of an expression, like > a[x > 0] or whatever, where the mask expression could be any number > of I am not sure anyone does it, but I certainly can think of ways to use this functionality: ``` def good_images(image_or_stack): """Filter dark images image_or_stack : ndarray (..., N, M, 3) Returns ------- good_images : ndarray (K, N, M, 3) Returns all good images as a one dimensional stack for further processing, where `K` is the number of good images. """ assert image_or_stack.ndim >= 3 assert image_or_stack.shape[-1] == 3 # 3 colors, fixed. average_brightness = image_or_stack.mean((-3, -2, -1)) return image_or_stack[average_brigthness, ...] ``` Note that the above uses a single True/False if you pass in a single image. > dimensions depending on the input values? And if so, does the current > logic for scalar booleans do the right thing when the number of > dimensions happens to be 0. > > Mixing nonscalar boolean and integer arrays seems fine, as far as the > logic is concerned. I'm not really sure if it makes sense > semantically. I'll have to think about it more. The thing that has > the > most odd corner cases in the indexing logic is boolean scalars. It I think they are perfectly fine semantically, but they definitely do require special handling. Although the reason for that special handling is that we have to implement boolean indices using integer array indices and that is not possible without additional logic. If you browse the NumPy code, you will see there is a `HAS_0D_BOOL` macro (basically enum), to distinguish: internal_indx = np.nonzero(False) and: internal_indx = np.nonzero([False]) because the first effectively inserts a new dimension and then indices it, while the former just indices an existing dimension. > would be nice if you could treat them uniformly with the same logic > as > other boolean arrays, but they have special cases everywhere. This is > in contrast with integer scalars which perfectly match the logic of > integer arrays with the shape == (). Maybe I'm just not looking at it > from the right angle. I don't know. I hope the example above helps you, I think you should always remember the two rules of boolean indexing mentioned somewhere in the docs: * A boolean array indexes into `arr.ndim` dimensions, and effectively removes them. * A boolean array index adds a single input array. 
I guess, I should have written that mock-up code (maybe you can help improve the NumPy docs, although I guess this might be too technical): ``` def preprocess_boolean_indices(arr, indices): """Take an array and indices and returns a new array and new indices without any boolean ones. NOTE: Code will not handle None or Ellipsis """ new_indices = [] for axis, index in enumerate(indices): if not is_boolean_index(index): new_indices.append(index) # Check whether dimensions match here! new_indices.extend(np.nonzero(indices)) if index.ndim == 0: # nonzero result added an index, but we # should index into 0-dimensions, so add one. # (Ellipsis or None would mean `axis` is incorrect) arr = np.expand_dims(arr, axis) return arr, indices prep_arr, prep_indices = preprocess_boolean_indices(arr, indices) arr[indices] == prep_arr[prep_indices] ``` That is ugly, but the issue is not in the semantics of 0-D booleans, but rather in the translating boolean indices to integer indices. > In ndindex, I've left the "arrays separated by slices, ellipses, or > newaxes" case unimplemented. Travis Oliphant told me he thinks it was > a mistake and it would be better to not allow it. I've also left Yeah, either always transpose or just refuse the "separated by" cases. It is an interesting angle to only support the cases where axis insertion can be done as "expected", I remember mainly the discussion to just always transpose. > boolean scalars mixed with other arrays unimplemented because I don't > want to waste more time trying to figure out what is going on in the > example I posted earlier (though what you wrote helps). I have Absolutely agree with that step (I don't know if you are careful with scalars and 0D arrays, it would be the only issue I can think of). > nonscalar boolean arrays mixed with integer arrays working just fine, > and the logic isn't really any different than it would be if I only > supported them separately. Right, the implementation is likely straight forward. But the semantics of it is pretty weird (or impossible), almost any trial will show that, I think: arr = np.arange(12).reshape(3, 4) arr # array([[ 0, 1, 2, 3], # [ 4, 5, 6, 7], # [ 8, 9, 10, 11]]) arr[[True, False, True], [True, False, False, False]] # array([0, 8]) OK, you can reason about that, but only because there is a single boolean True in the second array (and then gets broadcast. arr[[True, False, True], [True, False, True, False]] # array([ 0, 10]) Ok, we can reason about this, but at that point we have to align the True values from the first index with those from the second (effectively convert the two indices to integer ones in our heads). But what is the meaning of aligning true values? I am sure there is none, except in very special cases. To proof this, lets try: arr[[True, True, True], [True, False, True, False]] which gives a broadcasting error :). So yeah, I guess you can find "meaning" for it but it seems just too strange, and even if you do using two integer indices will make things much clearer and less error prone. - Sebastian > Aaron Meurer > > > - Sebastian > > > > > > > Aaron Meurer > > > > > > On Thu, Aug 20, 2020 at 3:56 PM Sebastian Berg > > > wrote: > > > > On Thu, 2020-08-20 at 16:50 -0500, Sebastian Berg wrote: > > > > > On Thu, 2020-08-20 at 12:21 -0600, Aaron Meurer wrote: > > > > > > You're right. I was confusing the broadcasting logic for > > > > > > boolean > > > > > > arrays. 
> > > > > > > > > > > > However, I did find this example > > > > > > > > > > > > > > > np.arange(10).reshape((2, 5))[np.array([[0, 0, 0, 0, > > > > > > > > > 0]], > > > > > > > > > dtype=np.int64), False] > > > > > > Traceback (most recent call last): > > > > > > File "", line 1, in > > > > > > IndexError: shape mismatch: indexing arrays could not be > > > > > > broadcast > > > > > > together with shapes (1,5) (0,) > > > > > > > > > > > > That certainly seems to imply there is some broadcasting > > > > > > being > > > > > > done. > > > > > > > > > > Yes, it broadcasts the array after converting it with > > > > > `nonzero`, > > > > > i.e. > > > > > its much the same as: > > > > > > > > > > indices = [[0, 0, 0, 0, 0]], *np.nonzero(False) > > > > > indices = np.broadcast_arrays(*indices) > > > > > > > > > > will give the same result (see also `np.ix_` which converts > > > > > booleans > > > > > as > > > > > well for this reason, to give you outer indexing). > > > > > I was half way through a mock-up/pseudo code, but thought you > > > > > likely > > > > > wasn't sure it was ending up clear. It sounds like things are > > > > > probably > > > > > falling into place for you (if they are not, let me know what > > > > > might > > > > > help you): > > > > > > > > Sorry editing error up there, in short I hope those steps sense > > > > to > > > > you, > > > > note that the broadcasting is basically part of a later > > > > "integer > > > > only" > > > > indexing step, and the `nonzero` part is pre-processing. > > > > > > > > > 1. Convert all boolean indices into a series of integer > > > > > indices > > > > > using > > > > > `np.nonzero(index)` > > > > > > > > > > 2. For True/False scalars, that doesn't work, because > > > > > `np.nonzero()`. > > > > > > > > > > `nonzero` gave us an index array (which is good, we > > > > > obviously > > > > > want > > > > > > > > > > one), but we need to index into `boolean_index.ndim == 0` > > > > > dimensions! > > > > > So that won't work, the approach using `nonzero` cannot > > > > > generalize > > > > > > > > > > here, although boolean indices generalize perfectly. > > > > > > > > > > The solution to the dilemma is simple: If we have to index > > > > > one > > > > > dimension, but should be indexing zero, then we simply add > > > > > that > > > > > dimension to the original array (or at least pretend there > > > > > was > > > > > an additional dimension). > > > > > > > > > > 3. Do normal indexing with the result *including > > > > > broadcasting*, > > > > > we forget it was converted. > > > > > > > > > > The other way to solve it would be to always reshape the > > > > > original > > > > > array > > > > > to combine all axes being indexed by a single boolean index > > > > > into > > > > > one > > > > > axis and then index it using `np.flatnonzero`. (But that > > > > > would > > > > > get a > > > > > different result if you try to broadcast!) > > > > > > > > > > > > > > > In any case, I am not sure I would bother with making sense > > > > > of > > > > > this, > > > > > except for sports! > > > > > Its pretty much nonsense and I think the time understanding > > > > > it is > > > > > probably better spend deprecating it. The only reason I did > > > > > not > > > > > Deprecate itt before, is that I tried to do be minimal in the > > > > > changes > > > > > when I rewrote advanced indexing (and generalized boolean > > > > > scalars > > > > > correctly) long ago. 
That was likely the right start/choice > > > > > at > > > > > the > > > > > time, since there were much bigger fish to catch, but I do > > > > > not > > > > > think > > > > > anything is holding us back now. > > > > > > > > > > Cheers, > > > > > > > > > > Sebastian > > > > > > > > > > > > > > > > Aaron Meurer > > > > > > > > > > > > On Wed, Aug 19, 2020 at 6:55 PM Sebastian Berg > > > > > > wrote: > > > > > > > On Wed, 2020-08-19 at 18:07 -0600, Aaron Meurer wrote: > > > > > > > > > > 3. If you have multiple advanced indexing you get > > > > > > > > > > annoying > > > > > > > > > > broadcasting > > > > > > > > > > of all of these. That is *always* confusing for > > > > > > > > > > boolean > > > > > > > > > > indices. > > > > > > > > > > 0-D should not be too special there... > > > > > > > > > > > > > > > > OK, now that I am learning more about advanced > > > > > > > > indexing, > > > > > > > > this > > > > > > > > statement is confusing to me. It seems that scalar > > > > > > > > boolean > > > > > > > > indices do > > > > > > > > not broadcast. For example: > > > > > > > > > > > > > > Well, broadcasting means you broadcast the *nonzero > > > > > > > result* > > > > > > > unless > > > > > > > I am > > > > > > > very confused... There is a reason I dismissed it. We > > > > > > > could > > > > > > > (and > > > > > > > arguably should) just deprecate it. And I have doubts > > > > > > > anyone > > > > > > > would > > > > > > > even notice. > > > > > > > > > > > > > > > > > > np.arange(2)[False, np.array([True, False])] > > > > > > > > array([], dtype=int64) > > > > > > > > > > > np.arange(2)[tuple(np.broadcast_arrays(False, > > > > > > > > > > > np.array([True, > > > > > > > > > > > False])))] > > > > > > > > Traceback (most recent call last): > > > > > > > > File "", line 1, in > > > > > > > > IndexError: too many indices for array: array is 1- > > > > > > > > dimensional, > > > > > > > > but 2 > > > > > > > > were indexed > > > > > > > > > > > > > > > > And indeed, the docs even say, as you noted, "the > > > > > > > > nonzero > > > > > > > > equivalence > > > > > > > > for Boolean arrays does not hold for zero dimensional > > > > > > > > boolean > > > > > > > > arrays," > > > > > > > > which I guess also applies to the broadcasting. > > > > > > > > > > > > > > I actually think that probably also holds. Nonzero just > > > > > > > behave > > > > > > > weird > > > > > > > for 0D because arrays (because it returns a tuple). > > > > > > > But since broadcasting the nonzero result is so weird, > > > > > > > and > > > > > > > since > > > > > > > 0- > > > > > > > D > > > > > > > booleans require some additional logic and don't > > > > > > > generalize > > > > > > > 100% > > > > > > > (code > > > > > > > wise), I won't rule out there are differences. > > > > > > > > > > > > > > > From what I can tell, the logic is that all integer and > > > > > > > > boolean > > > > > > > > arrays > > > > > > > > > > > > > > Did you try that? Because as I said above, IIRC > > > > > > > broadcasting > > > > > > > the > > > > > > > boolean array without first calling `nonzero` isn't > > > > > > > really > > > > > > > whats > > > > > > > going > > > > > > > on. And I don't know how it could be whats going on, > > > > > > > since > > > > > > > adding > > > > > > > dimensions to a boolean index would have much more > > > > > > > implications? 
> > > > > > > > > > > > > > - Sebastian > > > > > > > > > > > > > > > > > > > > > > (and scalar ints) are broadcast together, *except* for > > > > > > > > boolean > > > > > > > > scalars. Then the first boolean scalar is replaced with > > > > > > > > and(all > > > > > > > > boolean scalars) and the rest are removed from the > > > > > > > > index. > > > > > > > > Then > > > > > > > > that > > > > > > > > index adds a length 1 axis if it is True and 0 if it is > > > > > > > > False. > > > > > > > > > > > > > > > > So they don't broadcast, but rather "fake broadcast". I > > > > > > > > still > > > > > > > > contend > > > > > > > > that it would be much more useful, if True were a > > > > > > > > synonym > > > > > > > > for > > > > > > > > newaxis > > > > > > > > and False worked like newaxis but instead added a > > > > > > > > length 0 > > > > > > > > axis. > > > > > > > > Alternately, True and False scalars should behave > > > > > > > > exactly > > > > > > > > like > > > > > > > > all > > > > > > > > other boolean arrays with no exceptions (i.e., work > > > > > > > > like > > > > > > > > np.nonzero(), > > > > > > > > broadcast, etc.). This would be less useful, but more > > > > > > > > consistent. > > > > > > > > > > > > > > > > Aaron Meurer > > > > > > > > _______________________________________________ > > > > > > > > NumPy-Discussion mailing list > > > > > > > > NumPy-Discussion at python.org > > > > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > > > > > > > > > > > > > _______________________________________________ > > > > > > > NumPy-Discussion mailing list > > > > > > > NumPy-Discussion at python.org > > > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > _______________________________________________ > > > > > > NumPy-Discussion mailing list > > > > > > NumPy-Discussion at python.org > > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > > > > > > > > > _______________________________________________ > > > > > NumPy-Discussion mailing list > > > > > NumPy-Discussion at python.org > > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > > _______________________________________________ > > > > NumPy-Discussion mailing list > > > > NumPy-Discussion at python.org > > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From ottohirr at gmail.com Fri Aug 21 17:32:03 2020 From: ottohirr at gmail.com (Otto Hirr) Date: Fri, 21 Aug 2020 14:32:03 -0700 Subject: [Numpy-discussion] Migrating code to eliminate references to numpy/core/include/numpy/npy_3kcompat.h (?) Message-ID: Greetings, tl;dr: Need to remove npy_3kcompat.h from any source code location since Python2 is no longer supported. 
Background: Ran into an error after compiling various apps regarding npy_PyErr_ChainExceptionsCause() being undefined at runtime. Poking around shows that this is related to npy_3kcompat.h, which seems to be for Python2 compatibility. (There is a differently named counterpart for Python3.) This seems to be used in a couple of numpy/core/src/multiarray files. Seems that since Python2 is no longer supported, cleaning up cruft of P2 will improve code and should eliminate the problem I ran into, (or else move an error report to something else...) Is this reasonable to eliminate? Regards, ..Otto (Still getting my feet wet in numpy, scipy, etc.) From charlesr.harris at gmail.com Fri Aug 21 22:49:06 2020 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 21 Aug 2020 20:49:06 -0600 Subject: [Numpy-discussion] Migrating code to eliminate references to numpy/core/include/numpy/npy_3kcompat.h (?) In-Reply-To: References: Message-ID: On Fri, Aug 21, 2020 at 3:32 PM Otto Hirr wrote: > Greetings, > > tl;dr: > Need to remove npy_3kcompat.h from any source code location since > Python2 is no longer supported. > > Background: > Ran into an error after compiling various apps regarding > npy_PyErr_ChainExceptionsCause() being undefined at runtime. > > Poking around shows that this is related to npy_3kcompat.h, which > seems to be for Python2 compatibility. > (There is a differently named counterpart for Python3.) > > This seems to be used in a couple of numpy/core/src/multiarray files. > > Seems that since Python2 is no longer supported, cleaning up cruft of > P2 will improve code and should eliminate the problem I ran into, (or > else move an error report to something else...) > > Is this reasonable to eliminate? > > The migration is already underway, but will take time. Even after NumPy is clean we will keep it because we also need to maintain compatibility across Python 3 versions and to avoid breaking downstream projects who might be using it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Fri Aug 21 23:39:53 2020 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 21 Aug 2020 21:39:53 -0600 Subject: [Numpy-discussion] Feature requests/Enhancements for upper-level engineering students In-Reply-To: <1597943458736-0.post@n7.nabble.com> References: <1597943458736-0.post@n7.nabble.com> Message-ID: On Thu, Aug 20, 2020 at 11:11 AM cooperrc wrote: > Greetings, > As the Fall semester is fast approaching (10 days away for us at UConn), we > are looking for senior design (also called capstone) projects for the > 2020-2021 school year. The COVID situation has strengthened the need for > remote work. > The process here is that students are assigned to projects by late > September. Then, they have 6 main deliverables over the course of 2 > semesters: > 1. Initial Fall Presentation (~Oct) > 2. Final Fall Presentation (~Dec) > 3. Mid-year report (~Jan) > 4. Initial Spring Presentation (~Mar) > 5. Final Spring Presntation (~Apr) > 6. Final report (~May) > > My question to the NumPy community is: Are there any features or > enhancements that would be nice to have, but might not have a team > dedicated > to the idea? > > I would be happy to advise any projects that people are interested in > proposing. I would like to hear what people think would be worthwhile for > students to build together. 
Some background, these students have all used > Python and Matlab for mechanical engineering applications like linear > regression, modal analyses, ode integration, and root solving. They learn > quickly, but may not be interested in UX/UI design problems. > > > Thanks for the inquiry. We are always looking for new people who have the time and inclination to make a contribution to NumPy, but NumPy core probably isn't a good choice for class projects. Work on NumPy core requires C and CPython C-API expertise and experienced programmers generally take 3-6 months to come up to speed, the learning curve is just too steep for most students. NumPy also needs to be very careful about maintaining compatibility with existing downstream projects and in introducing new features. I suspect students would enjoy a faster moving project. There is a lot of work on the website and online documentation that is moving faster than NumPy core, but that sounds like it might be out of scope for your classes. If not, let us know. If you can think of new projects based on NumPy, that might work better. They could be written in Python and the students could release them on PyPI if so inclined. I suspect there are several ongoing projects that are more engineering oriented than NumPy and the current Python Science stack could use more engineering applications. Perhaps others more familiar with that area could make suggestions. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From asmeurer at gmail.com Mon Aug 24 17:31:42 2020 From: asmeurer at gmail.com (Aaron Meurer) Date: Mon, 24 Aug 2020 15:31:42 -0600 Subject: [Numpy-discussion] Why does fancy indexing work like this? In-Reply-To: References: <66ed40caf93a7f24672ce511370ce38176a6eff1.camel@sipsolutions.net> <3f4ba7b2363e17e1d08be90f19aff38ae9860eca.camel@sipsolutions.net> <8773e52959d499e668f06d60d49a068bc6a77d43.camel@sipsolutions.net> Message-ID: On Wed, Aug 19, 2020 at 8:18 PM Sebastian Berg wrote: > > On Wed, 2020-08-19 at 19:37 -0600, Aaron Meurer wrote: > > These cases don't give any deprecation warnings in NumPy master: > > > > > > > np.arange(0)[np.array([0]), False] > > array([], dtype=int64) > > > > > np.arange(0).reshape((0, 0))[np.array([0]), np.array([], > > > > > dtype=int)] > > array([], dtype=int64) > > > > Is that intentional? > > I guess it follows from `np.array([[1]])[[], [10]]` also not failing > currently. Sure, I think that's the same thing (sorry if my example is "too trivial". I was copy-pasting a hypothesis shrunk example). > > And that was intentional not to deprecate when out-of-bound indices > broadcast away. But I am not sure I actually think that was the better > choice. My initial choice was that this would be an error as well, and > I still slightly prefer it, but don't feel it matters much. There's an inconsistency here, which is that out-of-bounds indices that are broadcast away are not bounds checked unless they are scalar indices, in which case they are. 
>>> a = np.empty((1, 1)) >>> a[np.array([], dtype=int), np.array([10])] array([], dtype=float64) >>> a[np.array([], dtype=int), 10] Traceback (most recent call last): File "", line 1, in IndexError: index 10 is out of bounds for axis 1 with size 1 >>> np.broadcast_arrays(np.array([], dtype=int), np.array([10])) [array([], dtype=int64), array([], dtype=int64)] >>> np.broadcast_arrays(np.array([], dtype=int), 10) [array([], dtype=int64), array([], dtype=int64)] This breaks the rule that scalar integer indices have the same semantics as integer arrays with shape (). Aaron Meurer > > - Sebastian > > > > > Aaron Meurer > > > > On Thu, Jul 23, 2020 at 12:18 PM Aaron Meurer > > wrote: > > > > After writing this, I realized that I actually remember the > > > > *opposite* > > > > discussion occurring before. I think in some of the equality > > > > deprecations, we actually raise the new error due to an internal > > > > try/except clause. And there was a complaint that its confusing > > > > that a > > > > non-deprecation-warning is raised when the error will only happen > > > > with > > > > DeprecationWarnings being set to error. > > > > > > > > - Sebastian > > > > > > I noticed that warnings.catch_warnings does the right thing with > > > warnings that are raised alongside an exception (although it is a > > > bit > > > clunky to use). > > > > > > Aaron Meurer > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion From sebastian at sipsolutions.net Mon Aug 24 19:07:42 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Mon, 24 Aug 2020 18:07:42 -0500 Subject: [Numpy-discussion] Why does fancy indexing work like this? In-Reply-To: References: <66ed40caf93a7f24672ce511370ce38176a6eff1.camel@sipsolutions.net> <3f4ba7b2363e17e1d08be90f19aff38ae9860eca.camel@sipsolutions.net> <8773e52959d499e668f06d60d49a068bc6a77d43.camel@sipsolutions.net> Message-ID: <3adb973e04b621eb643b016b61be9ce42bdcf3b6.camel@sipsolutions.net> On Mon, 2020-08-24 at 15:31 -0600, Aaron Meurer wrote: > On Wed, Aug 19, 2020 at 8:18 PM Sebastian Berg > wrote: > > On Wed, 2020-08-19 at 19:37 -0600, Aaron Meurer wrote: > > > These cases don't give any deprecation warnings in NumPy master: > > > > > > > > > np.arange(0)[np.array([0]), False] > > > array([], dtype=int64) > > > > > > np.arange(0).reshape((0, 0))[np.array([0]), np.array([], > > > > > > dtype=int)] > > > array([], dtype=int64) > > > > > > Is that intentional? > > > > I guess it follows from `np.array([[1]])[[], [10]]` also not > > failing > > currently. > > Sure, I think that's the same thing (sorry if my example is "too > trivial". I was copy-pasting a hypothesis shrunk example). > > > And that was intentional not to deprecate when out-of-bound indices > > broadcast away. But I am not sure I actually think that was the > > better > > choice. My initial choice was that this would be an error as well, > > and > > I still slightly prefer it, but don't feel it matters much. > > There's an inconsistency here, which is that out-of-bounds indices > that are broadcast away are not bounds checked unless they are scalar > indices, in which case they are. 
> > > > > a = np.empty((1, 1)) > > > > a[np.array([], dtype=int), np.array([10])] > array([], dtype=float64) > > > > a[np.array([], dtype=int), 10] > Traceback (most recent call last): > File "", line 1, in > IndexError: index 10 is out of bounds for axis 1 with size 1 > > > > np.broadcast_arrays(np.array([], dtype=int), np.array([10])) > [array([], dtype=int64), array([], dtype=int64)] > > > > np.broadcast_arrays(np.array([], dtype=int), 10) > [array([], dtype=int64), array([], dtype=int64)] > > This breaks the rule that scalar integer indices have the same > semantics as integer arrays with shape (). > Good observation. I agree, that is a subtle inconsistency for 0-D objects! (To be precise, I expect 0-D arrays behave identically to integers, since they will be optimized out of the "advanced index" part of the indexing operation). I suppose this may be an argument for always checking indices even when they are broadcast away? I am not certain how straight forward, or even desirable, it is to fix it so that 0-D integer arrays/integers can be "broadcast away". - Sebastian > Aaron Meurer > > > - Sebastian > > > > > Aaron Meurer > > > > > > On Thu, Jul 23, 2020 at 12:18 PM Aaron Meurer > > > > > > wrote: > > > > > After writing this, I realized that I actually remember the > > > > > *opposite* > > > > > discussion occurring before. I think in some of the equality > > > > > deprecations, we actually raise the new error due to an > > > > > internal > > > > > try/except clause. And there was a complaint that its > > > > > confusing > > > > > that a > > > > > non-deprecation-warning is raised when the error will only > > > > > happen > > > > > with > > > > > DeprecationWarnings being set to error. > > > > > > > > > > - Sebastian > > > > > > > > I noticed that warnings.catch_warnings does the right thing > > > > with > > > > warnings that are raised alongside an exception (although it is > > > > a > > > > bit > > > > clunky to use). > > > > > > > > Aaron Meurer > > > _______________________________________________ > > > NumPy-Discussion mailing list > > > NumPy-Discussion at python.org > > > https://mail.python.org/mailman/listinfo/numpy-discussion > > > > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at python.org > > https://mail.python.org/mailman/listinfo/numpy-discussion > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From sebastian at sipsolutions.net Tue Aug 25 12:31:48 2020 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Tue, 25 Aug 2020 11:31:48 -0500 Subject: [Numpy-discussion] NumPy Development Meeting Today - Triage Focus Message-ID: <5877079dc9ab10c71bd1b6a7033e937a171a1a6c.camel@sipsolutions.net> Hi all, Our bi-weekly triage-focused NumPy development meeting is tomorrow (Wednesday, August 26th) at 11 am Pacific Time (18:00 UTC). Everyone is invited to join in and edit the work-in-progress meeting topics and notes: https://hackmd.io/68i_JvOYQfy9ERiHgXMPvg I encourage everyone to notify us of issues or PRs that you feel should be prioritized or simply discussed briefly. Just comment on it so we can label it, or add your PR/issue to this weeks topics for discussion. 
Best regards Sebastian -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 833 bytes Desc: This is a digitally signed message part URL: From charlesr.harris at gmail.com Thu Aug 27 17:50:17 2020 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 27 Aug 2020 15:50:17 -0600 Subject: [Numpy-discussion] Dropping manylinux1 wheels for NumPy 1.20. Message-ID: Hi All, The 32 bit manylinux1 wheels are proving problematic, see https://github.com/numpy/numpy/issues/17174. One proposed solution is to only release manylinux2010 linux wheels for the NumPy 1.20 release. Thoughts? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Fri Aug 28 10:37:15 2020 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 28 Aug 2020 15:37:15 +0100 Subject: [Numpy-discussion] Dropping manylinux1 wheels for NumPy 1.20. In-Reply-To: References: Message-ID: Hi, On Thu, Aug 27, 2020 at 10:51 PM Charles R Harris wrote: > > Hi All, > > The 32 bit manylinux1 wheels are proving problematic, see https://github.com/numpy/numpy/issues/17174. One proposed solution is to only release manylinux2010 linux wheels for the NumPy 1.20 release. Thoughts? I think it may still be too early to discontinue manylinux1, sadly. Systems requiring manylinux1 are those with: pip < 19.0 (Jan 2019) [1] Linux distribution older than around 2010 (glibc < 2.12) [2] I did a PyPI BigQuery [3] just now, editing to give results for 32, and 64 bit (by changing the manylinux wheel name matching regexp). Then I processed a bit with Pandas [4]. It looks like about 34% of PyPI manylinux*_i686 downloads are for systems that actually need manylinux1, and about 17% of manylinux*_x86_64. See the table in [4] for a listing of the top 10 entries. Cheers, Matthew [1] https://github.com/pypa/manylinux [2] https://www.python.org/dev/peps/pep-0571/ [3] https://gist.github.com/e3901b344b8d81f5633908347b1b333e [4] https://gist.github.com/0f624ddbc34bc3db8bcae23e3eeb7b54 From ralf.gommers at gmail.com Fri Aug 28 11:50:20 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 28 Aug 2020 16:50:20 +0100 Subject: [Numpy-discussion] Dropping manylinux1 wheels for NumPy 1.20. In-Reply-To: References: Message-ID: On Fri, Aug 28, 2020 at 3:38 PM Matthew Brett wrote: > Hi, > > On Thu, Aug 27, 2020 at 10:51 PM Charles R Harris > wrote: > > > > Hi All, > > > > The 32 bit manylinux1 wheels are proving problematic, see > https://github.com/numpy/numpy/issues/17174. One proposed solution is to > only release manylinux2010 linux wheels for the NumPy 1.20 release. > Thoughts? > > I think it may still be too early to discontinue manylinux1, sadly. > > Systems requiring manylinux1 are those with: > > pip < 19.0 (Jan 2019) [1] > Linux distribution older than around 2010 (glibc < 2.12) [2] > > I did a PyPI BigQuery [3] just now, editing to give results for 32, > and 64 bit (by changing the manylinux wheel name matching regexp). > > Then I processed a bit with Pandas [4]. > > It looks like about 34% of PyPI manylinux*_i686 downloads are for > systems that actually need manylinux1, Note that a large fraction of that will be CI systems that get default Ubuntu pip (18.1 mostly), and could be very easily updated just like we do in our own CI (a simple `pip install -U pip`). If you really want to get to a small percentage of pip <19.1, we can wait for another 5 years. Which seems undesirable. 
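(As an aside, for anyone wondering which side of the line a particular box falls on: the two criteria Matthew listed, pip >= 19.0 and glibc >= 2.12, can be checked locally with a rough sketch like the one below. This is only an illustration of the cut-offs quoted in this thread, not the real wheel-tag selection logic, which lives in pip/packaging; the helper calls used here, `platform.libc_ver()` and `LooseVersion`, are convenient stand-ins and not what pip itself uses.)

```
# Rough local check of the manylinux2010 criteria quoted in this thread
# (pip >= 19.0 and glibc >= 2.12).  Illustration only.
import platform

import pip
from distutils.version import LooseVersion

pip_ok = LooseVersion(pip.__version__) >= LooseVersion("19.0")

# libc the Python interpreter was linked against, e.g. ('glibc', '2.31');
# returns ('', '') on non-glibc platforms, which the check treats as "no".
libc, libc_version = platform.libc_ver()
glibc_ok = libc == "glibc" and LooseVersion(libc_version) >= LooseVersion("2.12")

if pip_ok and glibc_ok:
    print("manylinux2010 wheels are usable here")
else:
    print("this environment would still need manylinux1 wheels (or an sdist)")
```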
While I agree that we can keep manylinux1 around for a little longer (maybe another year or so?), gating it on Linux distro pip version defaults would be odd. Pip is still very immature; people with a several years old Pip would be well-served by having to upgrade it. Cheers, Ralf and about 17% of > manylinux*_x86_64. See the table in [4] for a listing of the top 10 > entries. > > Cheers, > > Matthew > > [1] https://github.com/pypa/manylinux > [2] https://www.python.org/dev/peps/pep-0571/ > [3] https://gist.github.com/e3901b344b8d81f5633908347b1b333e > [4] https://gist.github.com/0f624ddbc34bc3db8bcae23e3eeb7b54 > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Fri Aug 28 12:00:01 2020 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 28 Aug 2020 17:00:01 +0100 Subject: [Numpy-discussion] Dropping manylinux1 wheels for NumPy 1.20. In-Reply-To: References: Message-ID: Hi, Updated for Numpy wheels only - BigQuery [1], Notebook [2]. 41% of 32-bit wheels need manylinux1, 30% of 64-bit wheels. Ralf - agreed we shouldn't wait too long for old pip - but maybe we need to think of some way of reminding people with old pip to upgrade? Cheers, Matthew [1] https://gist.github.com/dc410698ca9e422aec08e4554eac6678 [2] https://gist.github.com/77879cb58b28b3d05c3c14b8a45687e8 On Fri, Aug 28, 2020 at 3:37 PM Matthew Brett wrote: > > Hi, > > On Thu, Aug 27, 2020 at 10:51 PM Charles R Harris > wrote: > > > > Hi All, > > > > The 32 bit manylinux1 wheels are proving problematic, see https://github.com/numpy/numpy/issues/17174. One proposed solution is to only release manylinux2010 linux wheels for the NumPy 1.20 release. Thoughts? > > I think it may still be too early to discontinue manylinux1, sadly. > > Systems requiring manylinux1 are those with: > > pip < 19.0 (Jan 2019) [1] > Linux distribution older than around 2010 (glibc < 2.12) [2] > > I did a PyPI BigQuery [3] just now, editing to give results for 32, > and 64 bit (by changing the manylinux wheel name matching regexp). > > Then I processed a bit with Pandas [4]. > > It looks like about 34% of PyPI manylinux*_i686 downloads are for > systems that actually need manylinux1, and about 17% of > manylinux*_x86_64. See the table in [4] for a listing of the top 10 > entries. > > Cheers, > > Matthew > > [1] https://github.com/pypa/manylinux > [2] https://www.python.org/dev/peps/pep-0571/ > [3] https://gist.github.com/e3901b344b8d81f5633908347b1b333e > [4] https://gist.github.com/0f624ddbc34bc3db8bcae23e3eeb7b54 From melissawm at gmail.com Fri Aug 28 14:49:11 2020 From: melissawm at gmail.com (=?UTF-8?Q?Melissa_Mendon=C3=A7a?=) Date: Fri, 28 Aug 2020 15:49:11 -0300 Subject: [Numpy-discussion] Documentation Team meeting - Monday August 31 In-Reply-To: References: Message-ID: Hi all! This is a reminder that our next Documentation Team meeting will be on *Monday, August 31* at 3PM UTC**. If you wish to join on Zoom, you need to use this link https://zoom.us/j/420005230 Here's the permanent hackmd document with the meeting notes (still being updated in the next few days!): https://hackmd.io/oB_boakvRqKR-_2jRV-Qjg Hope to see you around! 
** You can click this link to get the correct time at your timezone: https://www.timeanddate.com/worldclock/fixedtime.html?msg=NumPy+Documentation+Team+Meeting&iso=20200831T15&p1=1440&ah=1 *** You can add the NumPy community calendar to your google calendar by clicking this link: https://calendar.google.com/calendar/r?cid=YmVya2VsZXkuZWR1X2lla2dwaWdtMjMyamJobGRzZmIyYzJqODFjQGdyb3VwLmNhbGVuZGFyLmdvb2dsZS5jb20 - Melissa -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Fri Aug 28 16:06:09 2020 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Fri, 28 Aug 2020 21:06:09 +0100 Subject: [Numpy-discussion] Dropping manylinux1 wheels for NumPy 1.20. In-Reply-To: References: Message-ID: On Fri, Aug 28, 2020 at 5:01 PM Matthew Brett wrote: > Hi, > > Updated for Numpy wheels only - BigQuery [1], Notebook [2]. > > 41% of 32-bit wheels need manylinux1, 30% of 64-bit wheels. > Thanks for doing the analysis, very useful data. > Ralf - agreed we shouldn't wait too long for old pip - but maybe we > need to think of some way of reminding people with old pip to upgrade? > I don't think there's much we can do unfortunately. That's up to Pip itself, and it may have good reasons not to nag people to upgrade. Cheers, Ralf > Cheers, > > Matthew > > [1] https://gist.github.com/dc410698ca9e422aec08e4554eac6678 > [2] https://gist.github.com/77879cb58b28b3d05c3c14b8a45687e8 > > On Fri, Aug 28, 2020 at 3:37 PM Matthew Brett > wrote: > > > > Hi, > > > > On Thu, Aug 27, 2020 at 10:51 PM Charles R Harris > > wrote: > > > > > > Hi All, > > > > > > The 32 bit manylinux1 wheels are proving problematic, see > https://github.com/numpy/numpy/issues/17174. One proposed solution is to > only release manylinux2010 linux wheels for the NumPy 1.20 release. > Thoughts? > > > > I think it may still be too early to discontinue manylinux1, sadly. > > > > Systems requiring manylinux1 are those with: > > > > pip < 19.0 (Jan 2019) [1] > > Linux distribution older than around 2010 (glibc < 2.12) [2] > > > > I did a PyPI BigQuery [3] just now, editing to give results for 32, > > and 64 bit (by changing the manylinux wheel name matching regexp). > > > > Then I processed a bit with Pandas [4]. > > > > It looks like about 34% of PyPI manylinux*_i686 downloads are for > > systems that actually need manylinux1, and about 17% of > > manylinux*_x86_64. See the table in [4] for a listing of the top 10 > > entries. > > > > Cheers, > > > > Matthew > > > > [1] https://github.com/pypa/manylinux > > [2] https://www.python.org/dev/peps/pep-0571/ > > [3] https://gist.github.com/e3901b344b8d81f5633908347b1b333e > > [4] https://gist.github.com/0f624ddbc34bc3db8bcae23e3eeb7b54 > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From p.j.a.cock at googlemail.com Sat Aug 29 14:30:44 2020 From: p.j.a.cock at googlemail.com (Peter Cock) Date: Sat, 29 Aug 2020 19:30:44 +0100 Subject: [Numpy-discussion] Staging Biopython wheels on Anaconda.org, was: Replacement for Rackspace In-Reply-To: References: <2B5B9B49-80D8-46A8-B12F-84C438A0ED4D@hxcore.ol> <304cbb1c-1871-0da4-097c-70d1c7c6d8e9@gmail.com> Message-ID: Hi Matti, If your offer still stands, I'd like to transition Biopython from the donated Rackspace storage to Anaconda as discussed a few weeks back. 
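(For the record, my working assumption about the upload step itself is sketched below; the organization name comes from Matti's earlier message quoted further down, while the ANACONDA_STAGING_TOKEN variable name, the wheelhouse/ directory and the exact anaconda-client invocation are guesses on my part, so please correct me if the multibuild setup does this differently.)

# Illustrative sketch only: push freshly built wheels to the staging
# organization with the anaconda-client CLI, using a token exposed to CI
# through an environment variable (the variable name here is hypothetical).
import glob
import os
import subprocess

token = os.environ["ANACONDA_STAGING_TOKEN"]  # hypothetical variable name
for wheel in sorted(glob.glob("wheelhouse/*.whl")):
    subprocess.run(
        ["anaconda", "-t", token, "upload",
         "--user", "multibuild-wheels-staging", wheel],
        check=True,
    )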
I presume this means updating the WHEELHOUSE credentials in our multi-wheel repository, both for TravisCI and AppVeyor? https://github.com/biopython/biopython-wheels Do you need any other information from me, or the Biopython team? Thank you, Peter, (On behalf of Biopython) On Tue, Aug 11, 2020 at 6:34 AM Matti Picus wrote: > > On 8/11/20 12:39 AM, Peter Cock wrote: > > Hi Matti, > > Is this an open invitation to the wider Numpy ecosystem? I am > interested on behalf of Biopython which was using the donated > Rackspace for multibuild wheel staging prior to PyPy release > (although having weekly test releases sounds interesting too). > > I would be happy to continue this discussion off list if you prefer, > > Thank you, > > Peter > > On Mon, Aug 10, 2020 at 9:20 PM Matti Picus wrote: > >> anaconda is generously hosting projects at >> https://anaconda.org/scipy-wheels-nightly/ (for weekly development >> releases that can be used to test downstream projects) and >> https://anaconda.org/multibuild-wheels-staging (for staging wheels to be >> tested for release on PyPI). >> >> >> The trick is that CI needs a token so it can upload to those >> organizations. Kevin, we can either add you to the groups you can create >> a token, or one of the current members could create tokens and transport >> them safely to Kevin. Please disucss it with me (or one of the other >> members https://anaconda.org/multibuild-wheels-staging/groups). >> >> Matti > > > Yes, it is a general invitation. My mail is in the message. > > I guess we should set up some kind of janitor task to remove older > packages from the hosting space as usage goes up. > > Matti > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at python.org > https://mail.python.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ryan.c.cooper at uconn.edu Sat Aug 29 15:34:16 2020 From: ryan.c.cooper at uconn.edu (cooperrc) Date: Sat, 29 Aug 2020 12:34:16 -0700 (MST) Subject: [Numpy-discussion] Feature requests/Enhancements for upper-level engineering students In-Reply-To: References: <1597943458736-0.post@n7.nabble.com> Message-ID: <1598729656988-0.post@n7.nabble.com> Charles R Harris wrote > On Thu, Aug 20, 2020 at 11:11 AM cooperrc < > ryan.c.cooper@ > > wrote: > >> Greetings, >> As the Fall semester is fast approaching (10 days away for us at UConn), >> we >> are looking for senior design (also called capstone) projects for the >> 2020-2021 school year. The COVID situation has strengthened the need for >> remote work. >> The process here is that students are assigned to projects by late >> September. Then, they have 6 main deliverables over the course of 2 >> semesters: >> 1. Initial Fall Presentation (~Oct) >> 2. Final Fall Presentation (~Dec) >> 3. Mid-year report (~Jan) >> 4. Initial Spring Presentation (~Mar) >> 5. Final Spring Presntation (~Apr) >> 6. Final report (~May) >> >> My question to the NumPy community is: Are there any features or >> enhancements that would be nice to have, but might not have a team >> dedicated >> to the idea? >> >> I would be happy to advise any projects that people are interested in >> proposing. I would like to hear what people think would be worthwhile for >> students to build together. Some background, these students have all used >> Python and Matlab for mechanical engineering applications like linear >> regression, modal analyses, ode integration, and root solving. 
They learn >> quickly, but may not be interested in UX/UI design problems. >> >> >> > Thanks for the inquiry. We are always looking for new people who have the > time and inclination to make a contribution to NumPy, but NumPy core > probably isn't a good choice for class projects. Work on NumPy core > requires C and CPython C-API expertise and experienced programmers > generally take 3-6 months to come up to speed, the learning curve is just > too steep for most students. NumPy also needs to be very careful about > maintaining compatibility with existing downstream projects and in > introducing new features. I suspect students would enjoy a > faster moving project. > > There is a lot of work on the website and online documentation that is > moving faster than NumPy core, but that sounds like it might be out of > scope for your classes. If not, let us know. > > If you can think of new projects based on NumPy, that might work better. > They could be written in Python and the students could release them on > PyPI > if so inclined. I suspect there are several ongoing projects that are more > engineering oriented than NumPy and the current Python Science stack could > use more engineering applications. Perhaps others more familiar with that > area could make suggestions. > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion@ > https://mail.python.org/mailman/listinfo/numpy-discussion Thanks for the feedback Chuck. I'll poke around and brainstorm. -- Sent from: http://numpy-discussion.10968.n7.nabble.com/ From ryan.c.cooper at uconn.edu Sat Aug 29 15:33:18 2020 From: ryan.c.cooper at uconn.edu (cooperrc) Date: Sat, 29 Aug 2020 12:33:18 -0700 (MST) Subject: [Numpy-discussion] Feature requests/Enhancements for upper-level engineering students In-Reply-To: <1597958517453-0.post@n7.nabble.com> References: <1597943458736-0.post@n7.nabble.com> <1597958517453-0.post@n7.nabble.com> Message-ID: <1598729598952-0.post@n7.nabble.com> KevinBaselinesw wrote > would your team be interested in contributing to my port of Numpy to .NET? > > https://github.com/Quansight-Labs/numpy.net > > I have the vast majority of the Numpy core working as a pure .NET library. > > All of the other libraries that rely on Numpy are not ported. I am sure we > could find some good projects for your team to work on. These would be > "green field" projects and would likely be great learning opportunities > for > them. > > > > -- > Sent from: http://numpy-discussion.10968.n7.nabble.com/ > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion@ > https://mail.python.org/mailman/listinfo/numpy-discussion I don't have any experience in .NET, so I don't know how much help I could lend/advise projects. -- Sent from: http://numpy-discussion.10968.n7.nabble.com/ From einstein.edison at gmail.com Mon Aug 31 08:09:25 2020 From: einstein.edison at gmail.com (Hameer Abbasi) Date: Mon, 31 Aug 2020 14:09:25 +0200 Subject: [Numpy-discussion] [ANN] PyData/Sparse 0.11.1 Message-ID: <4cd080dc-8e39-4f24-9b7d-1391c21091da@Canary> Hello, I?m happy to announce the release of PyData/Sparse 0.11.1, available to download via pip and conda-forge. PyData/Sparse is a library that provides sparse N-dimensional arrays for the PyData ecosystem. This is a bugfix release, with a fix for the regression in dot for very small values. 
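(For anyone who has not used the library before, a minimal usage sketch of the dot path is below; the shapes and density are arbitrary, and this is a generic sanity check rather than a reproducer for the fixed regression.)

import numpy as np
import sparse

# Two random sparse 2-D arrays in COO format (about 10% non-zero entries).
a = sparse.random((500, 500), density=0.1)
b = sparse.random((500, 500), density=0.1)

# dot on sparse inputs returns a sparse result; compare with dense NumPy.
c = sparse.dot(a, b)
assert np.allclose(c.todense(), np.dot(a.todense(), b.todense()))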
The official website and documentation are available at: https://sparse.pydata.org The sources and bug tracker: https://github.com/pydata/sparse The changelog for this release can be viewed at: https://sparse.pydata.org/en/0.11.1/changelog.html Best Regards, Hameer Abbasi -- Sent from Canary (https://canarymail.io/) -------------- next part -------------- An HTML attachment was scrubbed... URL: