Hi all, The first stage of adding the `scipy.datasets` submodule is done as pointed out by Ralf. Thanks everyone involved! We now look forward to making some enhancements, maybe adding more datasets which might be important to the SciPy community. With pooch, it is now much easier to add a new dataset since we don't really add the dataset files inside the SciPy repo. I've created a Tracker Issue for scipy.datasets <https://github.com/scipy/scipy/issues/16983> mentioning the next few Todo Items. I'll get started with them after my vacation next week. Until then, it would be great if people here can share some thoughts on datasets that they would like to see inside `scipy.datasets` and if there were any past requests about adding a certain dataset that could be helpful to the whole community. A few interesting/useful ones are already mentioned on the tracker thanks to suggestions from Ralf. Cheers, Anirudh On Mon, Sep 5, 2022 at 10:22 AM Ralf Gommers <ralf.gommers@gmail.com> wrote:
On Sat, Jul 9, 2022 at 4:02 PM Anirudh Dagar <anirudhdagar6@gmail.com> wrote:
On Sat, Jul 9, 2022 at 2:33 PM Ralf Gommers <ralf.gommers@gmail.com> wrote:
One thing to be done, fairly easy to address, is to move the datasets with separate repos into the SciPy org and update the links <https://github.com/AnirudhDagar/scipy/blob/scipy-datasets/scipy/datasets/_registry.py#L14-L17> in the registry file. Currently, all of these are under https://github.com/scipy-datasets. Some follow-ups will include getting rid of the dataset files from the SciPy repo completely once the PR is approved and merged.
Moving those repos into the SciPy GitHub org seems indeed preferred, to ensure we can reuse our normal permissions management workflow, and don't have to maintain duplicate sets of permissions. There will be quite a few repos over time, however given that they're named `dataset-xxx` I don't see an issue with that.
I plan to move these repos sometime next week. If anyone has a concern, please let me know.
Sounds good! I guess you are already the owner at https://github.com/scipy-datasets, so you should have all the rights to transfer the repo's ownership to SciPy. If not, feel free to create new ones in SciPy.
Thanks, Anirudh
Just wanted to bring up the updates on the mailing list. Please let me know if you have any kind of feedback on Github. Thanks!
Best, Anirudh
On Fri, Feb 25, 2022 at 3:58 PM Ralf Gommers <ralf.gommers@gmail.com> wrote:
On Wed, Feb 23, 2022 at 10:11 PM Stephan Hoyer <shoyer@gmail.com> wrote:
On Tue, Feb 22, 2022 at 9:20 PM Ralf Gommers <ralf.gommers@gmail.com> wrote:
> If we were to keep them in SciPy, they might belong in >> scipy.optimize next to check_grad and approx_frime. But I don't think these >> functions (as written) are very useful. They have obvious computational >> inefficiencies and very limited functionality. I would rather point users >> to a fully functioning library for finite-differences like findiff: >> https://github.com/maroba/findiff >> > > Thanks Stephan! I didn't hear about findiff before. Would you > recommend it over https://github.com/pbrod/numdifftools? >
I haven't used either of them, it just came up in a search for finite differences in Python.
Okay, thanks Stephan. Both look good, so unless someone has practical experience and can make a recommendation for why one of these is preferred, we should probably list both in the deprecation notice.
This has all been completed now as proposed. `scipy.datasets` is a thing, and `scipy.misc` is deprecated. Thanks Anirudh for pushing this forward, and anyone else who helped get it merged!
Cheers, Ralf
_______________________________________________ SciPy-Dev mailing list -- scipy-dev@python.org To unsubscribe send an email to scipy-dev-leave@python.org https://mail.python.org/mailman3/lists/scipy-dev.python.org/ Member address: anirudhdagar6@gmail.com