R: scikit-learn Digest, Vol 11, Issue 32
Dear Gael,

This will probably be the case here, but we would like to exclude the scanner factor from the possible explanations. We are still lucky that we are not in a situation where the number of features >> number of samples.

Best,
Ludovico

-------- Original message --------
From: scikit-learn-request@python.org
Date: 27/02/17 23:49 (GMT+01:00)
To: scikit-learn@python.org
Subject: scikit-learn Digest, Vol 11, Issue 32

Send scikit-learn mailing list submissions to
    scikit-learn@python.org

To subscribe or unsubscribe via the World Wide Web, visit
    https://mail.python.org/mailman/listinfo/scikit-learn
or, via email, send a message with subject or body 'help' to
    scikit-learn-request@python.org

You can reach the person managing the list at
    scikit-learn-owner@python.org

When replying, please edit your Subject line so it is more specific than "Re: Contents of scikit-learn digest..."

Today's Topics:

   1. Re: scikit-learn Digest, Vol 11, Issue 29 (Gael Varoquaux)
   2. Re: GSoC 2017 (Joel Nothman)
   3. Re: GSoC 2017 (Pradeep Thalasta)

----------------------------------------------------------------------

Message: 1
Date: Mon, 27 Feb 2017 23:19:33 +0100
From: Gael Varoquaux <gael.varoquaux@normalesup.org>
To: Scikit-learn user and developer mailing list <scikit-learn@python.org>
Subject: Re: [scikit-learn] scikit-learn Digest, Vol 11, Issue 29
Message-ID: <20170227221933.GC2369856@phare.normalesup.org>
Content-Type: text/plain; charset=iso-8859-1

On Mon, Feb 27, 2017 at 10:13:04PM +0000, Ludovico Coletta wrote:
The data is stored in a NumPy array of shape (68, 24). We are using scikit-learn 0.18.1.
I saw that I wrote something wrong in my previous email. Your solution is indeed correct if we let scikit-learn decide how to manage the inner loop. This is what we did at the beginning. By doing so, we noticed that the classifier's performance decreased (in comparison to a non-optimised classifier).
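For readers following the thread, the setup described above (scikit-learn managing the inner loop, compared against a non-optimised classifier) can be sketched roughly as nested cross-validation. This is only an illustration under stated assumptions: the data is synthetic with the same shape (68, 24), and the classifier (a linear SVC), the parameter grid, and the fold counts are placeholders, since the thread does not say which ones were actually used.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Assumption: synthetic stand-in for the real data, same shape (68, 24).
X, y = make_classification(n_samples=68, n_features=24, n_informative=5,
                           random_state=0)

# Inner loop selects hyperparameters; outer loop estimates performance.
inner_cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
outer_cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)

# Assumption: estimator and grid are illustrative placeholders.
pipe = make_pipeline(StandardScaler(), SVC(kernel="linear"))
param_grid = {"svc__C": [0.01, 0.1, 1, 10]}

# GridSearchCV runs the inner loop on each outer training fold only,
# so the outer test folds never influence the choice of C.
tuned = GridSearchCV(pipe, param_grid, cv=inner_cv)
tuned_scores = cross_val_score(tuned, X, y, cv=outer_cv)

# Baseline: the same pipeline with default (non-optimised) hyperparameters.
default_scores = cross_val_score(pipe, X, y, cv=outer_cv)

print("tuned:   %.2f +/- %.2f" % (tuned_scores.mean(), tuned_scores.std()))
print("default: %.2f +/- %.2f" % (default_scores.mean(), default_scores.std()))
```

With so few samples, the spread across outer folds is typically large, so a gap between the tuned and default scores may well sit within that spread.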
With 68 samples, it is not that surprising that model selection with cross-validation is not able to select a good model. We found the same problem in brain imaging data [1], and it's an intrinsic problem due to small sample sizes: cross-validation is just not very accurate in these settings.

Gaël

[1] https://arxiv.org/abs/1606.05201

------------------------------

Message: 2
Date: Tue, 28 Feb 2017 09:34:43 +1100
From: Joel Nothman <joel.nothman@gmail.com>
To: Scikit-learn user and developer mailing list <scikit-learn@python.org>
Subject: Re: [scikit-learn] GSoC 2017
Message-ID: <CAAkaFLXVGAbBjYE2yA+8egUgRvoYnbOgO9QfYTeG0iF9bWUX+A@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi Pradeep,

We would usually only accept candidates who have shown their proficiency and understanding of our package and processes by making some contributions prior to this stage. You are certainly welcome to aim for GSoC 2018 by beginning to develop your familiarity and rapport now.

Cheers,
Joel

On 28 Feb 2017 7:01 am, "Pradeep Thalasta" <thalasta@usc.edu> wrote:
Hi, I'm new to open source contribution. Can I take part in GSoC as well?
On Mon, Feb 27, 2017 at 11:29 AM, Raghav R V <ragvrv@gmail.com> wrote:
Or simply a public gist and importantly the link mailed here would do I think...
On 27 Feb 2017 8:28 p.m., "Raghav R V" <ragvrv@gmail.com> wrote:
They can still edit a wiki page from their fork of scikit-learn, I think. So I'd suggest doing that and mailing the link to their proposal to this thread...
On 27 Feb 2017 6:55 p.m., "Nelson Liu" <nfliu@uw.edu> wrote:
In past years students made a page on the wiki with their proposal; this isn't possible anymore due to GitHub permissions. Perhaps an alternative method for getting feedback should be suggested on the introduction page?
Nelson Liu
On Mon, Feb 27, 2017 at 2:58 AM, Gael Varoquaux < gael.varoquaux@normalesup.org> wrote:
Hi,
Students have been inquiring about the GSoC (Google Summer of Code) with scikit-learn, and the core team has been quite silent about it.
I am happy to announce that we will be taking part in the GSoC again. The reason that we decided to do this is to give a chance to young, talented, and motivated students.
Importantly, our most limiting resource is the time of our experienced developers. This is clearly visible from the number of pending pull requests. Hence, we need students to be very able and independent. This of course means that they will be getting supervision from mentors. Such supervision is crucial for moving forward with a good project that delivers mergeable code. However, we will need the students to be very good at interacting efficiently with the mentors. Also, I should stress that we will be able to take only a very small number of students.
With that said, let me introduce the 2017 GSoC for scikit-learn. We have set up a wiki page which summarizes the experiences from last year and the ideas for this year: https://github.com/scikit-learn/scikit-learn/wiki/Google-summer-of-code-(GSOC)-2017
Interested students should declare their interest on the mailing list, and discuss with possible mentors here. Factors of success will be
* careful work on a good proposal, that takes one of the ideas on the wiki but breaks it down into a realistic plan with multiple steps and shows a good understanding of the problem.
* demonstration of the required skillset via successful pull requests in scikit-learn.
Cheers,
Gaël
--
Gael Varoquaux
Researcher, INRIA Parietal
NeuroSpin/CEA Saclay, Bat 145, 91191 Gif-sur-Yvette France
Phone: ++ 33-1-69-08-79-68
http://gael-varoquaux.info
http://twitter.com/GaelVaroquaux
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn
-- Regards, Pradeep Thalasta