[scikit-learn] scikit-learn Digest, Vol 6, Issue 40

Wed Sep 28 11:53:53 EDT 2016

Afarin,
can you please describe your full data set, as maybe you are making a
mistake in how you are setting up the data.

My understanding of what Afarin is saying is that for each person he has a
row for successes and a row for failures (but cannot understand why only
two rows - would expect multiple rows according to different feature
configurations)

So what Afarin wants to do is split by person rather than by row?

Sean

On Wed, Sep 28, 2016 at 5:26 PM, Andreas Mueller <t3kcit at gmail.com> wrote:

> It's not really clear to me what you want to achieve.
> What do you mean by "does not lead to a biased accuracy"?
>
>
> On 09/26/2016 05:06 PM, Afarin Famili wrote:
>
>> Hi David,
>>
>> When applying Train_test_split to the sample space, we have a single row
>> per subject. I am looking for some other function like Train_test_split
>> that can deal with pairs of rows (for each subject), which does not lead to
>> a biased accuracy. We are studying memory and have a row of features for
>> successful memory encoding, and a second row for unsuccessful memory
>> encoding in each of the subjects. Our target space being 1 for successful
>> and 0 for unsuccessful encoding respectively.
>> How do you recommend me to split this set of data in order to get a
>> reasonable/unbiased accuracy?
>>
>> Thanks,
>> Afarin
>>
>>
>>
> _______________________________________________
> scikit-learn mailing list
> scikit-learn at python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scikit-learn/attachments/20160928/6b25876c/attachment.html>