[scikit-learn] Machine learning for PU data

Roman Yurchak rth.yurchak at gmail.com
Fri Jun 30 09:39:47 EDT 2017


Hello Ruchika,

I don't think that scikit-learn currently has algorithms that can train 
with positive and unlabeled class labels only. However, you could try 
one of the following compatible wrappers,
   - 
http://nktmemo.github.io/jekyll/update/2015/11/07/pu_classification.html
   - https://github.com/scikit-learn/scikit-learn/pull/371

(haven't tried them myself).

Also, you could try one class SVM as suggested here 
https://stackoverflow.com/questions/25700724/binary-semi-supervised-classification-with-positive-only-and-unlabeled-data-set

-- 
Roman



On 30/06/17 16:06, Ruchika Nayyar wrote:
> Hi All,
>
> I am a scikit-learn user and have a question for the community, if
> anyone has applied any available machine learning algorithms in the
> scikit-learn package for data with positive and unlabeled class only? If
> so would you share some insight with me. I understand this could be a
> broader topic but I am new to analyzing PU data and hence can use some
> help.
>
> Thanks,
> Ruchika
>
>
>
> _______________________________________________
> scikit-learn mailing list
> scikit-learn at python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>



More information about the scikit-learn mailing list