<div dir="ltr"><div><div>If X is sparse, explicit zeros and missing-value zeros are **both** treated as zeros in the objective functions.</div><div><br></div><div>Changing the objective function would not require a new interface, but I am not sure the code change would be trivial.</div><div>The question is: do we want this new objective function in scikit-learn, given that we have no other recommendation-like algorithm?</div></div><div>If we agree that it would be useful, feel free to send a PR.</div><div><br></div><div>Tom</div></div><div class="gmail_extra"><br><div class="gmail_quote">2016-08-29 17:50 GMT+02:00 Andreas Mueller <span dir="ltr"><<a href="mailto:t3kcit@gmail.com" target="_blank">t3kcit@gmail.com</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
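To make the point about zeros concrete, here is a small sketch (assuming a recent scikit-learn and SciPy) showing that NMF produces the same fit whether a zero entry is stored explicitly in the sparse matrix or simply absent from it:

```python
import numpy as np
from scipy import sparse
from sklearn.decomposition import NMF

# Two sparse encodings of the same 2x3 matrix [[1, 2, 0], [0, 0, 3]]:
# one omits the (0, 2) zero entirely, the other stores it explicitly.
X_implicit = sparse.csr_matrix(
    ([1.0, 2.0, 3.0], ([0, 0, 1], [0, 1, 2])), shape=(2, 3))
X_explicit = sparse.csr_matrix(
    ([1.0, 2.0, 0.0, 3.0], ([0, 0, 0, 1], [0, 1, 2, 2])), shape=(2, 3))
print(X_implicit.nnz, X_explicit.nnz)  # different numbers of stored entries

# Deterministic init, so identical inputs give identical factors.
nmf = NMF(n_components=1, init="nndsvd", max_iter=500)
nmf.fit(X_implicit)
err_implicit = nmf.reconstruction_err_
nmf.fit(X_explicit)
err_explicit = nmf.reconstruction_err_

# The Frobenius objective treats both kinds of zeros identically,
# so the reconstruction error is the same either way.
print(np.isclose(err_implicit, err_explicit))
```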
<div bgcolor="#FFFFFF" text="#000000"><span class="">
<br>
<br>
<div>On 08/28/2016 01:16 PM, Raphael C
wrote:<br>
</div>
<blockquote type="cite"><br>
<br>
On Sunday, August 28, 2016, Andy <<a href="mailto:t3kcit@gmail.com" target="_blank">t3kcit@gmail.com</a>> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div bgcolor="#FFFFFF" text="#000000"> <br>
<br>
<div>On 08/28/2016 12:29 PM, Raphael C wrote:<br>
</div>
<blockquote type="cite">To give a little context from the web,
see e.g. <a href="http://www.quuxlabs.com/blog/2010/09/matrix-factorization-a-simple-tutorial-and-implementation-in-python/" target="_blank">http://www.quuxlabs.com/blog/2010/09/matrix-factorization-a-simple-tutorial-and-implementation-in-python/</a> where
it explains:
<div><br>
</div>
<div>"</div>
<div><font size="2"><span style="background-color:rgba(255,255,255,0)">A
question might have come to your mind by now: if we
find two matrices <img src="http://www.quuxlabs.com/wp-content/latex/ccf/ccf6cb7a07e53d6a5c3e8449ae73d371-ffffff-000000-0.png" alt="\mathbf{P}" title="\mathbf{P}" style="margin:0px;padding:0px;vertical-align:middle;max-width:640px"> and <img src="http://www.quuxlabs.com/wp-content/latex/5e1/5e1ad0579fc06ddcbda6abaa092b7382-ffffff-000000-0.png" alt="\mathbf{Q}" title="\mathbf{Q}" style="margin:0px;padding:0px;vertical-align:middle;max-width:640px"> such
that <img src="http://www.quuxlabs.com/wp-content/latex/4e3/4e37888e71add225aafff9e943e66b88-ffffff-000000-0.png" alt="\mathbf{P} \times \mathbf{Q}" title="\mathbf{P}
\times \mathbf{Q}" style="margin:0px;padding:0px;vertical-align:middle;max-width:640px"> approximates <img src="http://www.quuxlabs.com/wp-content/latex/e1f/e1fd601dbae82a538d518550acb1af19-ffffff-000000-0.png" alt="\mathbf{R}" title="\mathbf{R}" style="margin:0px;padding:0px;vertical-align:middle;max-width:640px">,
isn’t that our predictions of all the unseen ratings
will all be zeros? In fact, we are not really trying
to come up with <img src="http://www.quuxlabs.com/wp-content/latex/ccf/ccf6cb7a07e53d6a5c3e8449ae73d371-ffffff-000000-0.png" alt="\mathbf{P}" title="\mathbf{P}" style="margin:0px;padding:0px;vertical-align:middle;max-width:640px"> and <img src="http://www.quuxlabs.com/wp-content/latex/5e1/5e1ad0579fc06ddcbda6abaa092b7382-ffffff-000000-0.png" alt="\mathbf{Q}" title="\mathbf{Q}" style="margin:0px;padding:0px;vertical-align:middle;max-width:640px"> such
that we can reproduce <img src="http://www.quuxlabs.com/wp-content/latex/e1f/e1fd601dbae82a538d518550acb1af19-ffffff-000000-0.png" alt="\mathbf{R}" title="\mathbf{R}" style="margin:0px;padding:0px;vertical-align:middle;max-width:640px"> exactly.
Instead, we will only try to minimise the errors of
the observed user-item pairs. </span></font></div>
<div><font size="2"><span>"</span></font><br>
</div>
</blockquote>
Yes, the sklearn interface is not meant for matrix completion
but for matrix factorization.<br>
There was a PR for some matrix completion for missing value
imputation at some point.<br>
<br>
In general, scikit-learn doesn't really implement anything for
recommendation algorithms as that requires a different
interface.<br>
</div>
</blockquote>
<div><br>
</div>
<div>Thanks Andy. I just looked up that PR.</div>
<div><br>
</div>
<div>I was thinking that simply producing a factorisation
optimised only over the observed values wouldn't need a new
interface. That in itself would be hugely useful.</div>
</blockquote></span>
Depends. Usually you don't want to complete all values, but only
compute a factorization. What do you return? Only the factors?<br>
The PR implements completing everything, and that you can do with
the transformer interface. I'm not sure what the status of the PR
is,<br>
but doing that with NMF instead of SVD would certainly also be
interesting.<br>
<br>
</div>
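A rough sketch of the kind of factorisation under discussion, fitting rank-k factors P and Q by stochastic gradient descent on the observed (here: nonzero) entries only, in the spirit of the quuxlabs tutorial quoted above. All names (P, Q, lr, reg) are illustrative, not an existing scikit-learn API:

```python
import numpy as np

# Ratings matrix; zeros mean "unobserved", not "rated zero".
R = np.array([[5, 3, 0, 1],
              [4, 0, 0, 1],
              [1, 1, 0, 5],
              [1, 0, 0, 4],
              [0, 1, 5, 4]], dtype=float)

rng = np.random.default_rng(0)
k, lr, reg = 2, 0.01, 0.02          # rank, learning rate, L2 penalty
P = rng.random((R.shape[0], k))
Q = rng.random((k, R.shape[1]))
observed = list(zip(*R.nonzero()))  # only these pairs enter the loss

for _ in range(2000):               # SGD epochs over observed entries
    for i, j in observed:
        err = R[i, j] - P[i] @ Q[:, j]
        p_old = P[i].copy()
        P[i] += lr * (err * Q[:, j] - reg * p_old)
        Q[:, j] += lr * (err * p_old - reg * Q[:, j])

# RMSE on the observed entries only; entries that were "missing"
# now get filled in by P @ Q as predictions.
rmse = np.sqrt(np.mean([(R[i, j] - P[i] @ Q[:, j]) ** 2
                        for i, j in observed]))
```

The factors P and Q are what such an estimator would naturally return; `(P @ Q)[i, j]` then serves as the prediction for an unobserved pair, which touches on the "what do you return" question above.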
<br>_______________________________________________<br>
scikit-learn mailing list<br>
<a href="mailto:scikit-learn@python.org">scikit-learn@python.org</a><br>
<a href="https://mail.python.org/mailman/listinfo/scikit-learn" rel="noreferrer" target="_blank">https://mail.python.org/mailman/listinfo/scikit-learn</a><br>
<br></blockquote></div><br></div>