<div dir="ltr"><div>Hello community,</div><div class="gmail_extra"><br><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><span class="gmail-"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">

I wonder if there's something similar for the binary class case where,<br>

the prediction is a real value (activation) and from this we can also<br>

derive<br>

  - CMs for all prediction cutoff (or set of cutoffs?)<br>

  - scores over all cutoffs (AUC, AP, ...)<br>

</blockquote></span>

AUC and AP are by definition over all cut-offs. And CMs for all<br>

cutoffs doesn't seem a good idea, because that'll be n_samples many<br>

in the general case. If you want to specify a set of cutoffs, that would be pretty easy to do.<br>

How do you find these cut-offs, though?<span class="gmail-"><br>

<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">

<br>

For me, in analyzing (binary class) performance, reporting scores for<br>

a single cutoff is less useful than seeing how the many scores (tpr,<br>

ppv, mcc, relative risk, chi^2, ...) vary at various false positive<br>

rates, or prediction quantiles.<br></blockquote></span></blockquote><div><br></div><div>In terms of finding cut-offs, one could use the idea of metric surfaces that I recently proposed</div><div><a href="https://onlinelibrary.wiley.com/doi/abs/10.1002/minf.201700127">https://onlinelibrary.wiley.com/doi/abs/10.1002/minf.201700127</a> <br></div><div>and then plot your per-threshold TPR/TNR pairs on the PPV/MCC/etc surfaces to determine what conditions you are willing to accept against the background of your prediction problem.</div><div><br></div><div>I use these surfaces (a) to think about the prediction problem before any attempt at modeling is made, and (b) to deconstruct results such as "Accuracy=85%" into interpretations in the context of my field and the data being predicted.</div><div><br></div><div>Hope this contributes a bit of food for thought.</div><div>J.B.<br></div></div></div></div>