<div dir="ltr"><div class="gmail_extra"><br><div class="gmail_quote">On 2 April 2014 16:06, Sturla Molden <span dir="ltr"><<a href="mailto:sturla.molden@gmail.com" target="_blank">sturla.molden@gmail.com</a>></span> wrote:<br>


<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div class=""><<a href="mailto:josef.pktd@gmail.com">josef.pktd@gmail.com</a>> wrote:<br>

<br>

> pandas came later and thought ddof=1 is worth more than consistency.<br>

<br>

</div>Pandas is a data analysis package. NumPy is a numerical array package.<br>

<br>

I think ddof=1 is justified for Pandas, for consistency with statistical<br>

software (SPSS et al.)<br>

<br>

For NumPy, there are many computational tasks where the Bessel correction<br>

is not wanted, so providing a uncorrected result is the correct thing to<br>

do. NumPy should be a low-level array library that does very little magic.</blockquote></div><br></div><div class="gmail_extra">All this discussion reminds me of the book "Numerical Recipes":<br></div><div class="gmail_extra">


<br>"if the difference between N and N − 1 ever matters to you, then you<br>are probably up to no good anyway — e.g., trying to substantiate a questionable<br>hypothesis with marginal data."<br><br></div><div class="gmail_extra">


For any reasonably sized data set, it is a correction in the second significant figure.<br></div></div>