[Numpy-discussion] Automatic number of bins for numpy histograms
pmhobson at gmail.com
Tue Apr 14 22:00:25 EDT 2015
On Tue, Apr 14, 2015 at 4:24 PM, Jaime Fernández del Río <
jaime.frio at gmail.com> wrote:
> On Tue, Apr 14, 2015 at 4:12 PM, Nathaniel Smith <njs at pobox.com> wrote:
>> On Mon, Apr 13, 2015 at 8:02 AM, Neil Girdhar <mistersheik at gmail.com>
>> > Can I suggest that we instead add the P-square algorithm for the dynamic
>> > calculation of histograms?
>> > (
>> > This is already implemented in C++'s boost library
>> > (
>> > I implemented it in Boost Python as a module, which I'm happy to share.
>> > This is much better than fixed-width histograms in practice. Rather
>> > adjusting the number of bins, it adjusts what you really want, which is
>> > resolution of the bins throughout the domain.
>> This definitely sounds like a useful thing to have in numpy or scipy
>> (though if it's possible to do without using Boost/C++ that would be
>> nice). But yeah, we should leave the existing histogram alone (in this
>> regard) and add a new name for this like "adaptive_histogram" or
>> something. Then you can set about convincing matplotlib and friends to
>> use it by default :-)
> Would having a negative number of bins mean "this many, but with optimized
> boundaries" be too clever an interface?
As a user, I think so. Wouldn't np.histogram(..., adaptive=True) do well
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the NumPy-Discussion