Numpy: Histgram number of samples is negative with bins="auto"

Created on 27 Aug 2019 · 4Comments · Source: numpy/numpy

Getting Number of samples, -20, must be non-negative. when trying to build a histogram of my dataset.

Reproducing code example:

import numpy as np

my_data = np.loadtxt("my_data.csv", delimiter=',', dtype=np.int16)

n_base, bins_base = np.histogram(my_data, bins="auto")

Here is my_data.csv

Error message:

Number of samples, -20, must be non-negative.

Numpy/Python version information:

1.16.4 3.7.4 (default, Aug 13 2019, 20:35:49)
[GCC 7.3.0]

00 - Bug numpy.lib

Source

alexsmartens

Most helpful comment

Thanks for the bug report. I can confirm that this bug exists in the master branch, too.

NumPy devs: The problem is that there is an internal function, _hist_bin_sturges in histograms.py, that uses the method ptp to compute the difference of the maximum and minimum of an array with dtype int16. In this case, the maximum is 32767 and the minimum is -16, so that difference should be 32783. But ptp returns a value with the same type as the array, so it returns -32753, which results in the incorrect calculation.

We could fix that by replacing x.ptp() with something like x.max().item() - x.min().item().

WarrenWeckesser on 27 Aug 2019

👍2

All 4 comments

Interestingly enough, when I convert this dataset to float, then the histogram is being build without an issue

alexsmartens on 27 Aug 2019

Thanks for the bug report. I can confirm that this bug exists in the master branch, too.

We could fix that by replacing x.ptp() with something like x.max().item() - x.min().item().

WarrenWeckesser on 27 Aug 2019

👍2

Most of the other bin estimators have the same problem with x.ptp().

WarrenWeckesser on 27 Aug 2019

A possible fix is in https://github.com/numpy/numpy/pull/14381.

WarrenWeckesser on 27 Aug 2019

Was this page helpful?

0 / 5 - 0 ratings

Related issues

_read32 TypeError: only integer scalar arrays can be converted to a scalar index

ghost · 4Comments

numpy.int64 is not instance of int

dmvianna · 4Comments

Enh: Object array creation function

toddrjen · 4Comments

BUG: Setting the mask on a view with mask=nomask does not propagate to the owner

MorBilly · 4Comments

Nullable integers conversion

Kreol64 · 3Comments