BUG: np.percentile fails with internal overflow when using float16 input on large arrays #29003

friedlerido · 2025-05-19T10:03:09Z

Describe the issue:

When using np.percentile or np.nanpercentile on a large float16 array from real data (≈9 million elements), I get:
ValueError: kth(=-9223372036845518721) out of bounds (9257087)

This happens even for low percentiles like 20 or 50, and is resolved by casting the array to float64.

⚠️ Expected Behavior:
Either: NumPy automatically upcasts float16 to float64
Or: raises a UserWarning if float16 is passed to percentile

Reproduce the code example:

# This does not happen with random arrays of similar shape, but does happen consistently  #with my real-world data:
arr = np.load("example.npy")  # I can share this privately if needed
print(arr.dtype)  # float16
print(arr.shape)  # (9257087,)

np.percentile(arr, 50)  # → raises ValueError

# ✅ This works:
np.percentile(arr.astype(np.float64), 50)  # works fine

Error message:

/opt/venv/lib/python3.11/site-packages/numpy/lib/_function_base_impl.py:107: RuntimeWarning: overflow encountered in cast
  get_virtual_index=lambda n, quantiles: (n - 1) * quantiles,
/opt/venv/lib/python3.11/site-packages/numpy/lib/_function_base_impl.py:4750: RuntimeWarning: overflow encountered in cast
  indexes_above_bounds = virtual_indexes >= valid_values_count - 1
/opt/venv/lib/python3.11/site-packages/numpy/lib/_function_base_impl.py:4655: RuntimeWarning: invalid value encountered in multiply
  lerp_interpolation = asanyarray(add(a, diff_b_a * t, out=out))
/opt/venv/lib/python3.11/site-packages/numpy/lib/_function_base_impl.py:4656: RuntimeWarning: invalid value encountered in scalar multiply
  subtract(b, diff_b_a * (1 - t), out=lerp_interpolation, where=t >= 0.5,
np.float16(nan)

Python and NumPy Versions:

2.2.4
3.11.3 (main, May 23 2023, 13:34:03) [GCC 10.2.1 20210110]

Runtime Environment:

[{'numpy_version': '2.2.4',
'python': '3.11.3 (main, May 23 2023, 13:34:03) [GCC 10.2.1 20210110]',
'uname': uname_result(system='Linux', , release='5.15.0-1039-aws', version='#44~20.04.1-Ubuntu SMP Thu Jun 22 12:21:12 UTC 2023', machine='x86_64')},
{'simd_extensions': {'baseline': ['SSE', 'SSE2', 'SSE3'],
'found': ['SSSE3',
'SSE41',
'POPCNT',
'SSE42',
'AVX',
'F16C',
'FMA3',
'AVX2'],
'not_found': ['AVX512F',
'AVX512CD',
'AVX512_KNL',
'AVX512_KNM',
'AVX512_SKX',
'AVX512_CLX',
'AVX512_CNL',
'AVX512_ICL']}},
{'architecture': 'Haswell',
'filepath': '/opt/venv/lib/python3.11/site-packages/numpy.libs/libscipy_openblas64_-6bb31eeb.so',
'internal_api': 'openblas',
'num_threads': 64,
'prefix': 'libscipy_openblas',
'threading_layer': 'pthreads',
'user_api': 'blas',
'version': '0.3.28'}]

Context for the issue:

No response

eendebakpt · 2025-05-19T14:28:21Z

@friedlerido Could you share the example.npy? If possible try to reduce the example data to a minimal case that can be generated by plain python. That will help us to investigate.

friedlerido · 2025-05-19T14:59:18Z

example.zip

eendebakpt · 2025-05-20T21:03:35Z

For the example above we end up with values_count=n equal to the shape of the input array and quantiles equal to np.array([50], dtype=arr.dtype) inside _quantile. There a call is made to the get_virtual_index of the linear method which is defined as

get_virtual_index=lambda n, quantiles: (n - 1) * quantiles,

and the overflow occurs.

The issue might have been introduced in #23912 @seberg. There the dtype of the input array is set on the quantiles, e.g.

numpy/numpy/lib/_function_base_impl.py

Lines 4273 to 4275 in 2a7a0d0

    
           # Use dtype of array if possible (e.g., if q is a python int or float) 
        
           # by making the divisor have the dtype of the data array. 
        
           q = np.true_divide(q, a.dtype.type(100) if a.dtype.kind == "f" else 100, out=...)

Here is a small reproducer:

import numpy as np
arr = np.zeros(65521, dtype=np.float16)
arr[:10] = 1
z = np.percentile(arr, 50)
print(z)

seberg · 2025-05-21T07:42:10Z

Likely NEP 50 itself that changed the dtype, not so much the specific code change? I am not sure immediately if there are some internal calculations that should maybe always use float64 at least (because they are related to the length of an array, so need full float64 mantissa to be pretty correct).

To some degree, float16 tends to overflow, but this may also not be ideal for float32. The other thing is that a lerp might do a better job (e.g. by being a ufunc), but then I am not sure there is a hot-fix.

eendebakpt · 2025-05-21T19:57:29Z

This indeed seems tricky to get right: the output of get_virtual_index=lambda n, quantiles: (n - 1) * quantiles is an array with virtual indices virtual_indexes that is used in two ways (roughly):

floor(virtual_indexes) and ceil(virtual_indices) are indices of the two data points (in the sorted data) used for interpolation
g= virtual_indexes % 1 is used as a factor to interpolate between the two data points

We can use np.interp (or some other way) to calculate the value of (n - 1) * quantiles without overflows, but the result has to be in float64 (at least higher than float16 which is not enough). Then also g will be float64 and the final result of np.quantile is (1-g)*y[j] + g*y[j+1] which will also be float64. This is not ideal since we would like to have the output of np.quantile(float16_array, [0, .5, .99]) to be float16. On the other hand, np.quantile([0, 1, 3], [0]) is also promoted to float64.

We could downcast g to the dtype of the input array. (since g is in the 0 to 1 range that seems fine), but that might break cases with integer input.

Several other methods also give incorrect results:

import numpy as np
arr = np.zeros(65521, dtype=np.float16)
arr[:10] = 1
methods = ['inverted_cdf','averaged_inverted_cdf','closest_observation','interpolated_inverted_cdf','hazen','weibull',
           'linear', 'median_unbiased','normal_unbiased']

for method in methods:
    z = np.percentile(arr, 50, method=method)
    print(f'{method=} output={z} {z.dtype=}')

Output:

method='inverted_cdf' output=0.0 z.dtype=dtype('float16')
method='averaged_inverted_cdf' output=1.0 z.dtype=dtype('float16')
method='closest_observation' output=0.0 z.dtype=dtype('float16')
method='interpolated_inverted_cdf' output=nan z.dtype=dtype('float16')
method='hazen' output=nan z.dtype=dtype('float16')
method='weibull' output=nan z.dtype=dtype('float16')
method='linear' output=nan z.dtype=dtype('float16')
method='median_unbiased' output=nan z.dtype=dtype('float16')
method='normal_unbiased' output=nan z.dtype=dtype('float16')

friedlerido added the 00 - Bug label May 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG: np.percentile fails with internal overflow when using float16 input on large arrays #29003

BUG: np.percentile fails with internal overflow when using float16 input on large arrays #29003

friedlerido commented May 19, 2025

eendebakpt commented May 19, 2025

Uh oh!

friedlerido commented May 19, 2025

Uh oh!

eendebakpt commented May 20, 2025

Uh oh!

seberg commented May 21, 2025

Uh oh!

eendebakpt commented May 21, 2025

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Uh oh!

BUG: np.percentile fails with internal overflow when using float16 input on large arrays #29003

BUG: np.percentile fails with internal overflow when using float16 input on large arrays #29003

Comments

friedlerido commented May 19, 2025

Describe the issue:

Reproduce the code example:

Error message:

Python and NumPy Versions:

Runtime Environment:

Context for the issue:

eendebakpt commented May 19, 2025

Uh oh!

friedlerido commented May 19, 2025

Uh oh!

eendebakpt commented May 20, 2025

Uh oh!

seberg commented May 21, 2025

Uh oh!

eendebakpt commented May 21, 2025

Uh oh!

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.