Numpy: How to set float32 as default

Created on 19 Dec 2015  ·  21 Comments  ·  Source: numpy/numpy

I use cuBLAS + numpy; cuBLAS runs very fast on float32, about 10 times faster than the CPU.
However, I need to set dtype=float32 by hand every time, which is tedious. random.rand() doesn't even support creating a float32 array.
Is there any way to set the default precision to float32 in numpy?

numpy.dtype

Most helpful comment

This would be quite useful.

All 21 comments

There isn't, sorry. And I'm afraid we're unlikely to add such a thing because it would necessarily have to be global state, and this kind of global state tends to create all kinds of problems (e.g. people will try changing the default inside a library, and then unrelated code that happens to use this library will start seeing weird problems when the unrelated code tries to use numpy).

You could make your own utility functions and use those, e.g.:

import numpy as np

def array(*args, **kwargs):
    # Use float32 unless the caller explicitly passes a dtype.
    kwargs.setdefault("dtype", np.float32)
    return np.array(*args, **kwargs)
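For instance, with the wrapper above in scope, a call like this produces a float32 array without spelling out the dtype (illustrative only):

a = array([[1.0, 2.0], [3.0, 4.0]])
print(a.dtype)  # float32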

This would be quite useful.

This is old, but it would still be useful (you can find a handful of questions on Stack Overflow asking about it). May I add, a global library state is not really the _only_ option for this. You could have an environment variable, a configuration file, or even just a context manager. For example, Theano offers a configuration file and an environment variable. I imagine you could have a default float size (like Theano's floatX) and maybe a default integer size (and even a default complex size if you want to push it?). Also, it is not nearly as significant, but there is already at least _some_ global state in NumPy, e.g. set_printoptions (which you could in principle mess up from a library, or from different threads); maybe having a uniform way of configuring the library is not such a bad idea.

I'm not saying it is straightforward, as it probably affects a great portion of the code, and surely there are a lot of corner cases to it, but I think it may be worth considering, even if only as a potential roadmap item.
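For illustration, here is a minimal sketch of the environment-variable approach mentioned above, applied to your own wrappers rather than to NumPy itself (MYLIB_FLOATX and the zeros wrapper are invented names, not an existing NumPy mechanism):

import os
import numpy as np

# Hypothetical: pick the default float dtype from an environment variable,
# in the spirit of Theano's floatX. Falls back to float64 if unset.
FLOATX = np.dtype(os.environ.get("MYLIB_FLOATX", "float64"))

def zeros(shape, dtype=None):
    # Wrapper that uses the library-level default instead of NumPy's float64.
    return np.zeros(shape, dtype=dtype or FLOATX)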

Especially since, with deep learning (TensorFlow, PyTorch, etc.), people are manipulating arrays with less than 64-bit precision pretty much 100% of the time (mainly 32-bit, but mixed-precision and quantized models are gaining a lot of ground, with official support from all top vendors).

I have exactly the same problem. Having some trouble with very large matrices in a very long module that makes many calls to np.array. I can't change all the calls to specify the optional argument (dtype=np.float32). I just want to tell numpy to use float32 instead of float64. The OS is swapping now. Please help.

I hate that I have to do this every time.

@soulslicer this issue is closed; we will not be changing this in the foreseeable future. Perhaps monkey-patching np.array to add a default dtype would solve your problem. You can arrange for this to be called at Python startup via PYTHONSTARTUP for interactive work, or put it in a file and import it at project startup.

import numpy as np

_oldarray = np.array

def array32(*args, **kwargs):
    # Default to float32 unless the caller passes a dtype explicitly.
    kwargs.setdefault('dtype', np.float32)
    return _oldarray(*args, **kwargs)

np.array = array32  # monkey-patch: np.array now defaults to float32
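With the patch above in effect, freshly created arrays default to float32 (illustrative):

a = np.array([1.0, 2.0, 3.0])
print(a.dtype)  # float32, because np.array was replaced by array32

For interactive work, this snippet can live in the file that the PYTHONSTARTUP environment variable points to, so it runs every time the interpreter starts.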

heh, another way ;)

from functools import partial
import numpy as np

# array32 behaves like np.array but with dtype=np.float32 as the default.
array32 = partial(np.array, dtype=np.float32)
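Used the same way (illustrative; a dtype passed at the call site still overrides the partial's default):

a = array32([1, 2, 3])
print(a.dtype)  # float32
b = array32([1, 2, 3], dtype=np.float64)  # explicit dtype wins
print(b.dtype)  # float64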

FYI with deep neural networks becoming so huge, more and more people will be after this feature.

Lol @ numpy

hey i want each number occupying 38 gigs on your computer

That's not what's at stake here @JadBatmobile

njsmith explained in clear terms 3 years ago why this "feature" would very easily (read: in one line of code) lead to a lot of latent and non-local bugs. Such a "feature" should only be used responsibly. I don't think implementing features that need to be used responsibly is a good idea. If you know you're using it, and going to use it responsibly: choose one from the several suggestions mentioned in this thread (and even more elsewhere), and make your own code explicitly behave this way.

@adeak I am not sure if this is a good idea, but maybe a context manager would be a good compromise?

Pseudocode:

from contextlib import contextmanager

@contextmanager
def default_dtype(dtype):
    # Read the current default dtype and switch to the one provided.
    # (read_current_default_dtype / change_default_dtype are placeholders.)
    original_dtype = read_current_default_dtype()
    change_default_dtype(dtype)
    try:
        yield
    finally:
        # Restore the original default even if the block raises.
        change_default_dtype(original_dtype)

Usage:

with np.default_dtype(np.float32):
    ...  # do float32 stuff
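As a concrete (and purely library-local) version of this sketch, the default could live in a module-level variable consulted by your own wrappers, so NumPy's global state is never touched. All names below (_DEFAULT_DTYPE, default_dtype, the array wrapper) are hypothetical:

from contextlib import contextmanager
import numpy as np

_DEFAULT_DTYPE = np.dtype(np.float64)

@contextmanager
def default_dtype(dtype):
    # Temporarily swap the module-level default, restoring it on exit.
    global _DEFAULT_DTYPE
    original = _DEFAULT_DTYPE
    _DEFAULT_DTYPE = np.dtype(dtype)
    try:
        yield
    finally:
        _DEFAULT_DTYPE = original

def array(obj, dtype=None, **kwargs):
    # Wrapper that honors the context-local default; np.array itself is untouched.
    return np.array(obj, dtype=dtype or _DEFAULT_DTYPE, **kwargs)

with default_dtype(np.float32):
    a = array([1.0, 2.0])  # float32 inside the block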

@dankal444 if I understand correctly, nothing would stop people from being lazy and calling the ominous change_default_dtype(dtype) manually, with no guarantee of cleanup.

@adeak I thought that this "ominous" method could be hidden from users, with only the context manager made available.

I suspect that the people demanding this feature wouldn't be happy with a context manager; that would be even more cumbersome than a single custom configuration step to be done once. People could just start using the non-public function that has global state to get it over with, defeating the purpose.

I do not think context managers help much. You will always run into the issue that you may call downstream functions that use the larger precision for a good reason, and you would just break them. Heck, you may even cause a segfault, because C-interfacing code hardly has a reason to double-check that a freshly created array has an unexpected dtype.

I see that at the low level there is NPY_DEFAULT_TYPE; maybe numpy could provide a function to change this value to float32?

It is really a pain to declare the np.float32 dtype every time you create a new array.

https://docs.scipy.org/doc/numpy-1.15.1/reference/c-api.dtype.html?highlight=default_type#c.NPY_DEFAULT_TYPE


[Bob] How can I create a random float32 array that consumes 90% of available RAM?
[Numpy] Just double the RAM...

Everyone has an opinion these days that lurks for expression; mine is that this is probably one of the most "insane and ruthless"* design decisions I've ever seen, and it deserves a rightful nomination to my private hall of fame.

*"insane and ruthless" is an idiomatic expression originating from Russian

[Aphorism 1] If it's limiting, then it doesn't matter how slim your architecture is.
[Aphorism 2] In many cases "pythonic" is just a label, the last one that covers shame.

Again: the reason for not implementing this is not that we like wasting your memory, it's that it will break all kinds of stuff and cause you to silently get wrong answers. The fact that so many people think it's an "obvious" thing to do confirms that most people don't understand the full consequences here, and wouldn't be prepared to judge when this feature is safe to use and when it isn't.

I hear the pain you all are experiencing; that's totally valid, and we would like to help if we can. But to do that someone has to come up with a plan that doesn't break everything.

Locking this issue since it's clearly a magnet for unproductive comments. If you have a new idea, please open a new issue.

