Caffe: Check failed: error == cudaSuccess (2 vs. 0) out of memory in solver phase.

Created on 11 Jul 2015  ·  3 Comments  ·  Source: BVLC/caffe

Hello,

I am using the pre-trained GoogleNet Places model.
For the solver phase I am using the cuDNN support, but my GPU has only 2 GB of RAM (GTX 960), and some higher layers fail on images larger than 1280x800...

Is this problem batch-size related? I saw that some people experience it during the training phase, and reducing the batch size seems to help. However, I am not sure how to handle the issue with a pre-trained model...

Would it be possible to fix this by re-training or is there something else that I am missing?

Regards,
Vlad


All 3 comments

Images larger than 1280x800? Are you trying to make deepdream images? If so, the batch size is already as small as it can be (i.e. 1), and you need either a larger GPU (e.g. the TITAN X) or smaller input images. 2 GB of RAM is pretty tiny for deep learning. You could also modify the code to take the input in smaller chunks; you'll need to do something clever to handle the seams between chunks, though.
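For what it's worth, here is a minimal sketch of the chunked approach in plain NumPy, independent of Caffe. Everything here is a placeholder: `process_fn` stands in for whatever runs the network on one tile, and the tile size and overlap are arbitrary values you would tune to your GPU.

```python
import numpy as np

def process_in_tiles(image, process_fn, tile=512, overlap=32):
    """Apply process_fn to overlapping tiles and stitch the outputs.

    image:      H x W x C array
    process_fn: maps a tile to an output of the same height/width
    tile, overlap: placeholders; pick tile so one tile fits in GPU memory
    """
    h, w = image.shape[:2]
    out = np.zeros(image.shape, dtype=np.float32)
    core = tile - 2 * overlap           # the seam-free centre of each tile
    for y in range(0, h, core):
        for x in range(0, w, core):
            # Read a tile with `overlap` pixels of context on every side,
            # clipped to the image borders.
            y0, x0 = max(y - overlap, 0), max(x - overlap, 0)
            y1, x1 = min(y + core + overlap, h), min(x + core + overlap, w)
            res = process_fn(image[y0:y1, x0:x1])
            # Write back only the centre, discarding the overlapped margins.
            ch, cw = min(core, h - y), min(core, w - x)
            out[y:y + ch, x:x + cw] = res[y - y0:y - y0 + ch,
                                          x - x0:x - x0 + cw]
    return out
```

The overlap gives each tile some surrounding context so the cropped centres line up without hard seams; for deepdream-style outputs you would likely need proper blending rather than plain crops.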

If you're training a model for a task, you can reduce the batch size and correspondingly increase iter_size (default: 1) by the same factor in solver.prototxt. This gives an equivalent result, since gradients are accumulated over the iter_size iterations.
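As a made-up example (the batch size of 32 is illustrative, not from this thread), cutting per-iteration memory by 4x while keeping the effective batch size unchanged would look like this:

```
# train_val.prototxt: shrink the per-iteration batch in the data layer
#   batch_size: 32  ->  batch_size: 8

# solver.prototxt: accumulate gradients over 4 iterations,
# so the effective batch size is still 8 * 4 = 32
iter_size: 4
```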

In the future, please ask questions like this on the caffe-users group (https://groups.google.com/forum/#!forum/caffe-users). Github is for code issues, not software usage.

Thanks for the info! I tried to post in the Google group a while ago, but it seems it takes some time before a post actually appears on the page (it's being moderated, I guess). I asked in the Gitter chatroom as well but did not get any responses.

I tried this model with deepdream too and hit the same issue as with my application. I am implementing a recognition algorithm that highlights buildings in videos, so it might actually be possible to split the frames into smaller images and process them separately. Another approach would be to scale the images down, process them, and output a "mask" with the highlights that can be scaled back up and applied to the original frames.
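That second approach (downscale, run the net, upscale the mask) can be sketched in a few lines with OpenCV. Note that `run_net` below is a stand-in for the actual recognition step, not a real Caffe call, and the working size is an arbitrary example:

```python
import cv2
import numpy as np

def highlight_buildings(frame, run_net, work_size=(640, 400)):
    """Compute the mask at low resolution, then upscale it and
    blend it over the original full-resolution frame.

    run_net: stand-in for the recognition step; takes a small BGR image
             and returns a float mask in [0, 1] of the same size.
    """
    h, w = frame.shape[:2]
    small = cv2.resize(frame, work_size, interpolation=cv2.INTER_AREA)
    mask = run_net(small)                                  # low-res mask
    mask = cv2.resize(mask, (w, h), interpolation=cv2.INTER_LINEAR)
    # Blend a green highlight, weighted per pixel by the mask.
    green = np.zeros_like(frame)
    green[..., 1] = 255
    alpha = mask[..., None] * 0.5
    return (frame * (1 - alpha) + green * alpha).astype(np.uint8)
```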

I will get a better GPU soon, but I am hoping to get this running on Tegra devices as well, so I will definitely have to optimize a lot of things for better portability.

Thanks again! Closing issue.

Hello @seanbell. First of all thanks to you and all BVLC contributors for this amazing project :)

I've been doing extensive research on this error, because a lot of us run into it, and all the reports are Caffe-related.

> In the future, please ask questions like this on the caffe-users group (https://groups.google.com/forum/#!forum/caffe-users). Github is for code issues, not software usage.

This is where I disagree: "Check failed: error == cudaSuccess (2 vs. 0) out of memory in solver phase." is a pretty valid code issue.

Caffe should not break in GPU mode at all, because it does not break in CPU mode.
Having a GTX 970 instead of a $1,500 TITAN should make your program much slower, but it should not break it. I'm sorry...
