[theano-users] Limit in CNMeM (CUDA) parameter configuration

BorjaH_85 Wed, 03 Aug 2016 07:06:30 -0700

Hi all,

I have been doing a benchmark regarding the gpu-based execution of Theano 
(0.8.2) and Keras (1.0.5) on Windows 10. I am comparing the training time 
of a CNN over the MNIST dataset in 3 scenarios: CPU, GPU without cuDNN and 
GPU with CUDNN. CUDA v7.5 and cuDNN v5.0


First two approaches go well, but I have gone into troubles with the sysenv 
variables configuration for the GPU+cuDNN trial. My error source is the 
CNMeM configuration. If I run the same python code with CNMeM values from 0 
(disbaled) up to 0.75, I have no problems. Moreover, I obtain a drop in the 
training time of more than 50% compared to the general GPU execution.

But I cannot go behind from 0.8. With 0.8, I receive the same compilation 
error.

RuntimeError: ('The following error happened while compiling the node', 
GpuDnnConv{algo='small', inplace=True}(GpuContiguous.0, GpuContiguous.0, 
GpuAllocEmpty.0, GpuDnnConvDesc{border_mode='valid', subsample=(1, 1), 
conv_mode='conv', precision='float32'}.0, Constant{1.0}, Constant{0.0}), 
'\n', 'could not create cuDNN handle: CUDNN_STATUS_INTERNAL_ERROR')

Best regards,

Borja

-- 

--- 
You received this message because you are subscribed to the Google Groups 
"theano-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

[theano-users] Limit in CNMeM (CUDA) parameter configuration

Reply via email to