Hi Maciej,
does the problem also show up for smaller values of k? 2000 should be
alright, but I've already seen cases where a video in the background
caused such kind of problems (because GPU-RAM was almost exhausted).
I also CC: Chales Determan, who is the author of gpuR.
Best regards,
Karli
On 06/27/2017 09:28 AM, Maciej Janiec wrote:
I was able to use the gpuR package just once. After the first time, it
is crashing every time.
System: Windows 10
GPU: GeForce GT 730
gpuMatrix is created, but the code crashed at gpuA %*% gpuA.
This works:
> k <- 2000
>
> system.time( {
+
+ gpuA <- gpuMatrix(rnorm(k^2), nrow=k, ncol=k)
+ # gpuB <- gpuA %*% gpuA
+
+ } )
user system elapsed
0.39 0.05 0.44
This crashes:
> k <- 2000
>
> system.time( {
+
+ gpuA <- gpuMatrix(rnorm(k^2), nrow=k, ncol=k)
+ gpuB <- gpuA %*% gpuA
+
+ } )
ViennaCL: FATAL ERROR: Kernel start failed for 'assign_cpu'.
ViennaCL: Smaller work sizes could not solve the problem.
Show Traceback
Rerun with Debug
Error in cpp_gpuMatrix_gemm(A@address, B@address, C@address, 8L) :
ViennaCL: FATAL ERROR: CL_MEM_OBJECT_ALLOCATION_FAILURE
ViennaCL could not allocate memory on the device. Most likely the
device simply ran out of memory.
If you think that this is a bug in ViennaCL, please report it at
[email protected]
<mailto:[email protected]> and supply at least the
following information:
* Operating System
* Which OpenCL implementation (AMD, NVIDIA, etc.)
* ViennaCL version
Many thanks in advance! Timing stopped at: 0.42 0.07 0.5
System stats:
> gpuInfo()
$deviceName
[1] "GeForce GT 730"
$deviceVendor
[1] "NVIDIA Corporation"
$numberOfCores
[1] 2
$maxWorkGroupSize
[1] 1024
$maxWorkItemDim
[1] 3
$maxWorkItemSizes
[1] 1024 1024 64
$deviceMemory
[1] 2147483648
$clockFreq
[1] 1400
$localMem
[1] 49152
$maxAllocatableMem
[1] 536870912
$available
[1] "yes"
$deviceExtensions
[1] "cl_khr_global_int32_base_atomics"
"cl_khr_global_int32_extended_atomics" "cl_khr_local_int32_base_atomics"
[4] "cl_khr_local_int32_extended_atomics" "cl_khr_fp64"
"cl_khr_byte_addressable_store"
[7] "cl_khr_icd" "cl_khr_gl_sharing"
"cl_nv_compiler_options"
[10] "cl_nv_device_attribute_query" "cl_nv_pragma_unroll"
"cl_nv_d3d10_sharing"
[13] "cl_khr_d3d10_sharing" "cl_nv_d3d11_sharing"
"cl_nv_copy_opts"
$double_support
[1] TRUE
>
> detectPlatforms()
[1] 1
> detectGPUs()
[1] 1
MJ
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
ViennaCL-support mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/viennacl-support
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
ViennaCL-support mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/viennacl-support