In a test to optimize GPU memory for Opencl execution of dt, I have tried
different updates in "preferences > processing > cpu / gpu / memory > tune
OpenCL performance" and in "preferences > processing > cpu / gpu / memory >
darktable resources".
But when checking in "darktable-log.txt", I seem to get all the time the
same memory allocation, only 2 GB from the 8 GB available GPU memory as
shown hereunder.
Now I am fine with the response time of dt execution on my computer. But
this seems not in line with what I read in the dt 4.4 documentation.
Does anybody have further advice about Opencl GPU memory optimisation for dt
4.4.1 on Windows on a NVIDIA GeForce RTX?
Thank you,
Marc.
EXTRACT from darktable-log.txt
--
version: darktable 4.4.1
start: 2023:07:12 20:56:44
0,4468 [dt_get_sysresource_level] switched to 3 as `unrestricted'
0,4468 total mem: 32607MB
0,4469 mipmap cache:4075MB
0,4469 available mem: 28531MB
0,4469 singlebuff: 32607MB
0,4469 OpenCL tune mem: WANTED
0,4469 OpenCL pinned: WANTED
[opencl_init] opencl related configuration options:
[opencl_init] opencl: ON
[opencl_init] opencl_scheduling_profile: 'default'
[opencl_init] opencl_library: 'default path'
[opencl_init] opencl_device_priority: '+0/+0/*/+0/+0'
[opencl_init] opencl_mandatory_timeout: 725
[opencl_init] opencl library 'OpenCL.dll' found on your system and loaded
[opencl_init] found 2 platforms
[opencl_init] found 2 devices
[dt_opencl_device_init]
DEVICE: 0: 'NVIDIA GeForce RTX 2080 with Max-Q Design'
PLATFORM NAME & VENDOR: NVIDIA CUDA, NVIDIA Corporation
CANONICAL NAME: nvidiacudanvidiageforcertx2080withmaxqdesign
DRIVER VERSION: 535.98
DEVICE VERSION: OpenCL 3.0 CUDA, SM_20 SUPPORT
DEVICE_TYPE: GPU
GLOBAL MEM SIZE: 8192 MB
<
MAX MEM ALLOC:2048 MB
<
MAX IMAGE SIZE: 32768 x 32768
MAX WORK GROUP SIZE: 1024
MAX WORK ITEM DIMENSIONS: 3
MAX WORK ITEM SIZES: [ 1024 1024 64 ]
ASYNC PIXELPIPE: YES
PINNED MEMORY TRANSFER: WANTED
MEMORY TUNING:WANTED
FORCED HEADROOM: 400
AVOID ATOMICS:NO
MICRO NAP:0
ROUNDUP WIDTH:16
ROUNDUP HEIGHT: 16
CHECK EVENT HANDLES: 1024
PERFORMANCE: 5.938
TILING ADVANTAGE: 0.000
DEFAULT DEVICE: NO
KERNEL BUILD DIRECTORY: C:\Program
Files\darktable\share\darktable\kernels
KERNEL DIRECTORY: C:\Users\Marc
Cabuy\AppData\Local\Microsoft\Windows\INetCache\darktable\cached_v1_kernels_
for_NVIDIACUDANVIDIAGeForceRTX2080withMaxQDesign_53598
CL COMPILER OPTION: -cl-fast-relaxed-math
KERNEL LOADING TIME: 11.4726 sec
darktable user mailing list
to unsubscribe send a mail to darktable-user+unsubscr...@lists.darktable.org