Hi Jochen,

On Mon, 16 Dec 2019 21:03:39 +0100
Jochen Keil <[email protected]> wrote:

The three modules responsible for most of the processing time (about
83% of your export, going by the timings below) are these:
23,918032 [dev_pixelpipe] took 17,204 secs (41,352 CPU)
processed `exposure 1' on GPU, blended on GPU [export]

40,646382 [dev_pixelpipe] took 16,728 secs (41,418 CPU) processed
`exposure 2' on GPU, blended on GPU [export]

53,002052 [dev_pixelpipe] took 10,305 secs (27,939 CPU) processed `tone
curve 3' on GPU, blended on GPU [export]

Those match mine:
13,928084 [dev_pixelpipe] took 8,073 secs (63,809 CPU) processed
`exposure 1' on GPU, blended on GPU [export] 
22,513049 [dev_pixelpipe] took 8,585 secs (65,274 CPU) processed
`exposure 2' on GPU, blended on GPU [export] 
30,857945 [dev_pixelpipe] took 6,512 secs (75,135 CPU) processed `tone
curve 3' on GPU, blended on GPU [export]

The reason these modules take so long is that the GPU ran out of
memory, so the work had to be done by the CPU.

Your specs:
[opencl_init] device 0: GeForce GTX 1060 6GB 
     GLOBAL_MEM_SIZE:          6063MB
     MAX_WORK_GROUP_SIZE:      1024
     MAX_WORK_ITEM_DIMENSIONS: 3
     MAX_WORK_ITEM_SIZES:      [ 1024 1024 64 ]
     DRIVER_VERSION:           390.116
     DEVICE_VERSION:           OpenCL 1.2 CUDA

CPU: Intel Core i5 7600K
Memory: 64GB.

My specs:
[opencl_init] device 0: GeForce GTX 1050 Ti 
     GLOBAL_MEM_SIZE:          4032MB
     MAX_WORK_GROUP_SIZE:      1024
     MAX_WORK_ITEM_DIMENSIONS: 3
     MAX_WORK_ITEM_SIZES:      [ 1024 1024 64 ]
     DRIVER_VERSION:           430.64
     DEVICE_VERSION:           OpenCL 1.2 CUDA

[opencl_init] device 1: GeForce GTX 1050 
     GLOBAL_MEM_SIZE:          2000MB
     MAX_WORK_GROUP_SIZE:      1024
     MAX_WORK_ITEM_DIMENSIONS: 3
     MAX_WORK_ITEM_SIZES:      [ 1024 1024 64 ]
     DRIVER_VERSION:           430.64
     DEVICE_VERSION:           OpenCL 1.2 CUDA

CPU: AMD Ryzen 1700 overclocked to 3.8 GHz
Memory: 32 GB DDR 4 at 3200 MHz

So, using a less capable GPU than yours, I get:
39,681519 [dev_process_export] pixel pipeline processing took 38,916
secs (360,590 CPU)

And you get:
53,785899 [dev_process_export] pixel pipeline processing took 53,032
secs (130,516 CPU)

That is about 14 seconds, or roughly 27% less time than your export.
The only reason I can see for this is that you are using a very old
version of the Nvidia driver (390.116) while I am using 430.64. I
suspect that if you used the same Nvidia driver version as I do, you
would be able to cut between 20 and 30 seconds off your processing
time.
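
As a rough cross-check of the numbers in this mail, here is a small
Python sketch that parses the quoted log lines and works out how much
of each export the three modules account for, and how far apart the two
exports are. The function and variable names are my own and purely
illustrative; nothing here is part of darktable. It assumes the log
prints decimals with a comma, as in the excerpts above, and that the
lines have been re-joined after the mail client wrapped them.

```python
import re

# Log excerpts copied from this thread, with the mail line wraps undone.
JOCHEN_LOG = """
23,918032 [dev_pixelpipe] took 17,204 secs (41,352 CPU) processed `exposure 1' on GPU, blended on GPU [export]
40,646382 [dev_pixelpipe] took 16,728 secs (41,418 CPU) processed `exposure 2' on GPU, blended on GPU [export]
53,002052 [dev_pixelpipe] took 10,305 secs (27,939 CPU) processed `tone curve 3' on GPU, blended on GPU [export]
53,785899 [dev_process_export] pixel pipeline processing took 53,032 secs (130,516 CPU)
"""

MICHAEL_LOG = """
13,928084 [dev_pixelpipe] took 8,073 secs (63,809 CPU) processed `exposure 1' on GPU, blended on GPU [export]
22,513049 [dev_pixelpipe] took 8,585 secs (65,274 CPU) processed `exposure 2' on GPU, blended on GPU [export]
30,857945 [dev_pixelpipe] took 6,512 secs (75,135 CPU) processed `tone curve 3' on GPU, blended on GPU [export]
39,681519 [dev_process_export] pixel pipeline processing took 38,916 secs (360,590 CPU)
"""

# Captures the first bracketed tag on the line and the wall-clock seconds.
TOOK = re.compile(r"\[(?P<tag>\w+)\].*?took (?P<secs>\d+,\d+) secs")


def parse(log):
    """Return (sum of per-module wall times, total pipeline wall time)."""
    modules, total = 0.0, None
    for line in log.splitlines():
        match = TOOK.search(line)
        if match is None:
            continue
        # Convert the comma decimal separator before parsing the number.
        secs = float(match.group("secs").replace(",", "."))
        if match.group("tag") == "dev_process_export":
            total = secs
        else:
            modules += secs
    return modules, total


jochen_modules, jochen_total = parse(JOCHEN_LOG)
michael_modules, michael_total = parse(MICHAEL_LOG)

# Share of each export spent in the three quoted modules.
jochen_share = jochen_modules / jochen_total
michael_share = michael_modules / michael_total

# Difference between the two exports, in seconds and relative to the slower one.
saved_secs = jochen_total - michael_total
saved_fraction = saved_secs / jochen_total

print(f"module share, Jochen:  {jochen_share:.1%}")
print(f"module share, Michael: {michael_share:.1%}")
print(f"difference: {saved_secs:.1f} s ({saved_fraction:.1%} less time)")
```

With the figures quoted in this thread, that gives about 83% for your
log, about 60% for mine, and a difference of about 14 seconds (about
27%). Note that it only compares wall-clock times; it says nothing
about why they differ.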

-- 
Hilsen/Regards
Michael Rasmussen

Get my public GnuPG keys:
michael <at> rasmussen <dot> cc
https://pgp.key-server.io/pks/lookup?search=0xD3C9A00E
mir <at> datanom <dot> net
https://pgp.key-server.io/pks/lookup?search=0xE501F51C
mir <at> miras <dot> org
https://pgp.key-server.io/pks/lookup?search=0xE3E80917
--------------------------------------------------------------
/usr/games/fortune -es says:
Ask not for whom the telephone bell tolls...
if thou art in the bathtub, it tolls for thee.
