[gdal-dev] gdalwarp OpenCL Performance (Week 9)

Seth Price Tue, 27 Jul 2010 01:08:58 -0700

I just finished the first performance tests of my gdalwarp OpenCL code. It's doing better than I expected. I used this command: "time gdalwarp -q -r lanczos -t_srs '+proj=merc +a=6378137.0 +b=6378137.0 +nadgri...@null +wktext +units=m' big_test.tif big_test.out.tif"
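For anyone who wants to repeat this kind of timing across several resamplers, here is a minimal Python sketch. It only wraps the gdalwarp options used in the command above (-q, -r, -t_srs); the helper names build_warp_command, time_command and benchmark are hypothetical and not part of GDAL, and the target SRS string is left as a parameter rather than hard-coded.

```python
import subprocess
import time


def build_warp_command(resampler, t_srs, src, dst):
    """Return the argv list for a quiet gdalwarp run with the given resampler."""
    return ["gdalwarp", "-q", "-r", resampler, "-t_srs", t_srs, src, dst]


def time_command(argv):
    """Run argv to completion and return the elapsed wall-clock seconds."""
    start = time.perf_counter()
    subprocess.run(argv, check=True)  # raises if the command exits non-zero
    return time.perf_counter() - start


def benchmark(resamplers, t_srs, src, dst_template):
    """Time one warp per resampler; dst_template needs a {name} placeholder."""
    results = {}
    for name in resamplers:
        # Write each run to its own output file so runs do not share a target.
        dst = dst_template.format(name=name)
        results[name] = time_command(build_warp_command(name, t_srs, src, dst))
    return results
```

A call such as benchmark(["lanczos", "cubicspline", "cubic"], my_srs, "big_test.tif", "big_test.{name}.tif") would return a dict of seconds per resampler, where my_srs stands for whatever target SRS string is being tested. This measures wall-clock time only, so it includes file I/O as well as the resampling itself.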

I can compile the OpenCL code two different ways. I can run OpenCL code on the CPU and distribute it across processors by selecting the CPU as the device. This compiles a multithreaded version of the code. By selecting the GPU device, the OpenCL code compiles to run on my Mac Pro's graphics card, a GeForce GTX 285. To test, I used an 80 MB RGB raster, with 8 bits per channel.

With the original lanczos resampler code I get 5:31, with OpenCL on my Mac Pro's 16 cores 0:39, and with OpenCL on my GTX 285 0:10. That's a 36x speedup.

Using cubicspline resampling, the original code takes 0:59, the OpenCL CPU code takes 0:13, and the OpenCL GPU code takes 0:08. Still a significant speedup.

And with cubic resampling, the original code takes 0:19, OpenCL CPU takes 0:09, and OpenCL GPU takes 0:07. Still better than twice as fast.
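The ratios behind these numbers can be recomputed from the quoted times with a short Python sketch. The names to_seconds, TIMES and speedups are hypothetical helpers for illustration. The times are the whole-second values given in this message, so the lanczos GPU ratio comes out nearer 33x than 36x; presumably the 36x figure reflects unrounded timings, but that is an assumption.

```python
def to_seconds(mss):
    """Convert an 'm:ss' string such as '5:31' into whole seconds."""
    minutes, seconds = mss.split(":")
    return int(minutes) * 60 + int(seconds)


# Wall-clock times quoted in this message, per resampler and backend.
TIMES = {
    "lanczos": {"original": "5:31", "opencl_cpu": "0:39", "opencl_gpu": "0:10"},
    "cubicspline": {"original": "0:59", "opencl_cpu": "0:13", "opencl_gpu": "0:08"},
    "cubic": {"original": "0:19", "opencl_cpu": "0:09", "opencl_gpu": "0:07"},
}


def speedups(times):
    """Return original_time / backend_time for each OpenCL backend."""
    out = {}
    for resampler, row in times.items():
        base = to_seconds(row["original"])
        out[resampler] = {
            backend: base / to_seconds(value)
            for backend, value in row.items()
            if backend != "original"
        }
    return out


if __name__ == "__main__":
    for resampler, row in speedups(TIMES).items():
        print(f"{resampler:12s} CPU {row['opencl_cpu']:.1f}x  GPU {row['opencl_gpu']:.1f}x")
```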

Basically, the OpenCL GPU code in all cases is I/O bound. The GPU is laughing and requesting more difficult work.
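One rough way to see this in the numbers is to compare, for each backend, how much the run time varies between the cheapest and the most expensive resampler. The Python sketch below uses the whole-second times quoted in this message; SECONDS and spread are hypothetical names for illustration. A spread close to 1 is consistent with some fixed cost such as I/O dominating the run, though it does not prove it.

```python
# Wall-clock seconds quoted in this message, keyed by backend then resampler.
SECONDS = {
    "original": {"lanczos": 331, "cubicspline": 59, "cubic": 19},
    "opencl_cpu": {"lanczos": 39, "cubicspline": 13, "cubic": 9},
    "opencl_gpu": {"lanczos": 10, "cubicspline": 8, "cubic": 7},
}


def spread(times_by_resampler):
    """Ratio of the slowest to the fastest resampler for one backend."""
    values = times_by_resampler.values()
    return max(values) / min(values)


if __name__ == "__main__":
    for backend, times in SECONDS.items():
        print(f"{backend:11s} slowest/fastest = {spread(times):.2f}")
```

With these figures the original code varies by a factor of about 17 between cubic and lanczos, while the GPU runs vary by less than 1.5, so the choice of resampling kernel barely changes the GPU total.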

I haven't tested all different types of data and commands. If anyone has any samples and warping commands for testing, now would be the time to send them to me. I don't know of any GPU bugs in the current code.


Here is my current code:
http://github.com/mailseth/OpenCL-integration-for-GRASS---GDAL

~Seth
_______________________________________________
gdal-dev mailing list
[email protected]
http://lists.osgeo.org/mailman/listinfo/gdal-dev
