Hi,

just wanted to point that if you are interested in a "framework" for submit jobs to a thread pool, I can point to

https://github.com/uclouvain/openjpeg/blob/master/src/lib/openjp2/thread.h

https://github.com/uclouvain/openjpeg/blob/master/src/lib/openjp2/thread.c

which is a port in C I've done from the equivalent C++ code of GDAL (https://github.com/OSGeo/gdal/blob/master/port/cpl_worker_thread_pool.h).

It has a pthread and Win32 implementation. It could be easily extracted from libopenjp2 (pending a opj_ -> grass_ renaming to avoid conflicts if both are combined)

The high level API is the opj_thread_pool_* one.

Probably not super fancy, but serves my need well. The user is responsible for selecting the number of threads and splitting the work load in jobs that are queued to the thread pool and consumed by the threads as soon as they are no longer busy.

Even


Le 14/10/2022 à 11:02, Moritz Lennert a écrit :
Am 09.10.2022 20:45 schrieb Brad ReDacted:
Those variables would be...? Is this documented somewhere? If not, can
we get it documented?

https://grasswiki.osgeo.org/wiki/Working_with_GRASS_without_starting_it_explicitly


OpenMP can do far more than just loop "unrolling", these days. Tasking
sections to run concurrent is also quite trivial. It can also offload
to GPU, etc. Check out the v4.5+ spec. It's pretty impressive. I
believe it can do most of what pthreads does, but you certainly lose
control of implementation details. Some compilers have an omp library
while others convert to pthreads.

I do find myself rewriting algorithms so that OpenMP can handle them.
It doesn't seem to handle nested loops with breaks very well and I'm
not entirely sure why.

On 10/9/2022 10:43 AM, William Hargrove wrote:
Can still run GRASS outside the shell by setting all of the environment variables appropriately ...

OpenMP just works by "unrolling" all of the determinate loops, i.e., the ones that iterate a fixed number of times.  No speedups to anything else.

Speedup from OpenMP will be limited, depending on the number of determinate loops present, and how much of the load they represent.

pthreads are totally flexible, but the programmer has to specify everything, very carefully ...

But pthreads can speed up lots of stuff outside of determinate loops ...


HTH,

Bill H.


On 10/9/2022 12:37 PM, Brad ReDacted wrote:
Hello,

I'm working on adding parallelism to modules, but debugging is turning out to be a logistical nightmare:

Why do I not get any reporting from GCC option '-fsanitize=address|thread"?

I am also having trouble getting the profiler to work properly inside GRASS (I assume due to shell?). The gmon.out file produced has no usable data.

OpenMP is extremely poorly supported by most tools. valgrind with helgrind reports a lot of nonsense. I can't seem to get the Intel linux tools to work properly, either.

BTW, we are supporting both pthreads and OpenMP. While this isn't an issue in most cases, there can be races and deadlocks if not handled properly. Pthreads aren't entirely portable. OpenMP is. However, pthreads gives us a more control. May I suggest using OpenMP for most modules and reserve Pthreads to libraries, etc? Or should we start moving away from pthreads?

Any suggestions would be greatly appreciated!



--
Best Regards,
-Brad

_______________________________________________
grass-dev mailing list
grass-dev@lists.osgeo.org
https://lists.osgeo.org/mailman/listinfo/grass-dev

_______________________________________________
grass-dev mailing list
grass-dev@lists.osgeo.org
https://lists.osgeo.org/mailman/listinfo/grass-dev

--
http://www.spatialys.com
My software is free, but my time generally not.
_______________________________________________
grass-dev mailing list
grass-dev@lists.osgeo.org
https://lists.osgeo.org/mailman/listinfo/grass-dev

Reply via email to