I get the following error when trying to run SHMEM programs using UCX.

[xiii@shini dir]$ oshrun -n 1 ./target/debug/main
[1556046469.890238] [shini:19769:0]            sys.c:619  UCX  ERROR
shmget(size=2097152 flags=0xfb0) for mm_recv_desc failed: Operation not
permitted, please check shared memory limits by 'ipcs -l'
[1556046469.895859] [shini:19769:0]            sys.c:619  UCX  ERROR
shmget(size=2097152 flags=0xfb0) for mm_recv_desc failed: Operation not
permitted, please check shared memory limits by 'ipcs -l'
[1556046469.899577] [shini:19769:0]            sys.c:619  UCX  ERROR
shmget(size=270532608 flags=0xfb0) for user allocation failed: Operation
not permitted, please check shared memory limits by 'ipcs -l'

As far as I can tell, the programs I'm running are not actually resource
constrained, and `ipcs -l` seems to indicate there's plenty of available
shared memory.  Other than this error, my code seems to run normally.

[xiii@shini dir]$ ipcs -l
------ Messages Limits --------
max queues system wide = 32000
max size of message (bytes) = 8192
default max size of queue (bytes) = 16384

------ Shared Memory Limits --------
max number of segments = 4096
max seg size (kbytes) = 18014398509465599
max total shared memory (kbytes) = 18014398509481980
min seg size (bytes) = 1

------ Semaphore Limits --------
max number of arrays = 32000
max semaphores per array = 32000
max semaphores system wide = 1024000000
max ops per semop call = 500
semaphore max value = 32767

Do you know what's causing this / if I need to worry about it / how I can
fix this error?

Thanks,

Ben
_______________________________________________
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Reply via email to