Tyler,

Can you please confirm whether you are using Open MPI v1.8 or v1.10?

This should already be fixed in v2.x and master; it seems the fix was never backported.

Could you please try this patch and check whether it fixes your problem?

Cheers,

Gilles



On 2/22/2016 11:12 AM, Tyler Benster wrote:

I believe Open MPI is looking for a Linux shared library (.so) when it should look for a Mach-O dynamic library (.dylib) on OS X.

Steps to reproduce:

 1. Build Open MPI with CUDA-aware support on OS X
    ('./configure --with-cuda').
 2. Run a job that supports CUDA libraries.
 3. The following error is generated:

    The library attempted to open the following supporting CUDA
    libraries, but each of them failed. CUDA-aware support is
    disabled.
    dlopen(libcuda.so.1, 9): image not found
    dlopen(/usr/lib64/libcuda.so.1, 9): image not found
    If you are not interested in CUDA-aware support, then run with
    --mca mpi_cuda_support 0 to suppress this message. If you are
    interested in CUDA-aware support, then try setting
    LD_LIBRARY_PATH to the location of libcuda.so.1 to get passed
    this issue.

I do have libcuda.dylib in my path.

Any help is much appreciated!







diff --git a/ompi/mca/common/cuda/common_cuda.c b/ompi/mca/common/cuda/common_cuda.c
index 1ad679c..91fed98 100644
--- a/ompi/mca/common/cuda/common_cuda.c
+++ b/ompi/mca/common/cuda/common_cuda.c
@@ -252,7 +252,7 @@ static int mca_common_cuda_init(opal_common_cuda_function_table_t *ftable)
 int mca_common_cuda_stage_one_init(void)
 {
     int retval, i, j;
-    char *cudalibs[] = {"libcuda.so.1", NULL};
+    char *cudalibs[] = {"libcuda.so.1", "libcuda.dylib", NULL};
     char *searchpaths[] = {"", "/usr/lib64", NULL};
     char **errmsgs = NULL;
     char *errmsg = NULL;
