Thanks, Ralph, for the reply. Sorry about the log file; I think I forgot to add an extension to the file name. Please find a new one attached to this email.
I'm sorry there isn't enough debugging information, but 'ompi_info' and '--debug-devel' are the only ways I know of for collecting information. Is there anything else I can try to provide more info?

When I execute 'mpirun --debug-devel -np 1 ./helloworld', the only output is the logging information from my last email. It got stuck at "[fpga1:00718] tmp: /tmp", and nothing from my helloworld program is printed to the screen. So I think mpirun is failing to start my executable, not failing to terminate. I was wondering whether this has anything to do with my newer kernel version, since everything works well with the older kernel.

Thanks,
--
Di Wu (Allan)
PhD student, VAST Laboratory <http://vast.cs.ucla.edu/>,
Department of Computer Science, UC Los Angeles
Email: al...@cs.ucla.edu

Date: Tue, 25 Nov 2014 07:29:51 -0800
From: Ralph Castain <r...@open-mpi.org>
To: Open MPI Developers <de...@open-mpi.org>
Subject: Re: [OMPI devel] OpenMPI v1.8 and v1.8.3 mpirun hangs at execution on an embedded ARM Linux kernel version 3.15.0
Message-ID: <898cb117-f6a6-4569-89c3-49b75d65b...@open-mpi.org>
Content-Type: text/plain; charset="utf-8"

I don't know what you put in that log file, but it was an executable and I'm not feeling that trusting :-)

I'm afraid there isn't enough debug output there to really tell anything. From what little I can see, I'm guessing that the application ran fine and you got the usual "hello" output and the helloworld process exited safely - is that correct? And so it is solely mpirun that is failing to cleanly terminate?

> On Nov 24, 2014, at 11:24 PM, Allan Wu <al...@cs.ucla.edu> wrote:
>
> Hello everyone,
>
> I have cross-compiled OpenMPI for an embedded ARM Linux. Everything works fine on my system based on Linux 3.8.0. I previously submitted a post related to my compilation, which can be found here: <http://www.open-mpi.org/community/lists/devel/2014/04/14440.php>.
>
> When I recently upgraded my Linux kernel to 3.15.0, mpirun began to hang even on the helloworld program. The program consists only of simple API calls: MPI_Init, MPI_Comm_size, MPI_Comm_rank, MPI_Finalize. The problem occurs even with 'mpirun -np 1 ./helloworld', and below is the output with --debug-devel (before it got stuck):
>
> [fpga1:00716] sess_dir_finalize: job session dir not empty - leaving
> [fpga1:00716] procdir: /tmp/openmpi-sessions-root@fpga1_0/63813/0/0
> [fpga1:00716] jobdir: /tmp/openmpi-sessions-root@fpga1_0/63813/0
> [fpga1:00716] top: openmpi-sessions-root@fpga1_0
> [fpga1:00716] tmp: /tmp
> [fpga1:00718] procdir: /tmp/openmpi-sessions-root@fpga1_0/63813/1/0
> [fpga1:00718] jobdir: /tmp/openmpi-sessions-root@fpga1_0/63813/1
> [fpga1:00718] top: openmpi-sessions-root@fpga1_0
> [fpga1:00718] tmp: /tmp
>
> I suspect it may be due to an incompatible kernel version or some missing kernel modules. I also tried the latest version, 1.8.3, and had the same problem. Does anyone have any thoughts? I have attached the output of 'ompi_info --all' to this email.
>
> Please let me know if I need to provide more information. Thanks in advance!
>
> Regards,
> --
> Di Wu (Allan)
> PhD student, VAST Laboratory <http://vast.cs.ucla.edu/>,
> Department of Computer Science, UC Los Angeles
> Email: al...@cs.ucla.edu <mailto:al...@cs.ucla.edu>
> <log.tar.gz>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: http://www.open-mpi.org/community/lists/devel/2014/11/16330.php
log.tar.gz
Description: GNU Zip compressed data
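
A note on reading the --debug-devel output quoted above: two different PIDs (00716 and 00718) appear in it, and the last line before the hang belongs to 00718. The following is a minimal Python sketch, not part of Open MPI, that groups such lines by PID so this is easier to see in a longer log. The function name group_by_pid and the regular expression are illustrative assumptions; the sketch also assumes every debug line has the form "[host:pid] message", as in the excerpt.

```python
import re

# Assumed line shape, taken from the excerpt: "[fpga1:00718] tmp: /tmp".
# An optional leading "> " (email quoting) is tolerated.
LINE_RE = re.compile(r"^(?:>\s*)?\[(?P<host>[^:\]]+):(?P<pid>\d+)\]\s+(?P<msg>.*)$")


def group_by_pid(lines):
    """Return {pid: [messages]} with PIDs in order of first appearance."""
    groups = {}
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # not a "[host:pid] ..." line, so skip it
        groups.setdefault(match.group("pid"), []).append(match.group("msg"))
    return groups


# The debug lines quoted in the message above, copied verbatim.
SAMPLE = """\
[fpga1:00716] sess_dir_finalize: job session dir not empty - leaving
[fpga1:00716] procdir: /tmp/openmpi-sessions-root@fpga1_0/63813/0/0
[fpga1:00716] jobdir: /tmp/openmpi-sessions-root@fpga1_0/63813/0
[fpga1:00716] top: openmpi-sessions-root@fpga1_0
[fpga1:00716] tmp: /tmp
[fpga1:00718] procdir: /tmp/openmpi-sessions-root@fpga1_0/63813/1/0
[fpga1:00718] jobdir: /tmp/openmpi-sessions-root@fpga1_0/63813/1
[fpga1:00718] top: openmpi-sessions-root@fpga1_0
[fpga1:00718] tmp: /tmp
""".splitlines()

if __name__ == "__main__":
    for pid, messages in group_by_pid(SAMPLE).items():
        print(f"pid {pid}: {len(messages)} lines, last: {messages[-1]}")
```

The grouping only shows which lines came from which PID; which process each PID corresponds to is not established by this output alone.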