Following Larry's suggestion to use /proc/config.gz, Allan sent me kernel
configs for the old (3.8) and new (3.15) kernels.
While there were more changes than I expected, none relates to removing an
API/feature that Open MPI is likely to be using.

-Paul

On Tue, Nov 25, 2014 at 11:28 AM, Larry Baker <ba...@usgs.gov> wrote:

> Allan,
>
> If you can still boot the old embedded system, a lot of times the config
> parameters are saved as /proc/config.gz.  You can at least them compare the
> two configs.
>
> Larry Baker
> US Geological Survey
> 650-329-5608
> ba...@usgs.gov
>
>
>
> On 25 Nov 2014, at 11:11 AM, Allan Wu wrote:
>
> Thanks Paul! Unfortunately '/boot' is not available in my embedded linux,
> and I do not have the configuration file for the old kernel since it is
> provided as is. However, I have the new kernel configuration since I
> compiled it myself. Would it be helpful if I provide you the .config file
> when I compile the kernel? It maybe quite painful to look through that file
> though. Is there any other way that I can obtain the configuration?
>
> I checked my config for the new kernel, and UNIX-domain sockets and Sys V
> IPC are both enabled in the build. Are there any other possibilities I can
> check?
>
> Thanks,
> Di
>
> --
> Di Wu (Allan)
> PhD student, VAST Laboratory <http://vast.cs.ucla.edu/>,
> Department of Computer Science, UC Los Angeles
> Email: al...@cs.ucla.edu
>
> On Tue, Nov 25, 2014 at 10:45 AM, Paul Hargrove <phhargr...@lbl.gov>
> wrote:
>
>> Allan,
>>
>> A likely possibility is that some important kernel feature (that Open MPI
>> assumes is present) is missing.
>> That includes not only "kernel modules" as you mention, but also features
>> configure in (or out) of the base kernel.
>> For instance, some embedded kernels omit UNIX-domain sockets and SysV IPC
>> support.
>>
>> If you can send me (preferably off-list) the kernel config files for the
>> old an new kernels I may be able to spot something.
>> If present, you are looking for /boot/config-[VERSION]
>>
>> -Paul
>>
>> On Tue, Nov 25, 2014 at 10:25 AM, Allan Wu <al...@cs.ucla.edu> wrote:
>>
>>> I'm sorry I forgot to change the subject when I reply to the digest
>>> issue. Please find my original email below.
>>>
>>> Regards,
>>> Di
>>>
>>> On Tue, Nov 25, 2014 at 10:19 AM, Allan Wu <al...@cs.ucla.edu> wrote:
>>>
>>>> Thanks Ralph for the reply. Sorry about the log file, I think I forgot
>>>> to put an extension to the file. Please find a new one attached with this
>>>> email.
>>>>
>>>> I'm sorry for not enough debugging information, but 'omp_info' and
>>>> '--debug-devel' are the only ways I know for collecting information, are
>>>> there any other things I can try to provide more info?
>>>>
>>>> When I execute 'mpirun --debug-devel -np 1 ./helloworld', all the
>>>> output is the logging information in my last email. It got stuck at
>>>>  "[fpga1:00718] tmp: /tmp", and nothing from my helloworld program is
>>>> printed out to the screen. So I think it is mpirun failing to start my
>>>> executable, not failing to terminate.
>>>>
>>>> I was wondering if this has anything to do with my newer kernel
>>>> version, since it works well in the old case.
>>>>
>>>> Thanks,
>>>> --
>>>> Di Wu (Allan)
>>>> PhD student, VAST Laboratory <http://vast.cs.ucla.edu/>,
>>>> Department of Computer Science, UC Los Angeles
>>>> Email: al...@cs.ucla.edu
>>>>
>>>>
>>>> Date: Tue, 25 Nov 2014 07:29:51 -0800
>>>> From:
>>>> Ralph Castain <r...@open-mpi.org>
>>>> To: Open MPI Developers <de...@open-mpi.org>
>>>> Subject: Re: [OMPI devel] OpenMPI v1.8 and v1.8.3 mpirun hangs at
>>>>         execution       on an embedded ARM Linux kernel version 3.15.0
>>>> Message-ID: <898cb117-f6a6-4569-89c3-49b75d65b...@open-mpi.org>
>>>> Content-Type: text/plain; charset="utf-8"
>>>>
>>>> I don?t know what you put in that log file, but it was an executable
>>>> and I?m not feeling that trusting :-)
>>>>
>>>> I?m afraid there isn?t enough debug output there to really tell
>>>> anything. From what little I can see, I?m guessing that the application ran
>>>> fine and you got the usual ?hello? output and the helloworld process exited
>>>> safely - is that correct? And so it is solely mpirun that is failing to
>>>> cleanly terminate?
>>>>
>>>>
>>>> > On Nov 24, 2014, at 11:24 PM, Allan Wu <al...@cs.ucla.edu> wrote:
>>>> >
>>>> > Hello everyone,
>>>> >
>>>> > I have cross-compiled OpenMPI for an embedded ARM Linux. Everything
>>>> works fine for my system based on Linux 3.8.0. I have previously submitted
>>>> a post related to my compilation, which can be found here:
>>>> http://www.open-mpi.org/community/lists/devel/2014/04/14440.php <
>>>> http://www.open-mpi.org/community/lists/devel/2014/04/14440.php>. When
>>>> I recently upgraded my Linux kernel to 3.15.0, mpirun begins to stuck at
>>>> even the helloworld program. The program consists only simple APIs:
>>>> MPI_Init, MPI_Comm_size, MPI_Comm_rank, MPI_Finalize. The problem occurs
>>>> even at 'mpirun -np 1 ./helloworld', and below are the output with
>>>> --debug-devel (before it got stuck):
>>>> > [fpga1:00716] sess_dir_finalize: job session dir not empty - leaving
>>>> > [fpga1:00716] procdir: /tmp/openmpi-sessions-root@fpga1_0/63813/0/0
>>>> > [fpga1:00716] jobdir: /tmp/openmpi-sessions-root@fpga1_0/63813/0
>>>> > [fpga1:00716] top: openmpi-sessions-root@fpga1_0
>>>> > [fpga1:00716] tmp: /tmp
>>>> > [fpga1:00718] procdir: /tmp/openmpi-sessions-root@fpga1_0/63813/1/0
>>>> > [fpga1:00718] jobdir: /tmp/openmpi-sessions-root@fpga1_0/63813/1
>>>> > [fpga1:00718] top: openmpi-sessions-root@fpga1_0
>>>> >
>>>> [fpga1:00718] tmp: /tmp
>>>> >
>>>> > I suspect maybe it is due to incompatible kernel version or some
>>>> missing kernel modules. I tried also with the latest version 1.8.3, and had
>>>> the same problem. Does anyone have any thoughts? I have attached the output
>>>> of 'ompi-info --all' with this email.
>>>> >
>>>> > Please let me know if I need to provide more information. Thanks in
>>>> advance!
>>>> >
>>>> > Regards,
>>>> > --
>>>> > Di Wu (Allan)
>>>> > PhD student, VAST?Laboratory <http://vast.cs.ucla.edu/>,
>>>> > Department of Computer Science, UC Los Angeles
>>>> > Email: al...@cs.ucla.edu <mailto:al...@cs.ucla.edu>
>>>> > <log.tar.gz>_______________________________________________
>>>> > devel mailing list
>>>> > de...@open-mpi.org
>>>> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>>> > Link to this post:
>>>> http://www.open-mpi.org/community/lists/devel/2014/11/16330.php
>>>>
>>>>
>>>
>>> _______________________________________________
>>> devel mailing list
>>> de...@open-mpi.org
>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/devel/2014/11/16341.php
>>>
>>
>>
>>
>> --
>> Paul H. Hargrove                          phhargr...@lbl.gov
>> Computer Languages & Systems Software (CLaSS) Group
>> Computer Science Department               Tel: +1-510-495-2352
>> Lawrence Berkeley National Laboratory     Fax: +1-510-486-6900
>>
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/11/16343.php
>
>
>


-- 
Paul H. Hargrove                          phhargr...@lbl.gov
Computer Languages & Systems Software (CLaSS) Group
Computer Science Department               Tel: +1-510-495-2352
Lawrence Berkeley National Laboratory     Fax: +1-510-486-6900

Reply via email to