I've solved this problem by omitting --with-libevent=/usr from the configuration to force it to use the internal version. I thought I had tried this before posting but evidently did something wrong.
Dave On Tue, Dec 13, 2016 at 9:57 PM, <users-requ...@lists.open-mpi.org> wrote: > Send users mailing list submissions to > users@lists.open-mpi.org > > To subscribe or unsubscribe via the World Wide Web, visit > https://rfd.newmexicoconsortium.org/mailman/listinfo/users > or, via email, send a message with subject or body 'help' to > users-requ...@lists.open-mpi.org > > You can reach the person managing the list at > users-ow...@lists.open-mpi.org > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of users digest..." > > > Today's Topics: > > 1. epoll add error with OpenMPI 2.0.1 and SGE (Dave Turner) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Tue, 13 Dec 2016 21:57:40 -0600 > From: Dave Turner <drdavetur...@gmail.com> > To: users@lists.open-mpi.org > Subject: [OMPI users] epoll add error with OpenMPI 2.0.1 and SGE > Message-ID: > <CAFGXdkwWieF2C9cdpEV2zJAN+QY0Uf4T7Mj_4=wOUMCdeGYV=g@ > mail.gmail.com> > Content-Type: text/plain; charset="utf-8" > > [warn] Epoll ADD(4) on fd 1 failed. Old events were 0; read change was 0 > (none); write change was 1 (add): Operation not permitted > > Gentoo with compiled OpenMPI 2.0.1 and SGE > ompi_info --all file attached > > We recently did a maintenance upgrade to our cluster including > moving to OpenMPI 2.0.1. Fortran programs now give the > epoll add error above at the start of a run and the stdout file > freezes until the end of the run when all info is dumped. > > I've read about this problem and it seems to be a file lock > issue where OpenMPI and SGE are both trying to lock the > same output file. We have not seen this problem with > previous versions of OpenMPI. > > We've tried compiling OpenMPI with and without > specifying --with-libevent=/usr, and I've tried compiling > with --disable-event-epoll and using -mca opal_event_include poll. > Both of these were suggestions from a few years back but > neither affects the problem. I've also tried redirecting the output > manually as: > > mpirun -np 4 ./app > file.out > > This just locks file.out instead with all the output again being > dumped at the end of the run. > > We also do not have this issue with 1.10.4 installed. > > Any suggestions? Has anyone else run into this problem? > > Dave Turner > -- > Work: davetur...@ksu.edu (785) 532-7791 > 2219 Engineering Hall, Manhattan KS 66506 > Home: drdavetur...@gmail.com > cell: (785) 770-5929 > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: <https://rfd.newmexicoconsortium.org/mailman/private/users/ > attachments/20161213/beb370b0/attachment.html> > -------------- next part -------------- > A non-text attachment was scrubbed... > Name: ompi_info.2.0.1.all > Type: application/octet-stream > Size: 202298 bytes > Desc: not available > URL: <https://rfd.newmexicoconsortium.org/mailman/private/users/ > attachments/20161213/beb370b0/attachment.obj> > > ------------------------------ > > Subject: Digest Footer > > _______________________________________________ > users mailing list > users@lists.open-mpi.org > https://rfd.newmexicoconsortium.org/mailman/listinfo/users > > ------------------------------ > > End of users Digest, Vol 3675, Issue 2 > ************************************** > -- Work: davetur...@ksu.edu (785) 532-7791 2219 Engineering Hall, Manhattan KS 66506 Home: drdavetur...@gmail.com cell: (785) 770-5929
_______________________________________________ users mailing list users@lists.open-mpi.org https://rfd.newmexicoconsortium.org/mailman/listinfo/users