We see the following error: *14:26:55* + taskset -c 2,3 timeout -s SIGSEGV 15m /var/lib/jenkins/jobs/gh-ompi-master-pr/workspace/ompi_install1/bin/mpirun -np 8 -bind-to none -mca pml ob1 -mca btl self,tcp taskset -c 2,3 /var/lib/jenkins/jobs/gh-ompi-master-pr/workspace/ompi_install1/examples/hello_c*14:26:55* /var/lib/jenkins/jobs/gh-ompi-master-pr/workspace/ompi_install1/bin/mpirun: Error: unknown option "-np"
The reason is that <install>/lib has no "openmpi" directory. My guess that it is somehow removed during jenkins execution. I'm checking now. 2016-07-21 20:11 GMT+06:00 Jeff Squyres (jsquyres) <jsquy...@cisco.com>: > On Jul 21, 2016, at 3:53 AM, Gilles Gouaillardet <gil...@rist.or.jp> > wrote: > > > > Folks, > > > > Mellanox Jenkins marks recent PR's as failed for very surprising reasons. > > > > mpirun --mca btl sm,self ... > > > > failed because processes could not contact each other. i was able to > reproduce this once on my workstation, > > > > and found the root cause was a dirty build and/or install dir. > > Gilles: I note that these tests also failed in MTT last night (e.g., at > Absoft, which *rarely* has errors): > > https://mtt.open-mpi.org/index.php?do_redir=2336 > > Are you sure that the cause is a dirty build and/or install dir? > > -- > Jeff Squyres > jsquy...@cisco.com > For corporate legal information go to: > http://www.cisco.com/web/about/doing_business/legal/cri/ > > _______________________________________________ > devel mailing list > de...@open-mpi.org > Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/devel > Link to this post: > http://www.open-mpi.org/community/lists/devel/2016/07/19254.php > -- С Уважением, Поляков Артем Юрьевич Best regards, Artem Y. Polyakov