Hmmm...well, from what I see, mpirun was actually giving you the right answer! 
I only see TWO cores on each node, yet you told it to bind FOUR processes on 
each node, each proc to be bound to a unique core.

The error message was correct - there are not enough cores on those nodes to do 
what you requested.


On May 30, 2012, at 6:19 AM, Mike Dubman wrote:

> attached.
> 
> On Wed, May 30, 2012 at 2:32 PM, Jeff Squyres <jsquy...@cisco.com> wrote:
> On May 30, 2012, at 7:20 AM, Jeff Squyres wrote:
> 
> >> $hwloc-ls --of console
> >> Machine (32GB)
> >>  NUMANode L#0 (P#0 16GB) + Socket L#0 + L3 L#0 (20MB) + L2 L#0 (256KB) + 
> >> L1 L#0 (32KB) + Core L#0
> >>    PU L#0 (P#0)
> >>    PU L#1 (P#2)
> >>  NUMANode L#1 (P#1 16GB) + Socket L#1 + L3 L#1 (20MB) + L2 L#1 (256KB) + 
> >> L1 L#1 (32KB) + Core L#1
> >>    PU L#2 (P#1)
> >>    PU L#3 (P#3)
> >
> > Is this hwloc output exactly the same on both nodes?
> 
> 
> More specifically, can you send the lstopo xml output from each of the 2 
> nodes you ran on?
> 
> --
> Jeff Squyres
> jsquy...@cisco.com
> For corporate legal information go to: 
> http://www.cisco.com/web/about/doing_business/legal/cri/
> 
> 
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
> 
> <lstopo-out.tbz>_______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel

Reply via email to