Re: [OMPI devel] Jenkins nowhere land again

2017-10-03 Thread George Bosilca
We have an unused mac that we can add to the pool. I'll be more than happy
to help set it up.

  George.



On Tue, Oct 3, 2017 at 5:43 PM, Barrett, Brian via devel <
devel@lists.open-mpi.org> wrote:

> My MacOS box is back up and jobs are progressing again. The queue got kind
> of long, so it might be an hour or so before it catches up. I have some
> thoughts on monitoring so we get emails in case this happens and my team’s
> Product Manager found an unused Amazon-owned Mac Mini we’ll add to the pool
> so that I won’t have to drive home if this happens again.
>
> Brian
>
> On Oct 3, 2017, at 13:40, "r...@open-mpi.org"  wrote:
>
> I’m not sure either - I have the patch to fix the loop_spawn test problem,
> but can’t get it into the repo.
>
>
> On Oct 3, 2017, at 1:22 PM, Barrett, Brian via devel <
> devel@lists.open-mpi.org> wrote:
>
>
> I’m not sure entirely what we want to do.  It looks like both Nathan and
> I’s OS X servers died on the same day.  It looks like mine might be a
> larger failure than just Jenkins, because I can’t log into the machine
> remotely.  It’s going to be a couple hours before I can get home.  Nathan,
> do you know what happened to your machine?
>
>
> The only options for the OMPI builder are to either wait until Nathan or I
> get home and get our servers running again or to not test OS X (which has
> its own problems).  I don’t have a strong preference here, but I also don’t
> want to make the decision unilaterally.
>
>
> Brian
>
>
>
> On Oct 3, 2017, at 1:14 PM, r...@open-mpi.org wrote:
>
>
> We are caught between two infrastructure failures:
>
>
> Mellanox can’t pull down a complete PR
>
>
> OMPI is hanging on the OS-X server
>
>
> Can someone put us out of our misery?
>
> Ralph
>
>
> ___
>
> devel mailing list
>
> devel@lists.open-mpi.org
>
> https://lists.open-mpi.org/mailman/listinfo/devel
>
>
> ___
>
> devel mailing list
>
> devel@lists.open-mpi.org
>
> https://lists.open-mpi.org/mailman/listinfo/devel
>
>
> ___
> devel mailing list
> devel@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/devel
>
>
> ___
> devel mailing list
> devel@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/devel
>
___
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel

Re: [OMPI devel] Jenkins nowhere land again

2017-10-03 Thread Barrett, Brian via devel
My MacOS box is back up and jobs are progressing again. The queue got kind of 
long, so it might be an hour or so before it catches up. I have some thoughts 
on monitoring so we get emails in case this happens and my team's Product 
Manager found an unused Amazon-owned Mac Mini we'll add to the pool so that I 
won't have to drive home if this happens again.

Brian

On Oct 3, 2017, at 13:40, "r...@open-mpi.org" 
> wrote:

I'm not sure either - I have the patch to fix the loop_spawn test problem, but 
can't get it into the repo.


On Oct 3, 2017, at 1:22 PM, Barrett, Brian via devel 
> wrote:

I'm not sure entirely what we want to do.  It looks like both Nathan and I's OS 
X servers died on the same day.  It looks like mine might be a larger failure 
than just Jenkins, because I can't log into the machine remotely.  It's going 
to be a couple hours before I can get home.  Nathan, do you know what happened 
to your machine?

The only options for the OMPI builder are to either wait until Nathan or I get 
home and get our servers running again or to not test OS X (which has its own 
problems).  I don't have a strong preference here, but I also don't want to 
make the decision unilaterally.

Brian


On Oct 3, 2017, at 1:14 PM, r...@open-mpi.org wrote:

We are caught between two infrastructure failures:

Mellanox can't pull down a complete PR

OMPI is hanging on the OS-X server

Can someone put us out of our misery?
Ralph

___
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel

___
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel

___
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel
___
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel

Re: [OMPI devel] Jenkins nowhere land again

2017-10-03 Thread r...@open-mpi.org
I’m not sure either - I have the patch to fix the loop_spawn test problem, but 
can’t get it into the repo.


> On Oct 3, 2017, at 1:22 PM, Barrett, Brian via devel 
>  wrote:
> 
> I’m not sure entirely what we want to do.  It looks like both Nathan and I’s 
> OS X servers died on the same day.  It looks like mine might be a larger 
> failure than just Jenkins, because I can’t log into the machine remotely.  
> It’s going to be a couple hours before I can get home.  Nathan, do you know 
> what happened to your machine?
> 
> The only options for the OMPI builder are to either wait until Nathan or I 
> get home and get our servers running again or to not test OS X (which has its 
> own problems).  I don’t have a strong preference here, but I also don’t want 
> to make the decision unilaterally.
> 
> Brian
> 
> 
>> On Oct 3, 2017, at 1:14 PM, r...@open-mpi.org wrote:
>> 
>> We are caught between two infrastructure failures:
>> 
>> Mellanox can’t pull down a complete PR
>> 
>> OMPI is hanging on the OS-X server
>> 
>> Can someone put us out of our misery?
>> Ralph
>> 
>> ___
>> devel mailing list
>> devel@lists.open-mpi.org
>> https://lists.open-mpi.org/mailman/listinfo/devel
> 
> ___
> devel mailing list
> devel@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/devel

___
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel

Re: [OMPI devel] Jenkins nowhere land again

2017-10-03 Thread Barrett, Brian via devel
I’m not sure entirely what we want to do.  It looks like both Nathan and I’s OS 
X servers died on the same day.  It looks like mine might be a larger failure 
than just Jenkins, because I can’t log into the machine remotely.  It’s going 
to be a couple hours before I can get home.  Nathan, do you know what happened 
to your machine?

The only options for the OMPI builder are to either wait until Nathan or I get 
home and get our servers running again or to not test OS X (which has its own 
problems).  I don’t have a strong preference here, but I also don’t want to 
make the decision unilaterally.

Brian


> On Oct 3, 2017, at 1:14 PM, r...@open-mpi.org wrote:
> 
> We are caught between two infrastructure failures:
> 
> Mellanox can’t pull down a complete PR
> 
> OMPI is hanging on the OS-X server
> 
> Can someone put us out of our misery?
> Ralph
> 
> ___
> devel mailing list
> devel@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/devel

___
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel