Re: [SOLVED] Temporary Outage - ovirt.org Website

2016-03-07 Thread Michael Scherer
On Monday, 7 March 2016 at 15:50 +0200, Barak Korren wrote:
> >
> > While we can't do much about the first or the second (besides blocking),
> > I would suggest that the owner of the third IP reduce the check
> > frequency from one every 2 seconds to one every 5 minutes.
> >
> 
> Sure, that was us. I just removed the whole thing; I guess you have
> your own, better monitoring now.

Nope, we don't. Nagios is fine, but a check every second or two is excessive :/

-- 
Michael Scherer
Sysadmin, Community Infrastructure and Platform, OSAS




___
Infra mailing list
Infra@ovirt.org
http://lists.ovirt.org/mailman/listinfo/infra


Re: [SOLVED] Temporary Outage - ovirt.org Website

2016-03-07 Thread Barak Korren
>
> While we can't do much about the first or the second (besides blocking),
> I would suggest that the owner of the third IP reduce the check
> frequency from one every 2 seconds to one every 5 minutes.
>

Sure, that was us. I just removed the whole thing; I guess you have
your own, better monitoring now.

-- 
Barak Korren
bkor...@redhat.com
RHEV-CI Team


Re: [SOLVED] Temporary Outage - ovirt.org Website

2016-03-07 Thread Michael Scherer
On Monday, 7 March 2016 at 14:13 +0100, Mikey Ariel wrote:
> Hi folks,
> 
> It seems that our logs on OpenShift filled up and the ovirt.org website
> was unavailable for a few hours this morning.
> 
> The issue was resolved and the site is back online.

So, to add to what Mikey said:
- we have increased the partition from 1G to 3G (this was done
automatically with the previous OpenShift version, so insert my usual rant
about changes in *aaS), so we will have more room next time.

- an RCA (root cause analysis) shows that we got 250,000 requests today
from a single IP at OVH, around 100,000 from Bingbot reindexing the
website (from 4 or 5 IPs), and 30,000 requests from an IP at bezeqint.net,
with a Nagios check_http signature.
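
For anyone who wants to reproduce this kind of per-IP count, here is a
minimal sketch in Python. It assumes an Apache/nginx-style access log in
which the client IP is the first whitespace-separated field; the function
name, the sample lines, and the IPs (documentation ranges) are made up for
illustration and are not taken from the actual OpenShift logs.

```python
from collections import Counter


def top_clients(lines, n=3):
    """Return the n most frequent client IPs as (ip, count) pairs.

    Assumption: the client IP is the first whitespace-separated field,
    as in the common and combined access-log formats.
    """
    counts = Counter()
    for line in lines:
        fields = line.split(None, 1)
        if fields:  # skip blank lines
            counts[fields[0]] += 1
    return counts.most_common(n)


# Hypothetical sample data, not real log entries.
sample = [
    '203.0.113.7 - - [07/Mar/2016:09:00:00 +0000] "GET / HTTP/1.1" 200 512',
    '203.0.113.7 - - [07/Mar/2016:09:00:02 +0000] "GET / HTTP/1.1" 200 512',
    '203.0.113.7 - - [07/Mar/2016:09:00:04 +0000] "GET / HTTP/1.1" 200 512',
    '198.51.100.4 - - [07/Mar/2016:09:00:05 +0000] "GET /docs HTTP/1.1" 200 900',
    '198.51.100.4 - - [07/Mar/2016:09:00:09 +0000] "GET /blog HTTP/1.1" 200 700',
    '192.0.2.10 - - [07/Mar/2016:09:00:11 +0000] "GET / HTTP/1.1" 200 512',
]

for ip, hits in top_clients(sample, n=2):
    print(ip, hits)
```

On a real log you would pass an open file object instead of the sample
list, since the function only iterates over lines.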

While we can't do much about the first or the second (besides blocking),
I would suggest that the owner of the third IP reduce the check
frequency from one every 2 seconds to one every 5 minutes.
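
To put the two intervals in perspective, a quick back-of-the-envelope
calculation (a sketch only; the helper name is made up, and it assumes
the check runs at a perfectly steady interval over a full day):

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86400


def checks_per_day(interval_seconds):
    """Number of checks in 24 hours at a fixed interval, in seconds."""
    return SECONDS_PER_DAY // interval_seconds


every_2_seconds = checks_per_day(2)        # current rate
every_5_minutes = checks_per_day(5 * 60)   # suggested rate

print(every_2_seconds)                     # 43200
print(every_5_minutes)                     # 288
print(every_2_seconds // every_5_minutes)  # 150
```

So the suggested interval would cut that monitoring traffic by a factor
of 150.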

-- 
Michael Scherer
Sysadmin, Community Infrastructure and Platform, OSAS






[SOLVED] Temporary Outage - ovirt.org Website

2016-03-07 Thread Mikey Ariel
Hi folks,

It seems that our logs on OpenShift filled up and the ovirt.org website
was unavailable for a few hours this morning.

The issue was resolved and the site is back online.

Cheers,
Mikey

-- 
Mikey Ariel
Community Lead, oVirt
www.ovirt.org

"To be is to do" (Socrates)
"To do is to be" (Jean-Paul Sartre)
"Do be do be do" (Frank Sinatra)

Mobile: +420-702-131-141
IRC: mariel / thatdocslady
Twitter: @ThatDocsLady




