Woke up this morning to find that my cron jobs were not running from 23:39 
on 6/21 until 01:47 on 6/22.
A Cron Job is configured to run every 5 minutes.

The logs show: (note the missing activity after 23:29)

<https://lh3.googleusercontent.com/-doSyjtzggvE/UcWdlnM7DiI/AAAAAAAAC40/iHuhRzoE3h8/s1600/chronlogs.jpg>
The dashboard shows:

<https://lh4.googleusercontent.com/-75CtZdGa5s8/UcWe_wkvrSI/AAAAAAAAC5I/bGoN_tI7ZlM/s1600/chrondash.jpg>
There were no quota denials.

I have a couple of questions:

1) How do I proceed with my investigation to determine the root cause of 
this SNAFU.

2) How do I provide reliable service to my customers in light of this 
failure? Do I need to have the cron job send
something to a watchdog service? EC2? Heroku? Or is there a "Hey I have not 
been 'pinged' in 10 minutes, so I'm gonna throw a s&*t fit" service?
(Maybe it is called something like a 'button-down-no-longer-down' detection 
service?)

3) How do I calm my current anxiety and deep seated dread? As well as my 
anticipatory anxiety?

David
PS: I've already sacrificed a unicorn. See http://imgur.com/gallery/sAYJi4v

-- 
You received this message because you are subscribed to the Google Groups 
"Google App Engine" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/google-appengine.
For more options, visit https://groups.google.com/groups/opt_out.


Reply via email to