I took a look. Not yet sure what the cause was, but perhaps these data
will help you.

First thing was to zoom-in on the relationship between latency and
instances. Attached is the graph. You can see there that first
latencies shoot up, and then subsequently the scheduler is adding
instances, and then latencies drop. Then, the scheduler starts
releasing instances. There are two clear examples of this
multi-phased-reaction-cycle in the graph.

Zooming in on the logs at 16:40 (an interesting point because the
latency just shot up) you can see many examples where ms= is around
3-4 seconds and pending_ms= is near zero:
https://appengine.google.com/logs?app_id=showmypc&version_id=1.355223391043075267&severity_level_override=1&severity_level=3&tz=US%2FPacific&filter=&filter_type=regex&date_type=datetime&date=2011-12-07&time=16%3A40%3A50&limit=20&view=Search

Which suggests that the app code is stuck doing something, but since
cpu_ms=0 it must just be waiting on an api call. Now, the app
primarily uses the memcache api, but it also uses the urlfetch api to
make requests to service1.showmypc.com. One possible explanation is
that this remote website went down, causing all your instances to
hang, further incoming requests to go to the pending queue, and the
scheduler to go into reactive mode. At present, the scheduler doesn't
try to specifically diagnose that the app is down in a way where more
instances wouldn't help, it just keeps adding.

But I don't have concrete proof that this explains what happened in
these time periods, it's just what stands out in the data.

On Wed, Dec 7, 2011 at 6:29 PM, smwatch <[email protected]> wrote:
> We are using Python 2.6 and not yet on HRD
>
> Please suggest what else could it be, can someone from google see into
> our application console , why suddenly after months of running, so
> many instances came and blew the top of our daily limits.
> As I said no new traffic at all, this is a usual controlled traffic.
>
> The only thing I can see was the maintenance google did yesterday.
>
>
> On Dec 7, 5:12 pm, Felippe Bueno <[email protected]> wrote:
>> Are you using python 2.7 ? HRD ?
>>
>> Python 2.7 has a known issue in combination with threadsafe: yes
>>
>> see issue 6323
>>
>>
>>
>>
>>
>>
>>
>> On Wed, Dec 7, 2011 at 11:03 PM, smwatch <[email protected]> wrote:
>> > NOW our billing rate out.
>>
>> > Today we are seeing upto 400 Frontend Instances , with the usual
>> > normal traffic and our daily billing is ready to expire.
>>
>> > Our usual number of Instances that are shown on any average day are 6
>> > instance, we have a Limit of maximum 1 idle instance.
>>
>> > Traffic is normal, but we see a lot of errors reported in last 6
>> > hours. No new code has been deployed in last many days, no other
>> > billing errors.
>>
>> > Anyone from Google please suggest whats going on? What can we do here
>> > in this case.
>>
>> > We just try to do a dummy deployment  with minor valid change in the
>> > code to see if the instances kill themselves but still the problem
>> > remains and our money is running out.
>>
>> > --
>> > You received this message because you are subscribed to the Google Groups
>> > "Google App Engine" group.
>> > To post to this group, send email to [email protected].
>> > To unsubscribe from this group, send email to
>> > [email protected].
>> > For more options, visit this group at
>> >http://groups.google.com/group/google-appengine?hl=en.
>
> --
> You received this message because you are subscribed to the Google Groups 
> "Google App Engine" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to 
> [email protected].
> For more options, visit this group at 
> http://groups.google.com/group/google-appengine?hl=en.
>

-- 
You received this message because you are subscribed to the Google Groups 
"Google App Engine" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/google-appengine?hl=en.

<<attachment: Screenshot-8.png>>

Reply via email to