bdoyle0182 opened a new pull request, #5386:
URL: https://github.com/apache/openwhisk/pull/5386

   ## Description
   The memory queue now tracks the last time an activation was pulled by a 
gRPC request to be sent to an invoker. If an activation is dropped from the 
queue because it aged out, and no invoker pulled a single activation during 
the entire time that activation sat in the queue, this gauge fires. It is a 
fail-safe that surfaces edge cases where etcd data gets out of sync and the 
scheduler believes containers exist that do not. Looking at activation 
timeouts alone is not enough to detect such an issue, since the action may 
simply be hitting its throttling limit on the number of containers.
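
As an illustration of the check described above, here is a minimal Python sketch. It is not the scheduler's actual code: `StaleQueueDetector`, its fields, and the `stale_drops` counter standing in for the gauge are hypothetical names, and the injectable `clock` exists only to make the logic easy to exercise.

```python
import time
from collections import deque
from dataclasses import dataclass, field
from typing import Callable, Deque, List, Optional, Tuple


@dataclass
class StaleQueueDetector:
    """Hypothetical stand-in for the memory queue's bookkeeping."""

    retention_seconds: float
    clock: Callable[[], float] = time.monotonic
    # Last time an invoker actually pulled an activation (None = never).
    last_pull_at: Optional[float] = None
    # Stand-in for the gauge: drops that saw no pull while queued.
    stale_drops: int = 0
    # Entries are (activation_id, enqueued_at), oldest first.
    _queue: Deque[Tuple[str, float]] = field(default_factory=deque)

    def enqueue(self, activation_id: str) -> None:
        self._queue.append((activation_id, self.clock()))

    def pull(self) -> Optional[str]:
        """Called when an invoker fetches an activation."""
        if not self._queue:
            return None
        activation_id, _ = self._queue.popleft()
        self.last_pull_at = self.clock()
        return activation_id

    def drop_expired(self) -> List[str]:
        """Drop aged-out activations and count the ones that were starved."""
        now = self.clock()
        dropped = []
        while self._queue and now - self._queue[0][1] >= self.retention_seconds:
            activation_id, enqueued_at = self._queue.popleft()
            # Starved: nothing was pulled since this activation was enqueued.
            if self.last_pull_at is None or self.last_pull_at < enqueued_at:
                self.stale_drops += 1
            dropped.append(activation_id)
        return dropped
```

The design point is that a drop counts only when the last pull predates the dropped activation's enqueue time, so ordinary throttling, where invokers are still pulling other activations, does not trip the signal.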
   
   So this can fire in two cases:
   1. There's a bug in the scheduler, and a restart of the invokers and/or 
schedulers is needed to get the queue back into a healthy state.
   2. The system-configured queue retention timeout is not longer than the max 
action timeout, so all available containers can be validly in use, up to the 
limit, for longer than the queue retention timeout. The system operator 
should take action to correct that configuration anyway.
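
The second case comes down to a configuration invariant that an operator could check up front. A minimal sketch, assuming both values are available in milliseconds; the function and parameter names are hypothetical and do not correspond to actual OpenWhisk configuration keys:

```python
def retention_covers_action_timeout(queue_retention_ms: int,
                                    max_action_timeout_ms: int) -> bool:
    """Return True when the queue retention timeout is strictly longer than
    the longest time an action may validly occupy a container."""
    return queue_retention_ms > max_action_timeout_ms
```

When this returns False, the gauge can fire even on a healthy system, because every container may be legitimately busy for longer than an activation is allowed to wait in the queue.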
   
   ## Related issue and scope
   - [ ] I opened an issue to propose and discuss this change (#????)
   
   ## My changes affect the following components
   - [ ] API
   - [ ] Controller
   - [ ] Message Bus (e.g., Kafka)
   - [ ] Loadbalancer
   - [X] Scheduler
   - [ ] Invoker
   - [ ] Intrinsic actions (e.g., sequences, conductors)
   - [ ] Data stores (e.g., CouchDB)
   - [ ] Tests
   - [ ] Deployment
   - [ ] CLI
   - [ ] General tooling
   - [ ] Documentation
   
   ## Types of changes
   - [ ] Bug fix (generally a non-breaking change which closes an issue).
   - [X] Enhancement or new feature (adds new functionality).
   - [ ] Breaking change (a bug fix or enhancement which changes existing 
behavior).
   
   ## Checklist:
   - [X] I signed an [Apache 
CLA](https://github.com/apache/openwhisk/blob/master/CONTRIBUTING.md).
   - [X] I reviewed the [style 
guides](https://github.com/apache/openwhisk/blob/master/CONTRIBUTING.md#coding-standards)
 and followed the recommendations (Travis CI will check :).
   - [ ] I added tests to cover my changes.
   - [ ] My changes require further changes to the documentation.
   - [ ] I updated the documentation where necessary.
   
   

