Re: [onap-discuss] Resiliency: Applying resource limits to a container - WIP

FREEMAN, BRIAN D Wed, 30 May 2018 13:20:41 -0700

Michael,

Did you set the flags in Java to adhere to the limits set in the chart. For 
some reason I remember a stackoverflow article about having to make sure that 
in Java 8 there were specific settings to enforce the limit.


Brian


From: [email protected] <[email protected]> 
On Behalf Of OBRIEN, FRANK MICHAEL
Sent: Wednesday, May 30, 2018 3:37 PM
To: [email protected]
Subject: [onap-discuss] Resiliency: Applying resource limits to a container - 
WIP

Team,
    Reaching out the community as we are applying some parts of resiliency 
changes for the first time.
    A discussion of working resource limits will benefit the community and the 
log pods - in both directions
    I am having difficulty getting cpu limits applied to a particular pod - 
collaborating with the community in case anyone else is bringing in resource 
requirements - I found another override in mariadb-galera but looking at the 
rendered yaml In the k8s dashboard - shows an empty resources section there as 
well - same issue.

    Ideally we all (PTL's) do this together (There is a hierarchy in progress) 
as we will need to decide on who get allocated from 2/4/8/16/32 cores on a 
particular cluster VM flavor.
    If anyone has implemented %percentage allocations let us know.  We can 
answer questions on what happens if multiple requests for 2 cores on a 2 core 
vm occur for example.

                The following patch looks straightforward - but it does not 
actually have any effect yet (with/without quotes) - I am going over overrides 
above and attempting to hardcode the values in the deployment.yaml to at least 
work backwards from a working override.

https://gerrit.onap.org/r/#/c/49553/1/kubernetes/log/charts/log-logstash/values.yaml<https://urldefense.proofpoint.com/v2/url?u=https-3A__gerrit.onap.org_r_-23_c_49553_1_kubernetes_log_charts_log-2Dlogstash_values.yaml&d=DwMFAg&c=LFYZ-o9_HUMeMTSQicvjIg&r=e3d1ehx3DI5AoMgDmi2Fzw&m=CzaKXuTUVBNjiTY4ghL1zOaWOelPdhLEWXeLDf09Yyc&s=kiEgVifMJ8rlgqV6OsoePvxHJigRVFqn08-ok1c-F_A&e=>

resources:
  limits:
    cpu: "2"
  requests:
    cpu: "2"

    Background:
-------------------
    LOG-376 deals with a runaway logstash container where it will take (n-1) 
vCores on 1 to 2 VMs on a 4-12 node cluster - I have seen 7 and 15 core 
saturation.
https://jira.onap.org/browse/LOG-376<https://urldefense.proofpoint.com/v2/url?u=https-3A__jira.onap.org_browse_LOG-2D376&d=DwMFAg&c=LFYZ-o9_HUMeMTSQicvjIg&r=e3d1ehx3DI5AoMgDmi2Fzw&m=CzaKXuTUVBNjiTY4ghL1zOaWOelPdhLEWXeLDf09Yyc&s=F2Og73RAghTP868OWL1S0ppwJshYL14c5JJMmFlFNB0&e=>
an example of a runaway pod that takes over 50% of the vCPU capacity of a 4 
node 64core/256g cluster
https://jira.onap.org/secure/attachment/11827/Screenshot%202018-05-30%2013.26.34.png<https://urldefense.proofpoint.com/v2/url?u=https-3A__jira.onap.org_secure_attachment_11827_Screenshot-25202018-2D05-2D30-252013.26.34.png&d=DwMFAg&c=LFYZ-o9_HUMeMTSQicvjIg&r=e3d1ehx3DI5AoMgDmi2Fzw&m=CzaKXuTUVBNjiTY4ghL1zOaWOelPdhLEWXeLDf09Yyc&s=OQdW63nRmcJJZU68AjoT7NbE3BPbXt5GeB4Hvu6Oh2A&e=>

    The root cause southbound/northbound is the main issue and being looked at 
- but for now I would like to limit the
    The ELK stack had logstash clustered into a ReplicaSet with periodic 
success and last week into a DaemonSet (1 container per VM) - however load 
balancing is still asymmetric - likely due to misuse of the LB service - 
looking into all of this - the current patch above is just to get the cluster 
back to a working state
    This issue with the ELK stack is at least 3 weeks old.

    Thank you
    /michael

This message and the information contained herein is proprietary and 
confidential and subject to the Amdocs policy statement,
you may review at 
https://www.amdocs.com/about/email-disclaimer<https://urldefense.proofpoint.com/v2/url?u=https-3A__www.amdocs.com_about_email-2Ddisclaimer&d=DwMFAg&c=LFYZ-o9_HUMeMTSQicvjIg&r=e3d1ehx3DI5AoMgDmi2Fzw&m=CzaKXuTUVBNjiTY4ghL1zOaWOelPdhLEWXeLDf09Yyc&s=UEIYptYOaNorhYZj8NNWQB77pW_DBFIvVDcEccg7FJY&e=>

_______________________________________________
onap-discuss mailing list
[email protected]
https://lists.onap.org/mailman/listinfo/onap-discuss

Re: [onap-discuss] Resiliency: Applying resource limits to a container - WIP

Reply via email to