Re: [kubernetes-users] Build queue hangs after timer task error

2018-08-08 Thread Rodrigo Campos
Can you please elaborate? What job? How are you running it? I don't even understand how Kubernetes is involved in this. Can you explain me more? I really don't follow On Wednesday, August 8, 2018, Basanta Kumar Panda wrote: > builds of all other jobs hangs after timer task error .. > Aug 07,

Re: [kubernetes-users] How to monitor/alert on container/pod death or restart

2018-08-08 Thread Rodrigo Campos
It really depends on the monitoring solution. Usually this metrics are exported and you can just predicate on them, in the language they provide. In my case, I'm using a hosted solution (signalfx) that gives you a daemon set and sends that metric to them. You can then predicate. We have alerts

Re: [kubernetes-users] Does GKE CNI calico plugin shares "etcd" with control plane?

2018-08-08 Thread 'Tim Hockin' via Kubernetes user discussion and Q
We only use Calico in the mode that reads node.spec.podCIDR, so it doesn't need etcd. On Wed, Aug 8, 2018 at 3:36 PM parthi.geo wrote: > Wondering if Google Kubernetes Engine native CNI add-on (calico) shares > "etcd" with master / control plane. > > > Regards > Parthiban,S > > -- > You

[kubernetes-users] Best way to back-up and replicate GKE cluster in different region

2018-08-08 Thread parthi.geo
Pointers on tools & best practices to back-up and replicate a running GKE cluster in a different region in a consistent fashion. Considering GKE is a managed service i.e control plane availability is managed by GCP, what better options are available to snapshot & replicate a cluster. Kubectl

[kubernetes-users] Does GKE CNI calico plugin shares "etcd" with control plane?

2018-08-08 Thread parthi.geo
Wondering if Google Kubernetes Engine native CNI add-on (calico) shares "etcd" with master / control plane. Regards Parthiban,S -- You received this message because you are subscribed to the Google Groups "Kubernetes user discussion and Q" group. To unsubscribe from this group and stop

Re: [kubernetes-users] How to monitor/alert on container/pod death or restart

2018-08-08 Thread Agrawal, Punit
David, What we do is export the kubernetes cluster events to Cloud PubSub using Stackdriver Export and then we have SumoLogic setup to ingest logs from PubSub. Then we use the SumoLogic Scheduled Search Capabilities to send alerts based on certain events. Punit Agrawal Site Reliability

Re: [kubernetes-users] How to monitor/alert on container/pod death or restart

2018-08-08 Thread Marcio Garcia
David, In Datadog events you can see the killed pods. But, if you have containers that need to be killed because they don't die when receiving a stop, you'll see a lot of events like: KILLED, DESTROYED, and this is not necessarily an error, could be only a container being restarted, keep that

Re: [kubernetes-users] How to monitor/alert on container/pod death or restart

2018-08-08 Thread 'Tim Hockin' via Kubernetes user discussion and Q
Most of what you're asking for is available via the k8s API, if you watch it. On Wed, Aug 8, 2018 at 12:58 PM David Rosenstrauch wrote: > As we're getting ready to go to production with our k8s-based system, > we're trying to pin down exactly how we're going to do all the needed >

Re: [kubernetes-users] How to monitor/alert on container/pod death or restart

2018-08-08 Thread David Rosenstrauch
Thanks for the response, Marcio. We've actually recently started using Datadog already. (At least in dev/qa.) But DD is a bit of a sea of metrics, and I'm not clear how we would accomplish one of the specific tasks I've mentioned - for example, alerting when k8s has killed a container or

Re: [kubernetes-users] How to monitor/alert on container/pod death or restart

2018-08-08 Thread Marcio Garcia
Hi David, You can use DataDog to achieve this. On 8/8/18, David Rosenstrauch wrote: > As we're getting ready to go to production with our k8s-based system, > we're trying to pin down exactly how we're going to do all the needed > monitoring/alerting for it. We can easily collect many of

[kubernetes-users] How to monitor/alert on container/pod death or restart

2018-08-08 Thread David Rosenstrauch
As we're getting ready to go to production with our k8s-based system, we're trying to pin down exactly how we're going to do all the needed monitoring/alerting for it. We can easily collect many of the metrics we need (using kube-state-metrics to feed into prometheus, and/or Datadog) and

Re: [kubernetes-users] How to mount a service status file with pvc in kubernetes?

2018-08-08 Thread 'Tim Hockin' via Kubernetes user discussion and Q
Can you explain more what you mean? Who writes this file? Who reads this file? What is the lifetime of this file? Is this a simple one-writer, one-reader case? On Fri, Aug 3, 2018 at 10:34 AM 'zulv' via Kubernetes user discussion and Q wrote: > The issue is that I would like to persistent

Re: [kubernetes-users] Build queue hangs after timer task error

2018-08-08 Thread Basanta Kumar Panda
builds of all other jobs hangs after timer task error .. Aug 07, 2018 11:37:51 PM SEVERE hudson.triggers.SafeTimerTask run Timer task hudson.slaves.NodeProvisioner$NodeProvisionerInvoker@7dffefc4 failed On Wed, Aug 8, 2018 at 6:20 PM, Rodrigo Campos wrote: > Can you please elaborate? What do

Re: [kubernetes-users] Tools to monitor for K8

2018-08-08 Thread Jay Pipes
On 08/08/2018 10:56 AM, Ben Kochie wrote: On Wed, Aug 8, 2018 at 3:07 PM Jay Pipes > wrote: cadvisor is a library dependency for kubelet: https://github.com/kubernetes/kubernetes/blob/master/pkg/kubelet/cadvisor/BUILD#L29-L31 so when you install

Re: [kubernetes-users] Tools to monitor for K8

2018-08-08 Thread Ben Kochie
On Wed, Aug 8, 2018 at 3:07 PM Jay Pipes wrote: > cadvisor is a library dependency for kubelet: > > > https://github.com/kubernetes/kubernetes/blob/master/pkg/kubelet/cadvisor/BUILD#L29-L31 > > so when you install kubelet, you will install the cadvisor library. > > Prometheus does not use

Re: [kubernetes-users] Tools to monitor for K8

2018-08-08 Thread Vasu Srivasthav
FYI, https://github.com/kubernetes/kubernetes/issues/56523 Thanks, Sri. On Wed, Aug 8, 2018, 6:07 AM Jay Pipes wrote: > cadvisor is a library dependency for kubelet: > > > https://github.com/kubernetes/kubernetes/blob/master/pkg/kubelet/cadvisor/BUILD#L29-L31 > > so when you install

[kubernetes-users] Kubernetes Office Hours, August 15th

2018-08-08 Thread Jorge Castro
Hey everyone, 1 week warning until our next office hours, which will be August 15th. Last month we barely had any questions from Europe and an overwhelming amount of questions during the US session. So if you're in a non-US time zone you can help even us out by submitting questions or attending

Re: [kubernetes-users] Tools to monitor for K8

2018-08-08 Thread Jay Pipes
cadvisor is a library dependency for kubelet: https://github.com/kubernetes/kubernetes/blob/master/pkg/kubelet/cadvisor/BUILD#L29-L31 so when you install kubelet, you will install the cadvisor library. Prometheus does not use cadvisor, no. Best, -jay On 08/08/2018 06:17 AM, Niranjan Kolly

Re: [kubernetes-users] Build queue hangs after timer task error

2018-08-08 Thread Rodrigo Campos
Can you please elaborate? What do you mean with build task? On Wednesday, August 8, 2018, Basanta Kumar Panda wrote: > Hi, > > Build queue hangs after timer task error is there any solution.. > Regards, > Basanta > > > > -- > You received this message because you are subscribed to the Google

[kubernetes-users] Survey for CoreDNS Adoption as Kubernetes's DNS Server

2018-08-08 Thread Francois Tur
If you are using CoreDNS as DNS Service of Kubernetes, please, take time to fill the survey and provide us some feedback . CoreDNS is now part of Kubernetes, delivering the DNS and Discovery Service

[kubernetes-users] Build queue hangs after timer task error

2018-08-08 Thread Basanta Kumar Panda
Hi, Build queue hangs after timer task error is there any solution.. Regards, Basanta -- You received this message because you are subscribed to the Google Groups "Kubernetes user discussion and Q" group. To unsubscribe from this group and stop receiving emails from it, send an email to

Re: [kubernetes-users] Tools to monitor for K8

2018-08-08 Thread Niranjan Kolly
Does it require cadvisor in the K8 to collect the metrics. ? Thanks, Niranjan On Wed, Aug 8, 2018 at 12:31 PM, Ben Kochie wrote: > You may be interested in the Prometheus Operator. > > https://coreos.com/blog/the-prometheus-operator.html >

Re: [kubernetes-users] Tools to monitor for K8

2018-08-08 Thread Ben Kochie
You may be interested in the Prometheus Operator. https://coreos.com/blog/the-prometheus-operator.html https://coreos.com/operators/prometheus/docs/latest/user-guides/getting-started.html https://github.com/coreos/prometheus-operator On Wed, Aug 8, 2018 at 8:06 AM Niranjan Kolly wrote: > Hi

Re: [kubernetes-users] Tools to monitor for K8

2018-08-08 Thread Niranjan Kolly
Hi Sri, Thanks for the reply. we are using K8 1.10.2, is cadvisor is bundled in the kubelet . How to check that ? Thanks, Niranjan On Tue, Aug 7, 2018 at 7:00 PM, Vasu Srivasthav wrote: > Hi, > > Open source tools to monitor K8, Grafana and Prometheus. > > > > -Sri > > On Tue, Aug 7, 2018,