美文网首页DevOps干货集市
kubernetes pod status 监控

kubernetes pod status 监控

作者: 骆的沙 | 来源:发表于2019-05-08 12:32 被阅读0次

dashboard

监控项:

  • 各个k8s集群所有pod not running的状态,监控pod的CrashLoopBackOff及一直处于ContainerCreating的状态,可通过grafana做告警
  • 监控pod的重启次数


    k8s-pod-status.png

TechStack

promtheus+grafana

PromQL

datasource接入prometheus,PromQL语句对应上图各项title

  • Container Waiting Reason
(sum(kube_pod_container_status_waiting_reason{reason!="ContainerCreating",namespace=~"$namespace",pod=~"$pod"} ) by (reason,namespace,pod) >0) 
*on(pod) group_right(reason) sum(kube_pod_info) by (pod,node,host_ip,pod_ip,namespace) 
or
(sum(kube_pod_container_status_waiting_reason{reason="ContainerCreating",namespace=~"$namespace",pod=~"$pod"} ) by (reason,namespace,pod) >0) 
-on(pod) group_right(reason) sum(kube_pod_info) by (pod,node,host_ip,pod_ip,namespace) 
  • pod重启次数(Last 5m)
(sum(kube_pod_container_status_restarts_total{namespace=~"$namespace",pod=~"$pod"}) by(namespace,pod) *on(pod) group_right() sum(kube_pod_info) by (pod,node,host_ip,pod_ip,namespace) 
-sum(kube_pod_container_status_restarts_total{namespace=~"$namespace",pod=~"$pod"} offset 5m) by(namespace,pod) *on(pod) group_right() sum(kube_pod_info) by (pod,node,host_ip,pod_ip,namespace))

相关文章

网友评论

    本文标题:kubernetes pod status 监控

    本文链接:https://www.haomeiwen.com/subject/lddmoqtx.html