Enable telegraf and prometheus for mongodb

Change-Id: I1c7ba61e72cb733f2d1f061faa70edd62afed184
PROD-Related: PROD-21195
master
Dmitry Kalashnik 6 years ago
parent
commit
c836872022
3 changed files with 48 additions and 1 deletion
  1. metadata/service/support.yml (+5, -1)
  2. mongodb/meta/prometheus.yml (+37, -0)
  3. mongodb/meta/telegraf.yml (+6, -0)

metadata/service/support.yml (+5, -1)

@@ -8,4 +8,8 @@ parameters:
       sensu:
         enabled: true
       sphinx:
-        enabled: false
+        enabled: false
+      prometheus:
+        enabled: true
+      telegraf:
+        enabled: true

mongodb/meta/prometheus.yml (+37, -0)

@@ -0,0 +1,37 @@
+{%- from "mongodb/map.jinja" import server with context %}
+{%- if server.get('enabled', False) %}
+{%- raw %}
+server:
+  alert:
+    MongoDBServiceDown:
+      if: >-
+        mongodb_up == 0
+      for: 1m
+      labels:
+        severity: minor
+        service: mongodb
+      annotations:
+        summary: "MongoDB service is down"
+        description: "The MongoDB service on the {{ $labels.host }} node is down for 1 minute."
+    MongoDBServiceOutage:
+      if: >-
+        count(mongodb_up == 0) == count(mongodb_up)
+      for: 1m
+      labels:
+        severity: critical
+        service: mongodb
+      annotations:
+        summary: "MongoDB service outage"
+        description: "All MongoDB services are down for 1 minute."
+    MongoDBNoPrimaryMember:
+      if: >-
+        absent({__name__=~"mongodb.*",state="PRIMARY"})
+      for: 1m
+      labels:
+        severity: critical
+        service: mongodb
+      annotations:
+        summary: "MongoDB cluster has no primary member"
+        description: "MongoDB cluster has no primary member for 1 minute."
+{%- endraw %}
+{%- endif %}
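
Note on the alert definitions above: as a rough sketch, this is how the three alerts might look when written directly as a Prometheus 2.x rule file. It assumes that the `if` key corresponds to the rule's `expr` field and that the rules end up in a single group; the group name `mongodb` is hypothetical, and the actual rendering is done by the Prometheus formula, which this commit does not show. The `{{ $labels.host }}` placeholder is Prometheus template syntax, which is why the source wraps the definitions in `{%- raw %}` so that Jinja leaves it untouched.

```yaml
# Sketch only: hand-written equivalent, not output generated by the formula.
groups:
  - name: mongodb  # hypothetical group name
    rules:
      - alert: MongoDBServiceDown
        # Matches each instance whose mongodb_up sample is 0.
        expr: mongodb_up == 0
        for: 1m
        labels:
          severity: minor
          service: mongodb
        annotations:
          summary: "MongoDB service is down"
          description: "The MongoDB service on the {{ $labels.host }} node is down for 1 minute."
      - alert: MongoDBServiceOutage
        # count() over an empty vector returns no result, so this only
        # matches when at least one instance is down and all of them are.
        expr: count(mongodb_up == 0) == count(mongodb_up)
        for: 1m
        labels:
          severity: critical
          service: mongodb
        annotations:
          summary: "MongoDB service outage"
          description: "All MongoDB services are down for 1 minute."
      - alert: MongoDBNoPrimaryMember
        # absent() returns 1 when no mongodb* series carries state="PRIMARY".
        expr: absent({__name__=~"mongodb.*",state="PRIMARY"})
        for: 1m
        labels:
          severity: critical
          service: mongodb
        annotations:
          summary: "MongoDB cluster has no primary member"
          description: "MongoDB cluster has no primary member for 1 minute."
```

A file in this form can be checked with `promtool check rules <file>` before it is loaded.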

mongodb/meta/telegraf.yml (+6, -0)

@@ -0,0 +1,6 @@
+{%- from "mongodb/map.jinja" import server with context %}
+{%- if server.get('enabled', False) %}
+agent:
+  input:
+    mongodb:
+{%- endif %}
