Jaymes Mosher
aa2a52cf9b
Scratch using interfaces_override
7 vuotta sitten
Jaymes Mosher
603e62ab9e
Keep regex as default but still allow overrides.
7 vuotta sitten
Jaymes Mosher
cf6dbf1d6a
Use Pillar to chose which interfaces to monitor.
The `linux_netlink.ls` function used a regex to choose which interfaces
to collect metric for.
`_alphanum_re = re.compile(r'^[a-z0-9]+$')`
Unfortunately, by default this excludes vlan and tap interfaces, which
are kind of important. ie `bond0.120` or `tap2a3dab86-fb`.
We also have a problem where even if we update the regex to include
these interfaces... if someone deletes and spawns a new instance then
the tap device name changes on the compute host, which will not be
monitored unless someone re-runs the `collectd` on the compute again.
Less than ideal.
This commit lets us choose `VerboseInterface "all"` using Pillar data
to avoid this problem.
7 vuotta sitten
Simon Pasquier
4d290b5eec
Add Prometheus alerts for dropped packets
Change-Id: If50f18367b22338b3fba1ff15902d557a0bdf2ea
7 vuotta sitten
Simon Pasquier
d32688e7aa
Reword Prometheus alert messages
Change-Id: I54e02e0741d53ec7b2335145dc968b7b8c8f5e00
7 vuotta sitten
Ales Komarek
02f35a537c
Graph metadata
Change-Id: If0ee6f1ac5ab697559fcd853225e1520de2e8c1c
7 vuotta sitten
Simon Pasquier
234e14acda
Add Grafana dashboard for Prometheus datasource
Change-Id: Icacb0ca22a34f1ff438a895700040563d250bac9
7 vuotta sitten
Simon Pasquier
b1813426dc
Enable kernel, net and process metrics for Telegraf
Change-Id: I008818853c2058746be08365283b949177efa254
Depends-On: I3c3c569a013aff8c3ab8e46cffb93a60d74ddf09
7 vuotta sitten
Swann Croiset
d66a782570
Enable diskio input telegraf plugin
Change-Id: I80193afad1842f67967d1bab164f049078e3cd75
7 vuotta sitten
Erick Cantwell
e5770ac50f
[MMO-132] Check the length of the dict, instead of if it's defined (it
will always be defined since the default is an empty dict)
7 vuotta sitten
Filip Pytloun
ea11327afe
Fix grains generation when linux_netlink.ls is not available
Change-Id: Id4b0b405872457bd8b20f450e4031d6808d3cf59
7 vuotta sitten
Filip Pytloun
e70606d0d2
Manage grains using support metadata
Change-Id: I25fb0eb0d4b922b8853eceb0c1c220a4040e1704
7 vuotta sitten
Bartosz Kupidura
d8b54c95da
Add variables in prometheus alerts
Change-Id: I1765fc6aa4a8c3da25330f19bb043ddbf548b9ad
7 vuotta sitten
Damian Szeluga
1e47abe149
Add option to parametrize checks
7 vuotta sitten
Bartosz Kupidura
0bd8565876
Add support for prometheus
Change-Id: I66576b4ed40ef160c5f13747a908f018f252b6b4
7 vuotta sitten
Bartosz Kupidura
df9b40d973
Add telegraf support
Change-Id: I03bed44bafdebbcd22f487e59ef0de45dfbf3463
7 vuotta sitten
Simon Pasquier
a4a6f16bbe
Fix severity for the linux_system_cpu_warning alarm
Change-Id: Ic3a1e77f2d38c5d916dd3c07211a6ea160559e6f
7 vuotta sitten
Simon Pasquier
89b97640d0
Report swap metrics in bytes
Change-Id: Ic39fa0f18e0d9aeca0ef73ae6d985d12d15a1c3a
7 vuotta sitten
vmikes
37837f3280
Revert "turn off check swap if needed"
This reverts commit a63f4053f3
.
8 vuotta sitten
vmikes
a63f4053f3
turn off check swap if needed
8 vuotta sitten
Éric Lemoine
6d6f5b4c00
Remove support for log_collector
The support for collecting syslog is going to be moved to the rsyslog
formula.
8 vuotta sitten
Guillaume Thouvenin
b4f82c6013
Put Grafana dashboards into their own directory
8 vuotta sitten
Éric Lemoine
2f06db9e6d
Add more alarms
This commit adds more built-in alarms to the Linux formula.
8 vuotta sitten
Guillaume Thouvenin
e29d0a4f77
Provides Grafana dashboard
8 vuotta sitten
Swann Croiset
210e98304e
Redefine alerting property
The alerting property can be one of 'disabled', 'enabled' or
'enabled_with_notification'
8 vuotta sitten
Simon Pasquier
8db94b38f4
Fix Syslog pattern for system logs
Currently Syslog doesn't log the priority ('<PRI>').
8 vuotta sitten
Simon Pasquier
e877605126
Add timezone support for system logs
8 vuotta sitten
Éric Lemoine
1787f0b297
Rename netlink.py to linux_netlink.py
This is to comply to a comment from @cznewt in
https://github.com/tcpcloud/salt-formula-heka/pull/24 .
8 vuotta sitten
Éric Lemoine
1c39744e43
Use netlink collectd plugin instead of interface
This patch replaces the "interface" collectd plugin by the "netlink" one. The
"netlink" plugin provides the same metrics as "interface" but plus other
metrics such as the number of dropped packets.
8 vuotta sitten
Éric Lemoine
3035609caf
Remove Heka decoder tz handling
This is now handled by the Heka formula the same way for all the Heka sandbox
decoders. https://github.com/tcpcloud/salt-formula-heka/pull/20
8 vuotta sitten
Adam Tengler
599068289d
Orchestration metadata
8 vuotta sitten
Simon Pasquier
318ebd1569
Remove the log counter filter from meta/heka
This filter should be configured by the heka formula itself.
8 vuotta sitten
Ales Komarek
480003965f
Sample alarms
8 vuotta sitten
Éric Lemoine
b87ccd327d
Add timezone to syslog decoder config
8 vuotta sitten
Éric Lemoine
bf02e9dede
Use the proper module directory
The stacklight module dir is /usr/share/lma_collector/common, not
/usr/share/lma_collector_modules. This fixes it.
8 vuotta sitten
Éric Lemoine
1a1f375498
Set "hostname" in the linux_hdd_errors|counters filters
8 vuotta sitten
Éric Lemoine
fb25b9d60a
Fix decoder name
This fixes the decoder name in meta/heka.yml.
8 vuotta sitten
Éric Lemoine
48199ab618
Remove the alarm-related filters
This commit removes the alarm-related filter configuration from meta/heka.yml.
Alarms will be defined in a different way (still under discussion).
8 vuotta sitten
Ales Komarek
b02c10f0be
Collectd fixes
8 vuotta sitten
Ales Komarek
f94e16c569
Global collectd update
8 vuotta sitten
vmikes
a457359f89
update warning threshold
8 vuotta sitten
Filip Pytloun
e7a1ef7f4c
Enable contextswitch collectd plugin
8 vuotta sitten
vmikes
d5ba24b4af
if storage.swap is defined
8 vuotta sitten
vmikes
b6fe1ab532
never too many
8 vuotta sitten
Filip Pytloun
25c9de7ced
Revert "Don't check swap if not present"
This reverts commit 6edb3a7a68
.
8 vuotta sitten
Filip Pytloun
6edb3a7a68
Don't check swap if not present
8 vuotta sitten
Adam Tengler
e0ff433cf0
Description added to sphinx doc
9 vuotta sitten
Ales Komarek
252d5f20b0
rewrite rules definition
9 vuotta sitten
Ales Komarek
825c92716d
collectd load check
9 vuotta sitten
Ales Komarek
6080b8c87f
No process check
9 vuotta sitten