How to compute a rate while rejecting jumps

jnavila · November 17, 2023, 3:49pm

Problem/Question

I would like to monitor the fill rate of a disk/partition and raise an alert when it exceeds a limit.

I have modified the default rule for disk fill rate this way:

template: disk_fill_rate_alarm
      on: disk.space
      os: linux freebsd
   hosts: *
families: /var
  lookup: min -10m at -60m unaligned of avail
    calc: ($this - $avail) / (($now - $after) / 3600)
   every: 1m
   units: GB/hour
    warn: $this > (($status >= $WARNING)  ? (40e-3): (50e-3))
    info: average rate the disk fills up (positive) for the last hour

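The problem is that whenever the disk has filled up a bit and an administrator then removes some log files, the alarm does not fire, because the current available space is only compared with its value one hour ago: the space freed by the cleanup hides the filling that happened in between.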

What I would like is a way to compute the rate by summing the differences in avail between consecutive values while rejecting big negative jumps in the fill (space suddenly freed), so that a cleanup does not cancel out the filling accumulated over the window.
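To make the idea concrete, here is a minimal Python sketch of the computation I have in mind. It is not Netdata code and not an existing grouping method; the function name `fill_rate_rejecting_jumps` and the parameters `interval_s` and `max_free_jump_gb` are hypothetical names chosen for illustration, and it assumes evenly spaced samples of avail in GB.

```python
from typing import Sequence


def fill_rate_rejecting_jumps(
    avail_gb: Sequence[float],
    interval_s: float,
    max_free_jump_gb: float,
) -> float:
    """Return the fill rate in GB/hour, ignoring sudden large frees.

    avail_gb: available space samples, oldest first (hypothetical input).
    interval_s: seconds between consecutive samples (assumed constant).
    max_free_jump_gb: a step that frees more than this is treated as a
        cleanup and rejected (hypothetical threshold).
    """
    if len(avail_gb) < 2:
        return 0.0

    filled = 0.0
    for prev, cur in zip(avail_gb, avail_gb[1:]):
        delta = prev - cur  # positive when the disk fills up
        if delta < -max_free_jump_gb:
            continue  # big negative jump: space was freed at once, skip it
        filled += delta

    # The rejected steps still count as elapsed time in the window.
    hours = (len(avail_gb) - 1) * interval_s / 3600
    return filled / hours


# Samples every 15 minutes; 5 GB of logs are removed before the 4th sample.
samples = [100.0, 99.99, 99.98, 104.98, 104.97]
rate = fill_rate_rejecting_jumps(samples, interval_s=900, max_free_jump_gb=1.0)
naive = (samples[0] - samples[-1]) / 1.0  # first vs. last over one hour
```

With these samples the first-versus-last comparison gives a negative rate (about -4.97 GB/hour), so the alarm stays silent, while the summed differences give roughly 0.03 GB/hour of filling. Keeping the rejected step in the elapsed time is a choice I made for the sketch; dropping it from the denominator would be an equally plausible variant.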

Relevant docs you followed/actions you took to solve the issue

The grouping methods described in Database queries/lookup | Learn Netdata do not allow the specific processing I’m looking for. There seems to be no way to define a custom grouping method.

Environment/Browser/Agent’s version etc

netdata_1.33.1 on Ubuntu

What I expected to happen

A built-in “rate” grouping method with optional rejection parameters, or a way to define a custom grouping method.
