I’m experiencing missed notifications, I believe, on at least one of our nodes.
How do I debug and make sure that both local and cloud notifications are being sent?
For example, local and cloud notifications are sometimes inconsistent.
I find that reloading the health config seems to send the missed emails:
sudo netdatacli reload-health
Your help is kindly appreciated. Thanks
Edit 1
Out of the blue, some missing notifications began to appear overnight. Odd.
Edit 2
I’ve just discovered some scripts in this directory:
/usr/libexec/netdata/plugins.d
I will check it out and see what I can find.
Edit 3
Q: If I manually trigger a notification (e.g. by filling a disk to over 90%), which log file will show that netdata has picked up the event and is sending a notification to the local node, and also sending the event to the cloud instance?
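A less invasive way to exercise the local notification path than filling a disk might be the `test` mode of `alarm-notify.sh`, one of the scripts in the directory from Edit 2. The sketch below is only that: it assumes the script lives at the default path and accepts `test` plus an optional role, as the Netdata notification docs describe. The function name `send_test_notification` is made up for illustration, and this does not exercise cloud notifications at all.

```shell
#!/bin/sh
# Hypothetical helper: send a test notification through alarm-notify.sh.
# Assumptions: default script path, and that "test [ROLE]" is supported
# by the installed netdata version.
send_test_notification() {
    script="${1:-/usr/libexec/netdata/plugins.d/alarm-notify.sh}"
    role="${2:-sysadmin}"

    # Bail out early if the script is missing or not executable.
    if [ ! -x "$script" ]; then
        echo "alarm-notify.sh not found or not executable: $script" >&2
        return 1
    fi

    # NETDATA_ALARM_NOTIFY_DEBUG=1 asks the script to print debug output
    # to the console while it sends the test alerts.
    NETDATA_ALARM_NOTIFY_DEBUG=1 "$script" test "$role"
}
```

The script is normally meant to be run as the `netdata` user (for example from a shell started with `sudo su -s /bin/bash netdata`), so that it sees the same configuration and permissions as the agent.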
Edit 4
I’ve just found /var/log/netdata/health.log
which appears to be what I’m looking for.
But the last entry in that file was from last year: 2023-11-23 00:17:34.
Q: Why is my health log file not being populated?
Edit 5
I’m finding that most of my netdata logs are not being written to, even though alerts are firing:
-rw-r--r-- 1 netdata netdata 0 Nov 24 2023 health.log
-rw-r--r-- 1 netdata netdata 0 Nov 24 2023 error.log
-rw-r--r-- 1 netdata netdata 0 Nov 24 2023 collector.log
-rw-rw---- 1 netdata netdata 0 Jul 13 07:22 daemon.log
-rw-rw---- 1 netdata netdata 0 Jul 13 07:22 collectors.log
-rw-r--r-- 1 netdata netdata 434 Jul 24 19:58 access.log.14.gz
...
-rw-r--r-- 1 netdata netdata 2156554 Aug 5 06:31 aclk.log.1
-rw-r--r-- 1 netdata netdata 100641 Aug 5 23:52 access.log.2.gz
-rw-r--r-- 1 netdata netdata 0 Aug 6 00:00 aclk.log
-rw-r--r-- 1 netdata netdata 185422 Aug 6 23:51 access.log.1
-rw-r--r-- 1 netdata netdata 252 Aug 7 00:17 access.log
Q: Is it strange that many of the logs here are empty?
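To make the listing above easier to reproduce, here is a minimal sketch that prints only the zero-byte `*.log` files in a directory. The function name `list_empty_logs` is hypothetical, and the default path is simply the one from this post.

```shell
#!/bin/sh
# Hypothetical helper: list zero-byte *.log files in a directory.
# Defaults to /var/log/netdata, the log directory shown in this post.
list_empty_logs() {
    dir="${1:-/var/log/netdata}"

    # -maxdepth 1 : do not descend into subdirectories
    # -type f     : regular files only
    # -size 0     : files with no content
    find "$dir" -maxdepth 1 -type f -name '*.log' -size 0 | sort
}
```

Calling `list_empty_logs /var/log/netdata` as a user who can read that directory should print one path per empty log. It only reports which files are empty, not why nothing is being written to them.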
Environment
- netdata v1.46.3
- Debian 11 (Bullseye)
- Default configurations