I have a Synology DS920+ 4-bay NAS, populated with 3 HDDs and configured using SHR1 (Synology’s Hybrid RAID with 1 disk redundancy).
netstat is running in a docker container and is raising a critical alert as shown in the attached screenshot. It seems to be treating the empty bay as ‘Down’.
It’s really about getting mdstat to not be showing a missing/failed disk, not about doing something in Netdata.
A shot in the dark here, because I have no previous experience on this and just looked up a couple of things online: It looks like you can remove a disk from an array (see section 3). I can’t figure out how you could even know how to remove something that doesn’t exist/isn’t listed, but it sounds like these disks have predictable identifiers, so in your case it would either be called sata0p1 or sata4p1?
template: mdstat_disks
on: md.disks
class: Errors
type: System
component: RAID
charts: !*md0* *
units: failed devices
every: 10s
calc: $down
crit: $this > 0
info: number of devices in the down state for the $family array. \
Any number > 0 indicates that the array is degraded.
to: sysadmin
template: mdstat_disks_md0
on: md.disks
class: Errors
type: System
component: RAID
charts: *md0* !*
units: failed devices
every: 10s
calc: $down
crit: $this > 1
info: number of devices in the down state for the $family array. \
Any number > 0 indicates that the array is degraded.
to: sysadmin
Thanks for the suggestions - because it’s using SHR-1 rather than pure RAID, I don’t want to mess around with mdadm unless there is a specific Synology guide (which I haven’t been able to find).
I’ll have a play with adjusting the alert trigger level as suggested.
I asked on Reddit on your behalf @pk1966 and some of the replies are very scary. A couple of people are saying that you might lose data if you leave it like this. Again, I personally don’t know enough about this to judge, but maybe check the replies out here
I got confirmation from Synology that it is OK. md0 and md1 are system partitions which is mirrored across all available disks - Synology support said “this is not a cause for concern” and included an example of a singe disk NAS which shows [4/1]
Hi all, sorry for raising an old topic, but regarding having critical alerts from mystery raid device. I have just encountered something similar with QNAP NAS. In my case, these are either swap space and/or something else managed by QTS OS.