Netdata Community

hdfs_num_failed_volumes

hdfs_num_failed_volumes

Storage | HDFS

This is an alert about the Hadoop Distributed File System (HDFS). The Netdata Agent monitors the number of failed volumes. This alert indicates that there are failed volumes on some DataNodes. It may indicate a hardware failure or misconfiguration, e.g. duplicate mounts. By default, a single volume failing on a DataNode will cause the entire node to go offline. The NameNode must copy any under-replicated blocks that were lost on that node, causing a burst in network traffic and potential performance degradation.