cgroup-name.sh isn't called to get container names

data-smith · August 8, 2023, 8:15pm

In the Netdata logs I only see cgroup-name.sh in the logs once: INFO: cgroup ‘systemd’ is called ‘systemd’, labels ‘’

On another node I see all kinds of logs starting with cgroup-name.sh and then the pod name.

Any ideas on why this is skipped on some nodes? Is there some place I can look on filesystem to see why Netdata skips this step?

ilyam8 · August 9, 2023, 12:49pm

Hey, it is probably due to log flood protection. You can disable it in netdata.conf by setting errors flood protection period to 0.

Netdata executes cgroup-name.sh for all cgroups that match the following pattern (can be found in netdata.conf too):

enable by default cgroups matching
run script to rename cgroups matching

data-smith · August 9, 2023, 1:32pm

I don’t believe it’s a logging problem. It seems that it’s not getting pod metrics. The pod metrics aren’t appearing in the dashboard.
It’d be nice if there was a place i could look on the node filesystem to see why netdata isn’t picking up the data. i looked in cgroup-name.sh but i don’t know the code well enough to figure out why netdata seems to be skipping this step.
What’s weird is it works fine on other nodes.

ilyam8 · August 9, 2023, 2:07pm

I don’t believe it’s a logging problem.

Then you just check: check if you have a line withFLOOD in logs, e.g.

2023-08-09 14:01:43: netdata LOG FLOOD PROTECTION too many logs (201 logs in 13 seconds, threshold is set to 200 logs in 1200 seconds). Preventing more logs from process ‘netdata’ for 1187 seconds

Also, I suggest you provide details about your setup, it helps to debug issues. Based on provided info my only suggestion is to check logs:

disable flood protection
grep all lines that have cgroup word

data-smith · August 9, 2023, 3:25pm

Ok i disabled flood protection and now i don’t even see calls to cgroup-name.sh. The dash also just shows node metrics without any pod metrics. Unfortunately I can’t bring files down from my private network. It looks like it calls k8s api to get the pods and then looks in /sys/fs/cgroup. For some reason some nodes don’t process the pods to get metrics while other nodes that are almost similar do. I’d be nice if there was a log line that said ‘calls to /api/pods found N pods’

data-smith · August 9, 2023, 10:00pm

It would be really great if it logged ‘hey i tried to get pods by calling this url and this many came back’ on initial startup.

I got the netdata-debug container and and set debug flags = 0x0000000000100000 but i don’t see anything new.

Topic		Replies	Views
where does the netdata cgroup plugin get the list of cgroup names Help	4	574	July 22, 2022
Flaky cgroups graphs and hostname on Balena Help agent-collector , agent	4	643	January 28, 2022
LXC Containers Stats Are Not Shown Help	72	2951	September 28, 2023
Need some help with cgroups configuration for qemu VMs Help agent-collector , agent	25	4159	April 14, 2021
No metrics for docker containers Help agent	6	1663	October 2, 2020

cgroup-name.sh isn't called to get container names

Related topics