I moved a VM to another host and copied the /var/lib/netdata folder so that it would retain the cloud configuration. The agent is running, but it can’t connect to the cloud, and the error log shows:
2021-06-21 07:00:46: netdata INFO : ACLK_Main : Attempting connection now
2021-06-21 07:00:47: netdata INFO : ACLK_Main : HTTPS "GET" request to "app.netdata.cloud" finished with HTTP code: 200
2021-06-21 07:00:47: netdata INFO : ACLK_Main : Getting Cloud /env successful
2021-06-21 07:00:47: netdata INFO : ACLK_Main : HTTPS "GET" request to "app.netdata.cloud" finished with HTTP code: 200
2021-06-21 07:00:47: netdata INFO : ACLK_Main : ACLK_OTP Got Challenge from Cloud
2021-06-21 07:00:47: netdata ERROR : ACLK_Main : Decryption of the challenge failed: error:04099079:rsa routines:RSA_padding_check_PKCS1_OAEP_mgf1:oaep decoding error
2021-06-21 07:00:47: netdata ERROR : ACLK_Main : Output buffer for encoding size=512 is not large enough for 18446744073709551615-bytes input
2021-06-21 07:00:47: netdata INFO : ACLK_Main : HTTPS "POST" request to "app.netdata.cloud" finished with HTTP code: 403
2021-06-21 07:00:47: netdata ERROR : ACLK_Main : ACLK_OTP Password HTTP code not 201 Created (got 403)
2021-06-21 07:00:47: netdata ERROR : ACLK_Main : Cloud returned EC="TODO trace-id", Msg-Key:"ErrIncorrectResponse", Msg:"incorrect challenge response", BlockRetry:false, Backoff:0s (-1 unset by cloud)
2021-06-21 07:00:47: netdata ERROR : ACLK_Main : Error passing Challenge/Response to get OTP
2021-06-21 07:00:47: netdata INFO : ACLK_Main : Wait before attempting to reconnect in 0.000 seconds
2021-06-21 07:00:47: netdata INFO : ACLK_Main : Attempting connection now
2021-06-21 07:00:48: netdata INFO : ACLK_Main : HTTPS "GET" request to "app.netdata.cloud" finished with HTTP code: 200
2021-06-21 07:00:48: netdata INFO : ACLK_Main : Getting Cloud /env successful
2021-06-21 07:00:48: netdata INFO : ACLK_Main : HTTPS "GET" request to "app.netdata.cloud" finished with HTTP code: 409
2021-06-21 07:00:48: netdata ERROR : ACLK_Main : ACLK_OTP Challenge HTTP code not 200 OK (got 409)
2021-06-21 07:00:48: netdata ERROR : ACLK_Main : Cloud returned EC="TODO trace-id", Msg-Key:"ErrDuplicatedChallenge", Msg:"delay retry 1m0s: duplicated challenge", BlockRetry:false, Backoff:60s (-1 unset by cloud)
2021-06-21 07:00:48: netdata ERROR : ACLK_Main : Error passing Challenge/Response to get OTP
2021-06-21 07:00:48: netdata INFO : ACLK_Main : Wait before attempting to reconnect in 60.000 seconds
In particular, the line "ACLK_Main : Output buffer for encoding size=512 is not large enough for 18446744073709551615-bytes input" seems concerning. Is it?
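For what it's worth, the size in that message is exactly 2^64 - 1, which is what -1 looks like when it is read as an unsigned 64-bit integer. One plausible reading (an assumption, not something confirmed from the Netdata source) is that the failed decryption returned -1 and that value was then passed along as a length, which would make this line a follow-on of the decryption error rather than a separate problem. A minimal Python sketch of the arithmetic, where `as_size_t` is a hypothetical helper and a 64-bit `size_t` is assumed:

```python
# Assumption: size_t is 64 bits wide, as on a typical 64-bit Linux host.
SIZE_T_BITS = 64


def as_size_t(value: int, bits: int = SIZE_T_BITS) -> int:
    """Wrap `value` into the range of an unsigned `bits`-bit integer,
    mimicking what a C cast to an unsigned type of that width does."""
    return value & ((1 << bits) - 1)


# Value copied from the "Output buffer for encoding" log line.
logged_size = 18446744073709551615

# Both comparisons hold: the logged size is -1 reinterpreted as unsigned.
print(as_size_t(-1) == logged_size)
print(logged_size == 2**64 - 1)
```

If that reading is right, the line to focus on is the OAEP decoding error just before it.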
I also don’t have the old metrics any more. Can I copy those from somewhere? I still have the old image available.
@wmertens I will try the Nix agent. Is the agent you moved from the same version? At first sight, the error log you posted does not seem directly related to the transfer of the credentials.
I seem to have issues using netdata from nix-shell, ranging from claiming to netdata not starting because of missing files, etc. I guess I will have to study the thing a bit first.
Oh right, the configuration actually comes from the “module” which is NixOS-only, not nixpkgs (which you are using now). So the binary is the same but it’s missing the config file and directory configuration.