Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AADSSHLoginForLinux extension fails to install #1834

Open
marchesir opened this issue Nov 1, 2023 · 2 comments
Open

AADSSHLoginForLinux extension fails to install #1834

marchesir opened this issue Nov 1, 2023 · 2 comments

Comments

@marchesir
Copy link

AADSSHLoginForLinux version: 1.0.2385.1
OS: Ubuntu 20.04LTS

hi,

we have a vmss managed by terraform and it install 3 extensions:
AADSSHLoginForLinux
CustomScript
ApplicationHealthLinux

on provisioning of VMs AADSSHLoginForLinux fails with below error, this only started since Friday.

Multiple VM extensions failed to be provisioned on the VM. Please see the VM extension instance view for other failures. The first extension failed due to the error: VM 'xxxx_4' has not reported status for VM agent or extensions. Verify that the OS is up and healthy, the VM has a running VM agent, and that it can establish outbound connections to Azure storage. Please refer to https://aka.ms/vmextensionlinuxtroubleshoot for additional VM agent troubleshooting information.

  1. why is this trying to reach out to storage?
  2. the storage error has been there for long time and never stopped install until now:
    2023-11-01T09:47:11.292363Z INFO ExtHandler ExtHandler Downloading artifacts profile blob
    2023-11-01T09:47:42.420549Z WARNING ExtHandler ExtHandler Fetch failed: [HttpError] [HTTP Failed] GET https://md-hdd-d12hgqdtfwl4.z34.blob.storage.azure.net/$system/ghec-dev-runners-vmsstests_2.7bb722e2-74be-462b-ab64-dd83a1e36506.vmSettings -- IOError EOF occurred in violation of protocol (_ssl.c:1131) -- 6 attempts made
    202
  3. is there a way to pin a specific version of the extension or can an offline install in the base image be done?
  4. if an offline install is possible how do we force the vm to not install latest on startup?

regards

@yanchoyanev
Copy link

yanchoyanev commented Nov 30, 2023

The failure is caused by the fact that the installation starts at the time when HTTP stack is not quite ready. This is happening only on VMSS, not regular VMs. The VMSS and waagent teams are actively looking for the root cause and a possible fix. As a workaround, change the extension order if you have multiple extensions, or insert a script extension that just waits for a couple of seconds.
Storage access - this is not coming from the extension; probably from the agent. It seems the same HTTP problem.
There is no point to install older extension version because the problem is not in the extension itself but rather the time it is installed.

@marchesir
Copy link
Author

marchesir commented Dec 1, 2023

great thanks for update. I actually fixed the issue as you suggested forcing the aadssh extension to run last.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants