Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ibmcloud: slow VM startup #2021

Closed
squarti opened this issue Aug 29, 2024 · 0 comments · Fixed by #2022
Closed

ibmcloud: slow VM startup #2021

squarti opened this issue Aug 29, 2024 · 0 comments · Fixed by #2022
Labels
bug Something isn't working

Comments

@squarti
Copy link
Contributor

squarti commented Aug 29, 2024

Describe the bug

The process-user-data service in peer pod VM makes requests to the VM metadata service to determine the VM flavor (AWS, Azure, etc) (see code). These requests are timing out in ibmcloud VMs (see logs below). Each request adds 30 seconds to the start up time.

root@podvm-nginx-794753b9:~# journalctl -u process-user-data
-- Logs begin at Wed 2024-08-28 17:43:09 UTC, end at Wed 2024-08-28 18:36:43 UTC. --
Aug 28 17:43:16 podvm-nginx-794753b9 systemd[1]: Starting Process user data...
Aug 28 17:43:47 podvm-nginx-794753b9 process-user-data[929]: failed to send request: Get "http://169.254.169.254/metadata/instance/compute?api-version=2021-01-01": dial tcp 169.254.169.254:80: i/o>
Aug 28 17:44:17 podvm-nginx-794753b9 process-user-data[929]: failed to send request: Get "http://169.254.169.254/latest/dynamic/instance-identity/document": dial tcp 169.254.169.254:80: i/o timeout
Aug 28 17:44:17 podvm-nginx-794753b9 process-user-data[929]: failed to send request: Get "http://metadata.google.internal/computeMetadata/v1/instance": dial tcp: lookup metadata.google.internal on>
Aug 28 17:44:17 podvm-nginx-794753b9 process-user-data[929]: 2024/08/28 17:44:17 [userdata/provision] unsupported user data provider, we extract and calculate initdata hash only.
Aug 28 17:44:17 podvm-nginx-794753b9 process-user-data[929]: 2024/08/28 17:44:17 [userdata/provision] File /run/peerpod/initdata not found, skipped initdata processing.
Aug 28 17:44:17 podvm-nginx-794753b9 systemd[1]: Finished Process user data.
root@podvm-nginx-794753b9:~# journalctl -u agent-protocol-forwarder
-- Logs begin at Wed 2024-08-28 17:43:09 UTC, end at Wed 2024-08-28 18:37:49 UTC. --
Aug 28 17:44:17 podvm-nginx-794753b9 systemd[1]: Starting Agent Protocol Forwarder...
Aug 28 17:44:18 podvm-nginx-794753b9 agent-protocol-forwarder[1185]: agent-protocol-forwarder version unknown
Aug 28 17:44:18 podvm-nginx-794753b9 agent-protocol-forwarder[1185]:   commit: 76e56780b519afbbf5944c72622fd1289103fc31-dirty
Aug 28 17:44:18 podvm-nginx-794753b9 agent-protocol-forwarder[1185]:   go: go1.21.12
Aug 28 17:44:18 podvm-nginx-794753b9 agent-protocol-forwarder[1185]: 2024/08/28 17:44:18 [forwarder] Starting agent-protocol-forwarder listener on address 0.0.0.0:15150
Aug 28 17:44:18 podvm-nginx-794753b9 agent-protocol-forwarder[1185]: 2024/08/28 17:44:18 [forwarder] TLS is configured. Configure TLS listener
Aug 28 17:44:18 podvm-nginx-794753b9 systemd[1]: Started Agent Protocol Forwarder.

How to reproduce

Run demo

CoCo version information

main

What TEE are you seeing the problem on

None

Failing command and relevant log output

No response

@squarti squarti added the bug Something isn't working label Aug 29, 2024
squarti added a commit to squarti/cloud-api-adaptor that referenced this issue Aug 29, 2024
This PR enables the VM metadata service to avoid the timeouts
when starting up.

Fixes: confidential-containers#2021

Signed-off-by: Silenio Quarti <silenio_quarti@ca.ibm.com>
squarti added a commit to squarti/cloud-api-adaptor that referenced this issue Aug 29, 2024
This PR enables the VM metadata service to avoid the timeouts
when starting up.

Fixes: confidential-containers#2021

Signed-off-by: Silenio Quarti <silenio_quarti@ca.ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant