Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Crucial logs not visible on s3-csi-node or systemd level #407

Open
spynode opened this issue Mar 7, 2025 · 2 comments
Open

Crucial logs not visible on s3-csi-node or systemd level #407

spynode opened this issue Mar 7, 2025 · 2 comments

Comments

@spynode
Copy link

spynode commented Mar 7, 2025

/kind bug
I was trying to diagnose an issue with mountpoint-s3-csi-driver in one of our clusters and got following logs from driver components.

Event for pod, which uses S3 PV:
Unable to attach or mount volumes: unmounted volumes=[some-volume], unattached volumes=[], failed to process volumes=[]: timed out waiting for the condition
s3-csi-node pod logs on relevant node:
E0307 10:10:12.090636 1 driver.go:130] GRPC error: rpc error: code = Internal desc = Could not mount "preprod-mountpoint-s3-csi-bucket-some-app" at "/var/lib/kubelet/pods/c822f696-e0f1-4927-8c44-68d2c7b3e4a2/volumes/kubernetes.io~csi/some-volume/mount": Mount failed: Failed to start systemd unit, context cancelled output: Error: Timeout after 30 seconds while waiting for mount process to be ready

logs from relevant node:
Mar 07 10:12:44 some-node mount-s3[3581292]: [ERROR] mountpoint_s3::cli: timeout after 30 seconds waiting for message from child process

Only when I launched mount-s3 manually with -f argument, I was able to understand what is the issue:

2025-03-07T10:16:22.506270Z ERROR awscrt::channel-bootstrap: id=0x55ad48244550: Connection failed with error_code 1048.    
2025-03-07T10:16:22.506280Z ERROR awscrt::http-connection: static: Client connection failed with error 1048 (AWS_IO_SOCKET_TIMEOUT).    
2025-03-07T10:16:22.506290Z  WARN awscrt::connection-manager: id=0x55ad48496080: Failed to obtain new connection from http layer, error 1048(socket operation timed out.)    

Network connectivity issues probably should be visible in csi driver pod logs or in systemd status or system logs.

@unexge
Copy link
Contributor

unexge commented Mar 7, 2025

Thanks for the report @spynode! As part of #279, we plan to run Mountpoint process in a Pod, which will redirect it logs to Kubernetes. Then you would be able to do kubectl logs -n mounts3 mp-... to get logs from Mountpoint.

@unexge
Copy link
Contributor

unexge commented Mar 13, 2025

Created this on Mountpoint side to enable error logs by default awslabs/mountpoint-s3#1316

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants