-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Deployment Issues] docling-serve on AWS g5.xlarge Instance #54
Comments
Do you know which CUDA driver version is installed on your machine? The image should be using CUDA 12.4. |
@dolfim-ibm , r5.xlarge using CUDA 12.6. nvidia-smi +-----------------------------------------------------------------------------------------+ |
I have updated my docker command by: Still get errors as above. Based the research I got: huggingface/transformers#35349 |
Down torch version solved the issues: torch==2.5.1 torchvision==0.20.1 |
Description
I'm encountering errors while deploying
docling-serve
on an AWSg5.xlarge
instance. Below are the details of my setup:docling-serve
Image:docker pull ghcr.io/ds4sd/docling-serve:sha256-e797326e42984edac7f6640fba316dd908a7cfd23265e0d43616be6b6448e55e
g5.xlarge
Deep Learning OSS Nvidia Driver AMI GPU PyTorch 2.5.1 (Amazon Linux 2023) 20250216
docker run --gpus all --shm-size=8g -e CUDA_HOME=/usr/local/cuda -e PATH='/opt/app-root/bin:/opt/app-root/bin:/opt/app-root/src/.local/bin/:/opt/app-root/src/bin:/opt/app-root/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin' -e TORCH_CUDA_ARCH_LIST='8.0 8.6 8.9+PTX' -it -p 5001:5001 docling:test
Could you provide guidance on the recommended hardware requirements for deploying
docling-serve
? Any insights on troubleshooting this issue would also be greatly appreciated.Thanks!
The text was updated successfully, but these errors were encountered: