Unrecognized configuration class to build an AutoTokenizer for microsoft/Florence-2-base-ft #7726

Closed
shihao28 opened this issue Oct 22, 2024 · 1 comment

Comments

@shihao28

Description
I was trying to host https://huggingface.co/microsoft/Florence-2-base-ft using Triton's Python-based vLLM backend and encountered the following error: Unrecognized configuration class <class 'transformers_modules.microsoft.Florence-2-base.ee1f1f163f352801f3b7af6b2b96e4baaa6ff2ff.configuration_florence2.Florence2Config'> to build an AutoTokenizer.

Triton Information
Image: nvcr.io/nvidia/tritonserver:24.09-vllm-python-py3
I pulled the image from the NVIDIA image repository and used it as is.

To Reproduce

  1. Set up a model repo directory: ~/work/model_repository/florence-2-base-ft/1
  2. Downloaded model.json and config.pbtxt as suggested in the tutorial

model.json:
{
    "model":"microsoft/Florence-2-base-ft",
    "disable_log_requests": "true",
    "gpu_memory_utilization": 0.5,
    "enforce_eager": "true",
    "trust_remote_code": "true"
}

config.pbtxt:
backend: "vllm"

instance_group [
  {
    count: 1
    kind: KIND_MODEL
  }
]
  3. Run Triton Inference Server
  • cd ~/work
  • docker run --gpus all -it --net=host --rm -p 8001:8001 --shm-size=1G --ulimit memlock=-1 --ulimit stack=67108864 -v ./:/models -w /work nvcr.io/nvidia/tritonserver:24.09-vllm-python-py3 tritonserver --model-store /models
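
The repository setup from steps 1 and 2 can be sketched in Python. This is a minimal sketch, assuming the usual Triton layout in which config.pbtxt sits in the model directory and model.json sits in the version directory 1. The helper name build_model_repository is hypothetical, and the file contents are copied from the report above.

```python
import json
import tempfile
from pathlib import Path

# Contents copied from the model.json shown in this report.
MODEL_JSON = {
    "model": "microsoft/Florence-2-base-ft",
    "disable_log_requests": "true",
    "gpu_memory_utilization": 0.5,
    "enforce_eager": "true",
    "trust_remote_code": "true",
}

# Contents copied from the config.pbtxt shown in this report.
CONFIG_PBTXT = """backend: "vllm"

instance_group [
  {
    count: 1
    kind: KIND_MODEL
  }
]
"""


def build_model_repository(root: Path, model_name: str = "florence-2-base-ft") -> Path:
    """Create <root>/<model_name>/config.pbtxt and <root>/<model_name>/1/model.json.

    Assumption: this placement follows the common Triton vLLM backend layout;
    the report itself does not state where each file was placed.
    """
    model_dir = root / model_name
    version_dir = model_dir / "1"
    version_dir.mkdir(parents=True, exist_ok=True)
    (model_dir / "config.pbtxt").write_text(CONFIG_PBTXT)
    (version_dir / "model.json").write_text(json.dumps(MODEL_JSON, indent=4))
    return model_dir


if __name__ == "__main__":
    # Build the layout in a throwaway directory and list what was created.
    with tempfile.TemporaryDirectory() as tmp:
        model_dir = build_model_repository(Path(tmp) / "model_repository")
        for path in sorted(model_dir.rglob("*")):
            print(path.relative_to(model_dir.parent))
```

Whatever directory is mounted into the container, the path passed to --model-store should be the directory that directly contains florence-2-base-ft.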

Error
(Screenshot of the error message; the text of the error is quoted in the Description above.)

@rmccorm4
Contributor

Hi @shihao28, this looks like a lack of model support in vLLM itself. Please see this issue: vllm-project/vllm#5934.

Feel free to re-open if this is incorrect.
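
One way to check for this kind of gap before deploying is to compare the architectures listed in the model's Hugging Face config.json against the architectures the installed vLLM reports as supported. The following is a rough sketch, not a definitive check: the helper names are hypothetical, and it assumes the installed vLLM exposes ModelRegistry.get_supported_archs(), which may differ between versions.

```python
import json
import sys
from pathlib import Path
from typing import Iterable, List


def unsupported_architectures(config: dict, supported: Iterable[str]) -> List[str]:
    """Return the entries of config["architectures"] that are not in `supported`."""
    supported_set = set(supported)
    return [arch for arch in config.get("architectures", []) if arch not in supported_set]


def vllm_supported_architectures() -> List[str]:
    """Ask the installed vLLM which architectures it knows about.

    Assumption: vllm.ModelRegistry.get_supported_archs() is available in the
    installed vLLM version. The import is deferred so the helper above can be
    used without vLLM installed.
    """
    from vllm import ModelRegistry

    return list(ModelRegistry.get_supported_archs())


if __name__ == "__main__":
    if len(sys.argv) != 2:
        print("usage: python check_arch.py /path/to/config.json")
    else:
        config = json.loads(Path(sys.argv[1]).read_text())
        missing = unsupported_architectures(config, vllm_supported_architectures())
        if missing:
            print("Not supported by this vLLM build:", ", ".join(missing))
        else:
            print("All listed architectures are supported.")
```

Running this inside the same container image as the server would reflect the vLLM version that Triton actually uses.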
