Single Model Deployment (KServe) #54

Open
danielezonca opened this issue Jun 9, 2023 · 0 comments

Currently, the model serving capability in ODH is based on ModelMesh, which is designed for high-scale, high-density, and frequently changing model use cases.
There are other scenarios where users might prefer single model deployment and benefit from other features such as scale-to-zero, revision management, tracing, etc.
The KServe model serving runtime can address such scenarios.
Single model deployment is also the better candidate for serving LLM runtimes.

This component depends on Knative and Istio.

This is a tracker for bringing KServe into ODH as a Tier 1 component.

Requirements

  • Batching
  • Revision Management
  • Scale-to-zero
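
To make the three requirements above concrete, here is a minimal sketch of how they might map onto a KServe `InferenceService` manifest, built as a plain Python dict. The field names (`minReplicas`, `canaryTrafficPercent`, `batcher`) reflect my understanding of the KServe v1beta1 API and should be checked against the version ODH adopts. The helper `build_inference_service`, the model name, the storage URI, and all numeric values are hypothetical placeholders, not part of this proposal.

```python
import json


def build_inference_service(name, storage_uri, model_format="sklearn"):
    """Return an InferenceService manifest as a plain dict.

    Hypothetical helper for illustration only; values are placeholders.
    """
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "predictor": {
                # Scale-to-zero: allow the predictor to scale down to no replicas.
                "minReplicas": 0,
                # Revision management: send a share of traffic to the newest revision.
                "canaryTrafficPercent": 10,
                # Batching: group incoming requests before they reach the model.
                "batcher": {"maxBatchSize": 32, "maxLatency": 500},
                "model": {
                    "modelFormat": {"name": model_format},
                    "storageUri": storage_uri,
                },
            }
        },
    }


# Placeholder name and storage location, assumed for the example.
manifest = build_inference_service("example-model", "s3://example-bucket/model")
print(json.dumps(manifest, indent=2))
```

The printed JSON could be applied to a cluster with the usual Kubernetes tooling, assuming KServe, Knative, and Istio are already installed.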

Individual Efforts

UX: (add link)
Lead: TBD
UI: (add link)
Lead: TBD
Dev: (add link)
Lead: TBD
QE: (add link)
Lead: TBD
Doc: (add link)
Lead: TBD
