-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Split caikit-tgis into separate containers #107
Conversation
31b65e4
to
5b53ed0
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Left a few small ones on the Dockerfile and the ServingRuntime, but overall looking great!
Overall, PR is looking great, great job! |
5b53ed0
to
1124fd3
Compare
Added a few more notes to my comments (message here for timestamp tracking) |
- create caikit image using UBI as base - use poetry to lock deps instead of pipenv
1124fd3
to
b69f9f4
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great work!
/hold |
Updated the images/tags names as discussed with @Xaenalt |
/unhold |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: dtrifiro, Xaenalt The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
do you guys know in which version of RHOAI this will land? |
1.36 is the official release, you can use it ahead of time from the ODH repo |
@Xaenalt , actually, it's this image that I'm using at the moment: |
yep |
@@ -9,7 +9,10 @@ metadata: | |||
spec: | |||
predictor: | |||
model: | |||
apiVersion: serving.kserve.io/v1alpha2 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we want to align this with the current KServe version, so I think we'd want this to be v1beta1
Description
Dockerfile
forcaikit-nlp
withtgis
backendInferenceService
+ServingRuntime
)How Has This Been Tested?
This can be tested by running the provided example in either a local kserve deployment or openshift:
This example uses a pvc/pvc, but the model can be also stored in s3 or other object storage
Download a model and convert it to
caikit
format (note: this requiresgit-lfs
to be installed and configuredCopy the model to the pv
Edit
docs/caikit-isvc.yaml
and makestorageUri
point to the model:storageUri: pvc://caikit-claim/flan-t5-small-caikit/
3 Apply the manifests,
Verify that the inference is running on the tgis instance by looking at the logs of the transformer container:
In a subsequent PR, I'll provide an automatic method of testing this setup, either via docker-compose and/or by deploying these manifests.
Merge criteria: