Skip to content
This repository has been archived by the owner on Jul 15, 2024. It is now read-only.

Wait for model directory to exist when "lazy_load_local_models" is enabled #21

Open
rhuss opened this issue Jan 23, 2024 · 0 comments
Open

Comments

@rhuss
Copy link

rhuss commented Jan 23, 2024

Is your feature request related to a problem? Please describe.

With the new modelcar feature of KServe it is possible to access model data directly from within an OCI image without downloading or copying. However, since this model is injected as a sidecar, the startup order of the containers in non-deterministic.

Since the modelcar container creates the symbolic link /mnt/models to point to the model stored within that image, the path /mnt/models might not exist when the caikit transformer container starts. This will trigger line

if self._local_models_dir and not os.path.exists(self._local_models_dir):
so that the startup fails.

Describe the solution you'd like

When lazy loading is enabled via lazy_load_local_models, the runtime should wait a certain amount of time for the path (/mnt/models) to come up before giving up. This can be tricky since when the model still needs to be loaded in the node's OCI runtime, it gets pulled from the registry, which might take quite some time. But if it is already loaded, the model can be delayed in a matter of seconds.

Additional context

More about the modelcar approach can be found here:

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant