
Incorporate runtime into model configuration #285

Merged 17 commits into main on Jan 11, 2024

Conversation

@kthui (Contributor) commented Nov 3, 2023

Related PRs:

Previously, the runtime for each backend was determined automatically when the backend was loaded. This PR adds the ability for users to choose which runtime to use for each model via the model configuration. For example:

runtime: "libtriton_tensorrt.so"
runtime: "model.py"
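For context, a hedged sketch of how the new field might appear inside a complete config.pbtxt; the model name and other field values below are illustrative, and only "runtime" is the field this PR adds:

```
name: "my_pytorch_model"              # illustrative model name
backend: "pytorch"
default_model_filename: "model.pt"
# New in this PR: explicitly select the runtime used to load the model
runtime: "libtriton_pytorch.so"
```

If "runtime" is omitted, autocomplete fills it in as described below.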

To retain backward compatibility, the logic previously used to determine the runtime is incorporated into autocomplete, which will attempt to fill in the "runtime" field if it is left empty. The core will load exactly the runtime specified in the "runtime" field of the model configuration, after any applicable autocomplete.

As part of the change, the autocomplete logic will fill in the Python-backend-based runtime for backends that are clearly Python-backend based, for example, the vLLM backend.

If a backend provides both a C++ and a Python-backend-based runtime, for example, the PyTorch backend, autocomplete will look at the default model filename in the model configuration to try to determine which of the two is more appropriate. For example, if the default model filename for the PyTorch backend is "model.pt", the C++ runtime will be selected; if the default model filename is "model.py", the Python-backend-based runtime will be selected. In any case, autocomplete will not alter the runtime if it is explicitly provided by the user.
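The selection logic above can be sketched as follows. This is a hypothetical Python illustration, not the actual implementation (which lives in Triton core's C++ sources); the function and parameter names are invented for clarity:

```python
PYTHON_RUNTIME = "model.py"  # Python-backend-based runtime, per the PR description


def autocomplete_runtime(config, backend_lib_exists):
    """Fill in the 'runtime' field if empty, mimicking the legacy selection.

    config            -- dict standing in for the model configuration
    backend_lib_exists -- callable reporting whether a C++ runtime library
                          is available for the backend (an assumption here)
    """
    runtime = config.get("runtime", "")
    if runtime:
        # A user-specified runtime is never altered by autocomplete.
        return runtime
    backend = config.get("backend", "")
    filename = config.get("default_model_filename", "")
    if filename.endswith(".py"):
        # e.g. PyTorch backend with default_model_filename "model.py"
        return PYTHON_RUNTIME
    cpp_lib = f"libtriton_{backend}.so"
    if backend_lib_exists(cpp_lib):
        # e.g. PyTorch backend with default_model_filename "model.pt"
        return cpp_lib
    # Clearly Python-backend-based backends (e.g. vLLM) fall through here.
    return PYTHON_RUNTIME
```

For instance, a PyTorch model with default filename "model.pt" would resolve to "libtriton_pytorch.so", while "model.py" would resolve to the Python-backend-based runtime.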

@kthui kthui requested a review from nnshah1 November 14, 2023 22:30
src/constants.h
constexpr char kPyTorchBackend[] = "pytorch";

constexpr char kPythonFilename[] = "model.py";
constexpr char kPythonBackend[] = "python";

constexpr char kVLLMBackend[] = "vllm";
A reviewer (Contributor) commented:

Do we need these for any new python based backend? Would be good to avoid that if possible - we shouldn't need to update the constants if someone adds a backend in the future

@kthui (Contributor, Author) replied:

We do not need them. constexpr char kVLLMBackend[] = "vllm"; is removed, and such backends are handled as custom backends.

@kthui force-pushed the jacky-python-based-pytorch branch 2 times, most recently from df1f9ff to 0d89719 on November 28, 2023.
@rmccorm4 (Contributor) previously approved these changes on Jan 10, 2024, commenting:
Very clean implementation 🚀

Please make sure to run a relatively full pipeline that tests some custom backends, L0_backend_python, PYTORCH:TEST, etc. before merging, not just the standard L0.

Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
@kthui kthui requested a review from rmccorm4 January 10, 2024 23:57
@kthui kthui merged commit 7363cfe into main Jan 11, 2024
1 check passed
@kthui kthui deleted the jacky-python-based-pytorch branch January 11, 2024 17:11