axs2kiss

Automated KRAI X workflows for dedicated inference engines on selected backends: vLLM and SGLang on CUDA and ROCm, NIM on CUDA, using the OpenAI API compatible LoadGen client.

To import this repository and its dependencies into your work_collection, run:

axs byquery git_repo,collection,repo_name=axs2kiss

License

Unless explicitly stated otherwise, the software in this repository is provided under the permissive MIT license.

Contact

Please contact info@krai.ai for any queries.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
llama2_using_openai_loadgen		llama2_using_openai_loadgen
llama3_1_using_openai_loadgen		llama3_1_using_openai_loadgen
openai_nginx_server_recipe		openai_nginx_server_recipe
openai_nim_server_recipe		openai_nim_server_recipe
openai_sglang_server_recipe		openai_sglang_server_recipe
openai_vllm_server_recipe		openai_vllm_server_recipe
xd670_h200_x8_nim		xd670_h200_x8_nim
xd670_h200_x8_sglang		xd670_h200_x8_sglang
xd670_h200_x8_vllm_064		xd670_h200_x8_vllm_064
xd670_h200_x8_vllm_064_flashinfer		xd670_h200_x8_vllm_064_flashinfer
xd670_h200_x8_vllm_073		xd670_h200_x8_vllm_073
xe9680_mi300x_x8_sglang		xe9680_mi300x_x8_sglang
xe9680_mi300x_x8_vllm_064		xe9680_mi300x_x8_vllm_064
xe9680_mi300x_x8_vllm_073		xe9680_mi300x_x8_vllm_073
LICENSE.txt		LICENSE.txt
README.md		README.md
data_axs.json		data_axs.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

axs2kiss

License

Contact

About

Releases

Packages

License

krai/axs2kiss

Folders and files

Latest commit

History

Repository files navigation

axs2kiss

License

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages