Releases · bentoml/OpenLLM

29 May 02:54

v0.5.1

5ff77d1

v0.5.1

Installation

pip install openllm==0.5.1

To upgrade from a previous version, use the following command:

pip install --upgrade openllm==0.5.1

Usage

To start a LLM: python -m openllm start HuggingFaceH4/zephyr-7b-beta

Find more information about this release in the CHANGELOG.md

Full Changelog: v0.5.0...v0.5.1

Assets 15

27 May 18:00

github-actions

v0.5.0-alpha.15

a4a6060

v0.5.0-alpha.15 Pre-release

Pre-release

Installation

pip install openllm==0.5.0-alpha.15

To upgrade from a previous version, use the following command:

pip install --upgrade openllm==0.5.0-alpha.15

Usage

All available models: openllm models

To start a LLM: python -m openllm start HuggingFaceH4/zephyr-7b-beta

To run OpenLLM within a container environment (requires GPUs): docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.5.0-alpha.15 start HuggingFaceH4/zephyr-7b-beta

Find more information about this release in the CHANGELOG.md

What's Changed

chore(deps): bump docker/setup-buildx-action from 3.0.0 to 3.2.0 by @dependabot in #941
chore(deps): bump github/codeql-action from 3.24.3 to 3.24.9 by @dependabot in #939
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #942
fix(compat): use annotated type from typing_compat by @rudeigerc in #943
docs: Update high-level messaging by @Sherlock113 in #949
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #947
chore(deps): bump aquasecurity/trivy-action from 0.18.0 to 0.19.0 by @dependabot in #946
chore(deps): bump taiki-e/install-action from 2.27.9 to 2.32.9 by @dependabot in #945
Update README.md by @parano in #964
chore(deps): bump taiki-e/install-action from 2.32.9 to 2.33.9 by @dependabot in #970
chore(deps): bump sigstore/cosign-installer from 3.4.0 to 3.5.0 by @dependabot in #954
chore(deps): bump docker/metadata-action from 5.5.0 to 5.5.1 by @dependabot in #956
chore(deps): bump actions/setup-python from 5.0.0 to 5.1.0 by @dependabot in #955
chore(deps): bump pypa/gh-action-pypi-publish from 1.8.11 to 1.8.14 by @dependabot in #958
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #959
fix: update correct CompletionOutput object by @aarnphm in #973
chore(deps): bump docker/build-push-action from 5.1.0 to 5.3.0 by @dependabot in #979
chore(deps): bump docker/login-action from 3.0.0 to 3.1.0 by @dependabot in #978
chore(deps): bump github/codeql-action from 3.24.9 to 3.25.3 by @dependabot in #977
chore(deps): bump docker/setup-buildx-action from 3.2.0 to 3.3.0 by @dependabot in #975
fix: make sure to respect additional parameters parse by @aarnphm in #981
chore(deps): bump peter-evans/create-pull-request from 5.0.2 to 6.0.5 by @dependabot in #976
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #980
chore(deps): bump rlespinasse/github-slug-action from 4.4.1 to 4.5.0 by @dependabot in #988
chore(deps): bump softprops/action-gh-release from 1 to 2 by @dependabot in #987
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #989
chore(deps): bump taiki-e/install-action from 2.33.9 to 2.33.22 by @dependabot in #985
chore(deps): bump actions/checkout from 4.1.1 to 4.1.5 by @dependabot in #984
chore(deps): bump next from 13.4.8 to 14.1.1 by @dependabot in #983
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #994
chore(deps): bump actions/checkout from 4.1.5 to 4.1.6 by @dependabot in #993
chore(deps): bump github/codeql-action from 3.25.3 to 3.25.5 by @dependabot in #992
chore(deps): bump aquasecurity/trivy-action from 0.19.0 to 0.20.0 by @dependabot in #991
fix(docs): update correct BentoML links by @dennisrall in #995
tests: add additional basic testing by @aarnphm in #982
infra: prepare 0.5 releases by @aarnphm in #996
chore(deps): bump actions/upload-artifact from 3.1.3 to 4.3.3 by @dependabot in #986
chore(deps): bump actions/download-artifact from 3.0.2 to 4.1.7 by @dependabot in #990
chore(qol): update CLI options and performance upgrade for build cache by @aarnphm in #997
feat(ci): running CI on paperspace by @aarnphm in #998
chore(deps): bump taiki-e/install-action from 2.33.22 to 2.33.34 by @dependabot in #1000

New Contributors

@rudeigerc made their first contribution in #943
@dennisrall made their first contribution in #995

Full Changelog: v0.5.0-alpha.1...v0.5.0-alpha.15

Contributors

parano, rudeigerc, and 5 other contributors

Assets 15

27 May 18:29

github-actions

v0.5.0

2314a36

v0.5.0 Pre-release

Pre-release

Installation

pip install openllm==0.5.0

To upgrade from a previous version, use the following command:

pip install --upgrade openllm==0.5.0

Usage

To start a LLM: python -m openllm start HuggingFaceH4/zephyr-7b-beta

Find more information about this release in the CHANGELOG.md

What's Changed

ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #870
chore(deps): bump taiki-e/install-action from 2.25.9 to 2.26.18 by @dependabot in #899
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #909
chore(deps): bump github/codeql-action from 3.23.1 to 3.24.3 by @dependabot in #908
chore(deps): bump sigstore/cosign-installer from 3.3.0 to 3.4.0 by @dependabot in #907
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #931
feat: 1.2 APIs by @aarnphm in #821
chore(deps): bump taiki-e/install-action from 2.26.18 to 2.27.9 by @dependabot in #920
chore(deps): bump next from 13.4.8 to 13.5.1 by @dependabot in #912
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #935
chore(deps): bump marocchino/sticky-pull-request-comment from 2.8.0 to 2.9.0 by @dependabot in #933
chore(deps): bump aquasecurity/trivy-action from 0.16.1 to 0.18.0 by @dependabot in #932
chore(deps): bump docker/setup-buildx-action from 3.0.0 to 3.2.0 by @dependabot in #941
chore(deps): bump github/codeql-action from 3.24.3 to 3.24.9 by @dependabot in #939
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #942
fix(compat): use annotated type from typing_compat by @rudeigerc in #943
docs: Update high-level messaging by @Sherlock113 in #949
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #947
chore(deps): bump aquasecurity/trivy-action from 0.18.0 to 0.19.0 by @dependabot in #946
chore(deps): bump taiki-e/install-action from 2.27.9 to 2.32.9 by @dependabot in #945
Update README.md by @parano in #964
chore(deps): bump taiki-e/install-action from 2.32.9 to 2.33.9 by @dependabot in #970
chore(deps): bump sigstore/cosign-installer from 3.4.0 to 3.5.0 by @dependabot in #954
chore(deps): bump docker/metadata-action from 5.5.0 to 5.5.1 by @dependabot in #956
chore(deps): bump actions/setup-python from 5.0.0 to 5.1.0 by @dependabot in #955
chore(deps): bump pypa/gh-action-pypi-publish from 1.8.11 to 1.8.14 by @dependabot in #958
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #959
fix: update correct CompletionOutput object by @aarnphm in #973
chore(deps): bump docker/build-push-action from 5.1.0 to 5.3.0 by @dependabot in #979
chore(deps): bump docker/login-action from 3.0.0 to 3.1.0 by @dependabot in #978
chore(deps): bump github/codeql-action from 3.24.9 to 3.25.3 by @dependabot in #977
chore(deps): bump docker/setup-buildx-action from 3.2.0 to 3.3.0 by @dependabot in #975
fix: make sure to respect additional parameters parse by @aarnphm in #981
chore(deps): bump peter-evans/create-pull-request from 5.0.2 to 6.0.5 by @dependabot in #976
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #980
chore(deps): bump rlespinasse/github-slug-action from 4.4.1 to 4.5.0 by @dependabot in #988
chore(deps): bump softprops/action-gh-release from 1 to 2 by @dependabot in #987
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #989
chore(deps): bump taiki-e/install-action from 2.33.9 to 2.33.22 by @dependabot in #985
chore(deps): bump actions/checkout from 4.1.1 to 4.1.5 by @dependabot in #984
chore(deps): bump next from 13.4.8 to 14.1.1 by @dependabot in #983
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #994
chore(deps): bump actions/checkout from 4.1.5 to 4.1.6 by @dependabot in #993
chore(deps): bump github/codeql-action from 3.25.3 to 3.25.5 by @dependabot in #992
chore(deps): bump aquasecurity/trivy-action from 0.19.0 to 0.20.0 by @dependabot in #991
fix(docs): update correct BentoML links by @dennisrall in #995
tests: add additional basic testing by @aarnphm in #982
infra: prepare 0.5 releases by @aarnphm in #996
chore(deps): bump actions/upload-artifact from 3.1.3 to 4.3.3 by @dependabot in #986
chore(deps): bump actions/download-artifact from 3.0.2 to 4.1.7 by @dependabot in #990
chore(qol): update CLI options and performance upgrade for build cache by @aarnphm in #997
feat(ci): running CI on paperspace by @aarnphm in #998
chore(deps): bump taiki-e/install-action from 2.33.22 to 2.33.34 by @dependabot in #1000
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #1002

New Contributors

@rudeigerc made their first contribution in #943
@dennisrall made their first contribution in #995

Full Changelog: v0.4.44...v0.5.0

Contributors

parano, rudeigerc, and 5 other contributors

Assets 15

21 Mar 01:46

aarnphm

v0.5.0-alpha.1

12ac998

v0.5.0-alpha.1 Pre-release

Pre-release

Release 0.5.0-alpha.1 [generated by GitHub Actions]

Assets 2

15 Mar 09:28

aarnphm

v0.5.0-alpha

58c741c

v0.5.0-alpha Pre-release

Pre-release

Release 0.5.0-alpha [generated by GitHub Actions]

Assets 2

06 Feb 03:17

github-actions

v0.4.44

1b54d64

v0.4.44

Installation

pip install openllm==0.4.44

To upgrade from a previous version, use the following command:

pip install --upgrade openllm==0.4.44

Usage

All available models: openllm models

To start a LLM: python -m openllm start HuggingFaceH4/zephyr-7b-beta

To run OpenLLM within a container environment (requires GPUs): docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.44 start HuggingFaceH4/zephyr-7b-beta

Find more information about this release in the CHANGELOG.md

What's Changed

fix: remove vllm dependency for pytorch bento by @larme in #893

Full Changelog: v0.4.43...v0.4.44

Contributors

larme

Assets 57

05 Feb 10:58

github-actions

v0.4.43

fe44c84

v0.4.43

Installation

pip install openllm==0.4.43

To upgrade from a previous version, use the following command:

pip install --upgrade openllm==0.4.43

Usage

All available models: openllm models

To start a LLM: python -m openllm start HuggingFaceH4/zephyr-7b-beta

To run OpenLLM within a container environment (requires GPUs): docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.43 start HuggingFaceH4/zephyr-7b-beta

Find more information about this release in the CHANGELOG.md

What's Changed

fix: limit BentoML version range by @larme in #881
chore: bump up bentoml version to 1.1.11 by @larme in #883
Bump BentoML version in tools by @larme in #884

Full Changelog: v0.4.42...v0.4.43

Contributors

larme

Assets 57

02 Feb 12:31

github-actions

v0.4.42

d1583cc

v0.4.42

Installation

pip install openllm==0.4.42

To upgrade from a previous version, use the following command:

pip install --upgrade openllm==0.4.42

Usage

All available models: openllm models

To start a LLM: python -m openllm start HuggingFaceH4/zephyr-7b-beta

To run OpenLLM within a container environment (requires GPUs): docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.42 start HuggingFaceH4/zephyr-7b-beta

Find more information about this release in the CHANGELOG.md

What's Changed

docs: Update opt example to ms-phi by @Sherlock113 in #805
chore(script): run vendored scripts by @aarnphm in #808
docs: README.md typo by @weibeu in #819
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #818
chore(deps): bump docker/metadata-action from 5.3.0 to 5.4.0 by @dependabot in #814
chore(deps): bump taiki-e/install-action from 2.22.5 to 2.23.1 by @dependabot in #813
chore(deps): bump github/codeql-action from 3.22.11 to 3.22.12 by @dependabot in #815
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #825
chore(deps): bump crazy-max/ghaction-import-gpg from 6.0.0 to 6.1.0 by @dependabot in #824
chore(deps): bump taiki-e/install-action from 2.23.1 to 2.23.7 by @dependabot in #823
docs: Add Llamaindex in freedom to build by @Sherlock113 in #826
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #836
chore(deps): bump docker/metadata-action from 5.4.0 to 5.5.0 by @dependabot in #834
chore(deps): bump aquasecurity/trivy-action from 0.16.0 to 0.16.1 by @dependabot in #832
chore(deps): bump taiki-e/install-action from 2.23.7 to 2.24.1 by @dependabot in #833
chore(deps): bump vllm to 0.2.7 by @aarnphm in #837
chore: update discord link by @aarnphm in #838
improv(package): use python slim base image and let pytorch install cuda by @larme in #807
fix(dockerfile): conflict deps by @aarnphm in #841
chore: fix typo in list_models pydoc by @fuzzie360 in #847
docs: update README.md telemetry code link by @fuzzie360 in #842
chore(deps): bump taiki-e/install-action from 2.24.1 to 2.25.1 by @dependabot in #846
chore(deps): bump github/codeql-action from 3.22.12 to 3.23.0 by @dependabot in #844
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #848
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #858
chore(deps): bump taiki-e/install-action from 2.25.1 to 2.25.9 by @dependabot in #856
chore(deps): bump github/codeql-action from 3.23.0 to 3.23.1 by @dependabot in #855
fix: proper SSE handling for vllm by @larme in #877
chore: set stop to empty list by default by @larme in #878
fix: all runners sse output by @larme in #880

New Contributors

@weibeu made their first contribution in #819
@fuzzie360 made their first contribution in #847

Full Changelog: v0.4.41...v0.4.42

Contributors

larme, fuzzie360, and 5 other contributors

Assets 57

18 Dec 18:18

github-actions

v0.4.41

b09bd20

v0.4.41

GPTQ Supports

vLLM backend now support GPTQ with upstream

openlml start TheBloke/Mistral-7B-Instruct-v0.2-GPTQ --backend vllm --quantise gptq

Installation

pip install openllm==0.4.41

To upgrade from a previous version, use the following command:

pip install --upgrade openllm==0.4.41

Usage

All available models: openllm models

To start a LLM: python -m openllm start HuggingFaceH4/zephyr-7b-beta

To run OpenLLM within a container environment (requires GPUs): docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.41 start HuggingFaceH4/zephyr-7b-beta

Find more information about this release in the CHANGELOG.md

What's Changed

docs: add notes about dtypes usage. by @aarnphm in #786
chore(deps): bump taiki-e/install-action from 2.22.0 to 2.22.5 by @dependabot in #790
chore(deps): bump github/codeql-action from 2.22.9 to 3.22.11 by @dependabot in #794
chore(deps): bump sigstore/cosign-installer from 3.2.0 to 3.3.0 by @dependabot in #793
chore(deps): bump actions/download-artifact from 3.0.2 to 4.0.0 by @dependabot in #791
chore(deps): bump actions/upload-artifact from 3.1.3 to 4.0.0 by @dependabot in #792
ci: pre-commit autoupdate [pre-commit.ci] by @pre-commit-ci in #796
fix(cli): avoid runtime __origin__ check for older Python by @aarnphm in #798
feat(vllm): support GPTQ with 0.2.6 by @aarnphm in #797
fix(ci): lock to v3 iteration of actions/artifacts workflow by @aarnphm in #799

Full Changelog: v0.4.40...v0.4.41

Contributors

dependabot, aarnphm, and pre-commit-ci

Assets 57

15 Dec 16:57

github-actions

v0.4.40

2e8fc28

v0.4.40

Installation

pip install openllm==0.4.40

To upgrade from a previous version, use the following command:

pip install --upgrade openllm==0.4.40

Usage

All available models: openllm models

To start a LLM: python -m openllm start HuggingFaceH4/zephyr-7b-beta

To run OpenLLM within a container environment (requires GPUs): docker run --gpus all -it -P -v $PWD/data:$HOME/.cache/huggingface/ ghcr.io/bentoml/openllm:0.4.40 start HuggingFaceH4/zephyr-7b-beta

Find more information about this release in the CHANGELOG.md

What's Changed

fix(infra): conform ruff to 150 LL by @aarnphm in #781
infra: update blame ignore to formatter hash by @aarnphm in #782
perf: upgrade mixtral to use expert parallelism by @aarnphm in #783

Full Changelog: v0.4.39...v0.4.40

Contributors

aarnphm

Assets 57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Installation

Usage

Installation

Usage

What's Changed

New Contributors

Contributors

Installation

Usage

What's Changed

New Contributors

Contributors

Installation

Usage

What's Changed

Contributors

Installation

Usage

What's Changed

Contributors

Installation

Usage

What's Changed

New Contributors

Contributors

GPTQ Supports

Installation

Usage

What's Changed

Contributors

Installation

Usage

What's Changed

Contributors

Releases: bentoml/OpenLLM

v0.5.1

Installation

Usage

v0.5.0-alpha.15

Installation

Usage

What's Changed

New Contributors

Contributors

v0.5.0

Installation

Usage

What's Changed

New Contributors

Contributors

v0.5.0-alpha.1

v0.5.0-alpha

v0.4.44

Installation

Usage

What's Changed

Contributors

v0.4.43

Installation

Usage

What's Changed

Contributors

v0.4.42

Installation

Usage

What's Changed

New Contributors

Contributors

v0.4.41

GPTQ Supports

Installation

Usage

What's Changed

Contributors

v0.4.40

Installation

Usage

What's Changed

Contributors