Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Length penalty draft #10

Draft
wants to merge 64 commits into
base: release
Choose a base branch
from
Draft

Length penalty draft #10

wants to merge 64 commits into from

Conversation

maxdebayser
Copy link
Contributor

This is my first go at implementing the length penalty in vLLM, just to get some feedback. But from our meeting earlier Nick hinted that there might be another way to implement this in our TGIS wrapper, without going deep in the vLLM implementation, so I'll try that approach too.

The relevant change in this draft are in sampler.py

I'm basing this PR on Joe's request-validation branch because it builds on some of the new features there, but I'm also pulling in some recent changes from the release branch here.

njhill and others added 30 commits March 20, 2024 14:51
Signed-off-by: Joe Runde <joseph.runde@ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
This adds CI to build and push our UBI-based docker image to quay.
It also removes all of the CI for building and packaging the vllm library.

---------

Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Co-authored-by: Travis Johnson <tsjohnso@us.ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Co-authored-by: Roger Wang <136131678+ywang96@users.noreply.github.com>
Co-authored-by: rsnm2 <rshaw@neuralmagic.com>
Co-authored-by: Luka <luka@paperspace>
Co-authored-by: Roy <jasonailu87@gmail.com>
Co-authored-by: Roger Meier <r.meier@siemens.com>
Co-authored-by: Jan van Lunteren <jvl@zurich.ibm.com>
And update UBI base image, grpcio-tools and accelerate dep versions.
Co-authored-by: MeloYang <meloyang05@gmail.com>
Co-authored-by: 44670 <44670@users.noreply.github.com>
joerunde and others added 14 commits March 26, 2024 13:27
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
This PR adds a new TGIS specific readme with some docs about what this
repo is, the diffs it contains from vLLM, and instructions on how to
manage our contributions.

---------

Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.