Skip to content

Commit

Permalink
Fixing the oom maybe with 2.5.1 change. (#2958)
Browse files Browse the repository at this point in the history
  • Loading branch information
Narsil authored Jan 28, 2025
1 parent c690da5 commit eb3df0f
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion server/text_generation_server/models/globals.py
Original file line number Diff line number Diff line change
Expand Up @@ -28,7 +28,7 @@
raise RuntimeError("Prefix caching is only supported with flashinfer")

MEM_POOL = torch.cuda.graph_pool_handle() if torch.cuda.is_available() else None
TGI_WIGGLE_ROOM = float(os.getenv("TGI_WIGGLE_ROOM", "0.95"))
TGI_WIGGLE_ROOM = float(os.getenv("TGI_WIGGLE_ROOM", "0.93"))
assert TGI_WIGGLE_ROOM > 0
assert TGI_WIGGLE_ROOM < 1

Expand Down

0 comments on commit eb3df0f

Please sign in to comment.