-
-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ Frontend ] Multiprocessing for OpenAI Server with zeromq
#6883
[ Frontend ] Multiprocessing for OpenAI Server with zeromq
#6883
Commits on Jul 25, 2024
-
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for bed649a - Browse repository at this point
Copy the full SHA bed649aView commit details -
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 7de9d49 - Browse repository at this point
Copy the full SHA 7de9d49View commit details -
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 9394a62 - Browse repository at this point
Copy the full SHA 9394a62View commit details -
Configuration menu - View commit details
-
Copy full SHA for dd8bf96 - Browse repository at this point
Copy the full SHA dd8bf96View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5c7fbff - Browse repository at this point
Copy the full SHA 5c7fbffView commit details -
Configuration menu - View commit details
-
Copy full SHA for 952e8ef - Browse repository at this point
Copy the full SHA 952e8efView commit details -
Configuration menu - View commit details
-
Copy full SHA for e8eac95 - Browse repository at this point
Copy the full SHA e8eac95View commit details -
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 938a843 - Browse repository at this point
Copy the full SHA 938a843View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2b8d7cd - Browse repository at this point
Copy the full SHA 2b8d7cdView commit details
Commits on Jul 26, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ea02d39 - Browse repository at this point
Copy the full SHA ea02d39View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4a2dc46 - Browse repository at this point
Copy the full SHA 4a2dc46View commit details -
Configuration menu - View commit details
-
Copy full SHA for 30f2bc9 - Browse repository at this point
Copy the full SHA 30f2bc9View commit details -
Configuration menu - View commit details
-
Copy full SHA for c718b68 - Browse repository at this point
Copy the full SHA c718b68View commit details -
Configuration menu - View commit details
-
Copy full SHA for b3d25c6 - Browse repository at this point
Copy the full SHA b3d25c6View commit details -
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 2765b17 - Browse repository at this point
Copy the full SHA 2765b17View commit details -
Configuration menu - View commit details
-
Copy full SHA for b219778 - Browse repository at this point
Copy the full SHA b219778View commit details -
Configuration menu - View commit details
-
Copy full SHA for 932ea23 - Browse repository at this point
Copy the full SHA 932ea23View commit details -
Configuration menu - View commit details
-
Copy full SHA for f029114 - Browse repository at this point
Copy the full SHA f029114View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6854758 - Browse repository at this point
Copy the full SHA 6854758View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3b5ff66 - Browse repository at this point
Copy the full SHA 3b5ff66View commit details -
Configuration menu - View commit details
-
Copy full SHA for 79247c3 - Browse repository at this point
Copy the full SHA 79247c3View commit details -
Configuration menu - View commit details
-
Copy full SHA for a39ebc0 - Browse repository at this point
Copy the full SHA a39ebc0View commit details -
Configuration menu - View commit details
-
Copy full SHA for ef257f1 - Browse repository at this point
Copy the full SHA ef257f1View commit details
Commits on Jul 28, 2024
-
Configuration menu - View commit details
-
Copy full SHA for a6c9bc5 - Browse repository at this point
Copy the full SHA a6c9bc5View commit details -
Configuration menu - View commit details
-
Copy full SHA for d7490bc - Browse repository at this point
Copy the full SHA d7490bcView commit details -
Configuration menu - View commit details
-
Copy full SHA for f68fd60 - Browse repository at this point
Copy the full SHA f68fd60View commit details -
Configuration menu - View commit details
-
Copy full SHA for 38b5b9c - Browse repository at this point
Copy the full SHA 38b5b9cView commit details -
Configuration menu - View commit details
-
Copy full SHA for bc54311 - Browse repository at this point
Copy the full SHA bc54311View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3cccebb - Browse repository at this point
Copy the full SHA 3cccebbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4b78e29 - Browse repository at this point
Copy the full SHA 4b78e29View commit details -
Configuration menu - View commit details
-
Copy full SHA for 345bfdd - Browse repository at this point
Copy the full SHA 345bfddView commit details -
Configuration menu - View commit details
-
Copy full SHA for cfbb001 - Browse repository at this point
Copy the full SHA cfbb001View commit details -
Configuration menu - View commit details
-
Copy full SHA for d811b42 - Browse repository at this point
Copy the full SHA d811b42View commit details -
Configuration menu - View commit details
-
Copy full SHA for 852534e - Browse repository at this point
Copy the full SHA 852534eView commit details -
Configuration menu - View commit details
-
Copy full SHA for e42be96 - Browse repository at this point
Copy the full SHA e42be96View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5202a59 - Browse repository at this point
Copy the full SHA 5202a59View commit details -
Configuration menu - View commit details
-
Copy full SHA for 71b1bf9 - Browse repository at this point
Copy the full SHA 71b1bf9View commit details
Commits on Jul 29, 2024
-
Configuration menu - View commit details
-
Copy full SHA for a499079 - Browse repository at this point
Copy the full SHA a499079View commit details -
Configuration menu - View commit details
-
Copy full SHA for 88a1d08 - Browse repository at this point
Copy the full SHA 88a1d08View commit details -
Configuration menu - View commit details
-
Copy full SHA for 13ce2f1 - Browse repository at this point
Copy the full SHA 13ce2f1View commit details -
Configuration menu - View commit details
-
Copy full SHA for bb8ac06 - Browse repository at this point
Copy the full SHA bb8ac06View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6ebdb3d - Browse repository at this point
Copy the full SHA 6ebdb3dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 24c8100 - Browse repository at this point
Copy the full SHA 24c8100View commit details -
Configuration menu - View commit details
-
Copy full SHA for e707049 - Browse repository at this point
Copy the full SHA e707049View commit details -
Configuration menu - View commit details
-
Copy full SHA for baaf6bc - Browse repository at this point
Copy the full SHA baaf6bcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9d19d92 - Browse repository at this point
Copy the full SHA 9d19d92View commit details -
Configuration menu - View commit details
-
Copy full SHA for f1be4b8 - Browse repository at this point
Copy the full SHA f1be4b8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8e417ad - Browse repository at this point
Copy the full SHA 8e417adView commit details -
🥅 handle shutdown and request errors
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 4c16c5e - Browse repository at this point
Copy the full SHA 4c16c5eView commit details -
🎨 fmt and clean up shutdown handler
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 6ddd4a7 - Browse repository at this point
Copy the full SHA 6ddd4a7View commit details -
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 6d7da74 - Browse repository at this point
Copy the full SHA 6d7da74View commit details -
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 97ea04d - Browse repository at this point
Copy the full SHA 97ea04dView commit details -
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 6d753a4 - Browse repository at this point
Copy the full SHA 6d753a4View commit details -
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 38e308e - Browse repository at this point
Copy the full SHA 38e308eView commit details -
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for ec19a7b - Browse repository at this point
Copy the full SHA ec19a7bView commit details
Commits on Jul 30, 2024
-
@robertgshaw2-neuralmagic This adds the `--disable-frontend-multiprocessing` flag and should also correctly pick up embeddings models to disable the multiprocessing here. (Also some unrelated formatting changes) The backend stuff is wrapped up in a context manager that handles the process startup and shutdown at exit as well, so that we don't have to muck around much in the existing server lifecycle code --------- Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 453939b - Browse repository at this point
Copy the full SHA 453939bView commit details
Commits on Jul 31, 2024
-
Features / Cleanup for MP Frontend (#387)
SUMMARY: * refactor to use single socket * cleanup comments / logging * add `do_log_stats` * add `abort`
Configuration menu - View commit details
-
Copy full SHA for 1f33286 - Browse repository at this point
Copy the full SHA 1f33286View commit details -
Use random port for backend (#390)
Picks an open port to use and boots both the client and server with it --------- Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 5362952 - Browse repository at this point
Copy the full SHA 5362952View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7214fb8 - Browse repository at this point
Copy the full SHA 7214fb8View commit details -
With all the extra fun refactors Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 98a7dab - Browse repository at this point
Copy the full SHA 98a7dabView commit details -
SUMMARY: * add endpoints to request `ModelConfig`, `SchedulerConfig`, `LoRAConfig`, `ParallelConfig` * factor out tokenizer group creation function to be a utility function * create tokenizer_group on client side
Configuration menu - View commit details
-
Copy full SHA for f5f0b45 - Browse repository at this point
Copy the full SHA f5f0b45View commit details -
Ensures no sockets are leaked on the client-side Also postpones the server shutdown await so that the backend can shutdown concurrently, and all connections can be cleaned up at the same time. This prevents hangs where the frontend blocks on remaining connections but the backend has not yet initiated shutdown --------- Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 0b351c0 - Browse repository at this point
Copy the full SHA 0b351c0View commit details -
SUMMARY: * fix issue with logit bias loading
Configuration menu - View commit details
-
Copy full SHA for 79fcc44 - Browse repository at this point
Copy the full SHA 79fcc44View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9da8c4a - Browse repository at this point
Copy the full SHA 9da8c4aView commit details -
🐛 messed up the revert in the merge commit :(
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 4c65f74 - Browse repository at this point
Copy the full SHA 4c65f74View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9bc97f1 - Browse repository at this point
Copy the full SHA 9bc97f1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 68d8612 - Browse repository at this point
Copy the full SHA 68d8612View commit details
Commits on Aug 1, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 4337fe7 - Browse repository at this point
Copy the full SHA 4337fe7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 779d9bd - Browse repository at this point
Copy the full SHA 779d9bdView commit details -
Configuration menu - View commit details
-
Copy full SHA for a6044a3 - Browse repository at this point
Copy the full SHA a6044a3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 100189f - Browse repository at this point
Copy the full SHA 100189fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0fc8545 - Browse repository at this point
Copy the full SHA 0fc8545View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6383091 - Browse repository at this point
Copy the full SHA 6383091View commit details -
Configuration menu - View commit details
-
Copy full SHA for a09f57f - Browse repository at this point
Copy the full SHA a09f57fView commit details -
✅ add test for multiprocessing flag (#399)
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for 1bdbfcb - Browse repository at this point
Copy the full SHA 1bdbfcbView commit details -
(plus rounding out the protocol with an error on `.encode`) --------- Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
Configuration menu - View commit details
-
Copy full SHA for f3c0f1c - Browse repository at this point
Copy the full SHA f3c0f1cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9c415ad - Browse repository at this point
Copy the full SHA 9c415adView commit details -
Configuration menu - View commit details
-
Copy full SHA for 62036ad - Browse repository at this point
Copy the full SHA 62036adView commit details -
Configuration menu - View commit details
-
Copy full SHA for a177d87 - Browse repository at this point
Copy the full SHA a177d87View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9ca3b93 - Browse repository at this point
Copy the full SHA 9ca3b93View commit details -
Configuration menu - View commit details
-
Copy full SHA for f8b5fb1 - Browse repository at this point
Copy the full SHA f8b5fb1View commit details -
Update vllm/entrypoints/openai/rpc/server.py
Co-authored-by: Simon Mo <simon.mo@hey.com>
Configuration menu - View commit details
-
Copy full SHA for fca5a71 - Browse repository at this point
Copy the full SHA fca5a71View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5f07f86 - Browse repository at this point
Copy the full SHA 5f07f86View commit details
Commits on Aug 2, 2024
-
Configuration menu - View commit details
-
Copy full SHA for bd0fd76 - Browse repository at this point
Copy the full SHA bd0fd76View commit details