A high-performance inference system for large language models, designed for production environments.
-
Updated
Oct 19, 2024 - C++
A high-performance inference system for large language models, designed for production environments.
An implementation of printf + A preprocessor to pass the code through before compiling
An out-of-order execution CPU simulator for CS2410 Computer Architecture course final project at the University of Pittsburgh.
Add a description, image, and links to the speculative topic page so that developers can more easily learn about it.
To associate your repository with the speculative topic, visit your repo's landing page and select "manage topics."