Tensor library for machine learning
-
Updated
Mar 4, 2025 - C++
Tensor library for machine learning
High-speed Large Language Model Serving for Local Deployment
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
TinyChatEngine: On-Device LLM Inference Library
Fast, Flexible and Portable Structured Generation
Fast Multimodal LLM on Mobile Devices
Cuda implementation of Extended Long Short Term Memory (xLSTM) with C++ and PyTorch ports
Tiny C++11 GPT-2 inference implementation from scratch
Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
CoreScheduler: A High-Performance Scheduler for Large Model Training
Bridging Items and Language: A Transition Paradigm for Large Language Model-Based Recommendation (KDD'24)
Vulkan & GLSL implementation of FlashAttention-2
This is a special PyTorch For Poor Guys Who can't afford big GPU
LLM-driven 3D terrain generation using OpenGL and C++
Add a description, image, and links to the large-language-models topic page so that developers can more easily learn about it.
To associate your repository with the large-language-models topic, visit your repo's landing page and select "manage topics."