AllenDou

High-performance gateway (L7) for Alipay, SSL offload, Intel QAT/Cavium Nitrox tech, Altera/Xilinx FPGA, AliRedis author, K8s/Kubeflow, AI beginner since 2024.

19 followers · 6 following

Alibaba
Beijing
UTC +08:00

Pinned

vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python, 41.2k stars, 6.2k forks

AutoAWQ Public

Forked from casper-hansen/AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Python

AutoFP8 Public

Forked from neuralmagic/AutoFP8

Python

llm-compressor Public

Forked from vllm-project/llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python
