LLM, CUDA/System, CPU offloading, Distributed training
- KAUST (King Abdullah University of Science and Technology)
- in/liangyu-wang-in
- @liangyuwang10
Pinned
- Tiny-DeepSpeed (Public): a minimalistic re-implementation of the DeepSpeed library
- Flash-Attention-Implementation (Public): an implementation of Flash-Attention (both forward and backward passes) with PyTorch, CUDA, and Triton. Python, 1 star
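For context on what that repository implements, here is a minimal sketch of the reference (naive) attention forward in plain PyTorch: it materializes the full seq-by-seq score matrix, which Flash-Attention avoids by tiling the computation. This is an illustrative baseline a Flash-Attention kernel must match numerically, not the repository's actual code; the function name is hypothetical.

```python
import torch

# Naive attention forward: builds the full (seq, seq) score matrix,
# the memory cost that Flash-Attention's tiled kernel avoids.
def attention_ref(q, k, v):
    scale = q.shape[-1] ** -0.5
    scores = (q @ k.transpose(-2, -1)) * scale  # (batch, heads, seq, seq)
    return torch.softmax(scores, dim=-1) @ v

q = k = v = torch.randn(2, 4, 16, 64)  # (batch, heads, seq_len, head_dim)
out = attention_ref(q, k, v)
print(out.shape)  # torch.Size([2, 4, 16, 64])
```

PyTorch's built-in `torch.nn.functional.scaled_dot_product_attention` computes the same result and can dispatch to a fused flash kernel, which makes it a convenient correctness check for a hand-written implementation.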
- Tiny-Megatron (Public): a minimalistic re-implementation of the Megatron library. Python, 3 stars
- MetaProfiler (Public): a lightweight, structure-agnostic, operator-level profiler for PyTorch models that leverages MetaTensor execution to simulate and benchmark individual ops without loading the full model. Python, 1 star
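A minimal sketch of the MetaTensor idea the description refers to (illustrative only, not MetaProfiler's actual API): tensors on PyTorch's "meta" device carry only shape and dtype metadata, so ops run through shape propagation without allocating or computing real data.

```python
import torch

# Meta tensors hold no data, only metadata; running an op on them
# infers the output shape without materializing any memory, which is
# what lets a profiler reason about ops without loading full weights.
x = torch.empty(8, 1024, device="meta")
w = torch.empty(1024, 4096, device="meta")
y = x @ w  # shape propagation only; no result buffer is allocated
print(y.shape, y.device)  # torch.Size([8, 4096]) meta
```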