@mit-han-lab

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned

  1. streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python · 6.5k stars · 363 forks

  2. smoothquant Public

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python · 1.2k stars · 134 forks

  3. llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python · 2.3k stars · 177 forks

  4. bevfusion Public archive

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python · 2.2k stars · 406 forks

  5. once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python · 1.9k stars · 334 forks

  6. temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python · 2.1k stars · 418 forks

Repositories

Showing 10 of 51 repositories
  • qserve Public

    QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

    Python · 397 stars · Apache-2.0 · 18 forks · 25 issues · 2 pull requests · Updated Sep 5, 2024
  • proxylessnas Public

    [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

    C++ · 1,419 stars · MIT · 284 forks · 0 issues · 2 pull requests · Updated Aug 30, 2024
  • spatten Public

    [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

    Scala · 63 stars · MIT · 6 forks · 1 issue · 0 pull requests · Updated Aug 27, 2024
  • fastcomposer Public

    [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

    Python · 644 stars · MIT · 36 forks · 16 issues · 0 pull requests · Updated Aug 21, 2024
  • distrifuser Public

    [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

    Python · 554 stars · MIT · 20 forks · 8 issues · 0 pull requests · Updated Aug 17, 2024
  • efficientvit Public

    EfficientViT is a new family of vision models for efficient high-resolution vision.

    Python · 1,760 stars · Apache-2.0 · 161 forks · 88 issues · 3 pull requests · Updated Aug 9, 2024
  • torchsparse Public

    [MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.

    Cuda · 1,186 stars · MIT · 137 forks · 20 issues · 1 pull request · Updated Jul 31, 2024
  • bevfusion Public archive

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python · 2,237 stars · Apache-2.0 · 406 forks · 0 issues · 0 pull requests · Updated Jul 31, 2024
  • torchquantum Public

    A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.

    Jupyter Notebook · 1,283 stars · MIT · 191 forks · 57 issues (4 need help) · 7 pull requests · Updated Jul 21, 2024
  • llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python · 2,331 stars · MIT · 177 forks · 119 issues · 8 pull requests · Updated Jul 15, 2024
