Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
-
Updated
Jan 9, 2025 - Python
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]
Code for paper "EdgeKE: An On-Demand Deep Learning IoT System for Cognitive Big Data on Industrial Edge Devices"
Official PyTorch implementation of "LGViT: Dynamic Early Exiting for Accelerating Vision Transformer" (ACM MM 2023)
Add a description, image, and links to the early-exit topic page so that developers can more easily learn about it.
To associate your repository with the early-exit topic, visit your repo's landing page and select "manage topics."