#

offloading

Here are 19 public repositories matching this topic...

FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

machine-learning deep-learning offloading high-throughput opt gpt-3 large-language-models

Updated Oct 28, 2024
Python

dvmazur / mixtral-offloading

Run Mixtral-8x7B models in Colab or consumer desktops

deep-learning pytorch offloading quantization language-model google-colab colab-notebook mixture-of-experts llm

Updated Apr 8, 2024
Python

pytorch / ao

PyTorch native quantization and sparsity for training and inference

training sparsity cuda inference optimizer pytorch transformer offloading llama quantization mx brrr dtypes float8

Updated Aug 9, 2025
Python

QECO

ImanRHT / QECO

A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning (DRL) for Mobile Edge Computing (MEC) | This algorithm captures the dynamics of the MEC environment by integrating the Dueling Double Deep Q-Network (D3QN) model with Long Short-Term Memory (LSTM) networks.

deep-reinforcement-learning dqn mdp offloading deep-q-network markov-decision-processes resource-management performance-evaluation mec ddqn edge-computing lstm-networks network-optimization d3qn

Updated Apr 5, 2025
Python

liangyuwang / zo2

ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory

offloading llama sft zeroth-order-optimization llms qwen deepseek

Updated Jul 16, 2025
Python

UMbreLLa

Infini-AI-Lab / UMbreLLa

LLM Inference on consumer devices

offloading llm-inference speculative-decoding

Updated Mar 17, 2025
Python

nareddyt / cs4365-task-offload-framework

A framework for IoT devices to offload tasks to the cloud, resulting in efficient computation and decreased cloud costs.

iot cloud framework offloading

Updated Jun 21, 2022
Python

ph4r05 / monero-agent

Monero hardware wallet protocol implementation for Trezor, agent

python protocol ed25519 trezor hardware-wallet offloading monero cryptonote transaction-signer bulletproofs trezor-crypto monero-agent trezor-monero borromean ringct key-image-sync

Updated Jun 17, 2025
Python

ubc-cirrus-lab / unfaasener

A lightweight framework that enables serverless users to reduce their bills by harvesting non-serverless compute resources such as their VMs, on-premise servers, or personal computers.

cloud serverless pubsub faas offloading offloading-framework google-cloud-function dag-scheduling serverless-workflow task-offloading

Updated Aug 16, 2024
Python

pittisl / AgileNN

Code for paper "Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI" (MobiCom'22)

tensorflow neural-networks offloading

Updated Apr 13, 2023
Python

lablup / backend.ai-client-py

Backend.AI Client Library for Python

python api-client cloud-computing offloading backendai

Updated Sep 22, 2023
Python

rohankumar42 / pandaSQL

A Pandas-inspired data analysis project with lazy semantics and query-offloading to SQLite

sql database sqlite pandas offloading lazy-evaluation

Updated Feb 25, 2021
Python

K-Wu / FlashTrain

An Activation Offloading Framework to SSDs for Faster Large Language Model Training

hooks pytorch ssd offloading interposition llm-training

Updated Apr 18, 2025
Python

Telecooperation / flexEdge-microservices

microservices offloading microservices-architecture fog-computing edge-computing

Updated May 2, 2019
Python

free001style / efficient-dl-systems

Implementations of some popular approaches for efficient deep learning training and inference

profiler amp offloading multigpu quanti fsdp tensorparallel

Updated Mar 30, 2025
Python

carcaraaa / androidTestBed

Ferramenta para a criação de ambientes de testes com dispositivos Android

android docker offloading android-emulator command-line-tool docker-android testbed mobile-cloud-computing

Updated Oct 19, 2020
Python

bourbonbourbon / Pentest-automation-raspberry

Offloading Resource-Intensive Tasks to Raspberry Pi (or IoT Devices) Using SSH

python linux ssh raspberry-pi iot python3 pentesting offloading

Updated Dec 22, 2023
Python

Anonymous0-0paper / MOSAIC

MEES: Mobility-Aware Energy Efficient Scheduling for Cyber-Physical Systems in Computing Continuum

python kubernetes iot reinforcement-learning cloud-computing offloading fog-computing mobility grid5000 random-walk edge-computing levy-walks computing-continuum cyberphysical-systems task-scheduling workflow-scheduling edge-fog-cloud

Updated Mar 6, 2025
Python

virtualramblas / FlexLLMGenMPS

Running large language models on a single M1/M2 GPU for throughput-oriented scenarios.

python machine-learning deep-learning transformers pytorch offloading high-throughput opt huggingface large-language-models

Updated Jun 21, 2025
Python

Improve this page

Add a description, image, and links to the offloading topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the offloading topic, visit your repo's landing page and select "manage topics."