🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.
-
Updated
Oct 14, 2024 - C++
🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.
FPGA Accelerator for CNN using Vivado HLS
同时支持传送TCP与UDP的KCP通道,附带端口跳跃的功能,以及FEC,自带中继服务器支持
A FPGA Based CNN accelerator, following Google's TPU V1.
A Modeling and Verification Platform for SoCs using ILAs
Advanced Matrix Extensions (AMX) Guide
Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators
An example of using Ramulator as memory model in a cycle-accurate SystemC Design
Generate an accelerator extension that makes your Antlr parser in Python super-fast!
high-performance modeling of beam dynamics in particle accelerators with collective effects
NeuroSpector: Dataflow and Mapping Optimization of Deep Neural Network Accelerators
Tool to simulate beam dynamics in synchrotron light sources
NATSA is the first near-data-processing accelerator for time series analysis based on the Matrix Profile (SCRIMP) algorithm. NATSA exploits modern 3D-stacked High Bandwidth Memory (HBM) to enable efficient and fast matrix profile computation near memory. Described in ICCD 2020 by Fernandez et al. https://people.inf.ethz.ch/omutlu/pub/NATSA_time-…
NPUsim: Full-system, Cycle-accurate, Value-aware NPU Simulator
simulating connection of micro processor and accelerator on a bus context with systemc language
Open Source Code for Advanced Radiation Simulation
Out-of-the-box CHaiDNN implementation on Zynq ZCU104
C++ wrapper for the Nvidia C libraries (e.g. CUDA driver, nvrtc, cuFFT etc.)
Add a description, image, and links to the accelerator topic page so that developers can more easily learn about it.
To associate your repository with the accelerator topic, visit your repo's landing page and select "manage topics."