AI-based Neural Network Processing Unit 🚀

A high-performance, lightweight Verilog implementation of a Neural Network Processing Element (PE) for Multi-Layer Perceptron (MLP) acceleration. Developed as Exercise 3 in the AI Systems Course at the University of Tehran.

Project Goals 🎯

Design & Implementation: Develop a neural network processing unit capable of performing multiply-accumulate (MAC), ReLU activation, and quantization operations.
Hardware Acceleration: Use Verilog to create an optimized pipelined architecture for low-latency execution.
Lightweight & Scalable: Minimize execution time and resource usage with parameterizable data widths and pipeline depths.

Features ⭐

Pipelined MAC Unit: Overlapping multiply-accumulate operations to achieve high throughput.
ReLU Activation: Hardware-optimized rectified linear unit for non-linearity.
Quantizer: Fixed-point quantization to control dynamic range and bit-width.
SRAM Interface: Dual-port, ping-pong memory for efficient weight & data buffering.
Control FSM: Manages read/write cycles and pipeline sequencing.
Parameterizable Design: Configure word width, memory depth, and pipeline stages via Verilog parameters.

Project Structure 📂

AI-based-Neural-Network-Processing-Unit/
├── src/
│   ├── PE.v           # Top-level Processing Element module
│   ├── MAC.v          # Multiply-Accumulate Unit
│   ├── ReLU.v         # ReLU Activation Module
│   ├── Quantizer.v    # Fixed-point Quantization Unit
│   ├── SRAM.v         # SRAM Storage Interface
│   └── Controller.v   # Control FSM
├── tests/
│   └── PE_testbench.v # Functional verification testbench
├── docs/
│   ├── simulation/    # Waveforms and logs
│   └── README.md      # Documentation (this file)
└── reports/
    ├── 403_EAI-CA3.pdf    # Assignment instructions
    └── Gozaresh_final.pdf # Final project report

Installation & Simulation 🛠️

Clone the repository

git clone https://github.com/Alighorbani1380/AI-based-Neural-Network-Processing-Unit

Usage 🔧

Parameter Tuning: Edit src/PE.v to adjust data widths and pipeline stages.
SoC Integration: Instantiate PE module in your top-level design for on-chip acceleration.
Custom Verification: Use tests/PE_testbench.v as a template for targeted test scenarios.

Results 📊

Simulation confirms accurate MAC, ReLU, and quantization at 100 MHz with full throughput:

GitHub Topics (SEO) 🔍

verilog hardware-acceleration neural-network MLP processing-element FPGA AI-systems

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
403_EAI-CA3.pdf		403_EAI-CA3.pdf
Multiplx.v		Multiplx.v
Multiply.v		Multiply.v
PE.v		PE.v
Quantizer.v		Quantizer.v
README.md		README.md
Register_for_pipline.v		Register_for_pipline.v
Relu.v		Relu.v
SRAM.v		SRAM.v
controller.v		controller.v
memory_init.v		memory_init.v
tb_PE.v		tb_PE.v

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI-based Neural Network Processing Unit 🚀

Table of Contents

Project Goals 🎯

Features ⭐

Project Structure 📂

Installation & Simulation 🛠️

Usage 🔧

Results 📊

GitHub Topics (SEO) 🔍

About

Uh oh!

Releases

Packages

Languages

Alighorbani1380/AI-based-Neural-Network-Processing-Unit

Folders and files

Latest commit

History

Repository files navigation

AI-based Neural Network Processing Unit 🚀

Table of Contents

Project Goals 🎯

Features ⭐

Project Structure 📂

Installation & Simulation 🛠️

Usage 🔧

Results 📊

GitHub Topics (SEO) 🔍

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages