Digit Recognition using CNN

This repository demonstrates how to build a Convolutional Neural Network (CNN) using TensorFlow to perform digit recognition on the MNIST dataset. The notebook walks through all the necessary steps, from importing libraries to saving the trained model. The filename for the main code is main.ipynb.

Introduction

This project utilizes the MNIST dataset to recognize handwritten digits (0-9) using a CNN. The notebook explains step-by-step how the network is built, trained, and evaluated.

Dataset

The dataset used is the MNIST (Modified National Institute of Standards and Technology) dataset:

Training Data: 60,000 grayscale images (28x28 pixels).
Testing Data: 10,000 grayscale images (28x28 pixels).

Each image represents a handwritten digit, and the goal is to classify each image into one of 10 classes (0-9).

Steps in the Notebook

1. Import Libraries

Essential libraries such as TensorFlow, NumPy, and Matplotlib are imported for building and visualizing the model.

2. Load the Dataset

The MNIST dataset is loaded and its shapes are printed to verify the dimensions of the data.

3. Visualize the Dataset

Several sample images from the dataset are displayed using Matplotlib for better understanding.

4. Preprocess the Dataset

The pixel values of the images are normalized to the range [0, 1].
The dataset is reshaped to include a channel dimension (required by CNNs).

5. Create the Neural Network

A CNN architecture is defined using TensorFlow:

2 Convolutional Layers with MaxPooling.
1 Dense Hidden Layer.
1 Output Layer using Softmax activation.

6. Compile the Model

The model is compiled with:

Optimizer: Adam
Loss Function: Sparse Categorical Crossentropy
Metric: Accuracy

7. Visualize the Model

A graphical representation of the model is generated using the plot_model function.

8. Train the Network

The model is trained on the training dataset for 10 epochs with validation data.

9. Evaluate the Model

The trained model's accuracy is tested on the test dataset.

10. Visualize Training Results

Plots of training/validation accuracy and loss over epochs are generated to visualize the model's learning process.

11. Save the Model

The trained model is saved in HDF5 format (mnist_cnn_model.h5) for future use.

Results

The model achieved a test accuracy of approximately 99%, demonstrating excellent performance on the MNIST dataset.
Plots of accuracy and loss confirm effective training with minimal overfitting.

Usage

Clone the repository:

git clone https://github.com/KartikAg13/digit_recognition.git
cd digit_recognition

Open the Jupyter Notebook main.ipynb:
```
jupyter notebook main.ipynb
```
Follow the step-by-step instructions in the notebook to:
- Train the model.
- Evaluate its performance.
- Save the trained model.

Saved Model

The trained model is saved as mnist_cnn_model.h5 in the repository. You can load it in another program to make predictions on new data:

from tensorflow.keras.models import load_model
model = load_model('mnist_cnn_model.h5')
predictions = model.predict(new_data)

This concludes the steps for recognizing digits using a CNN. Happy learning!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Digit Recognition using CNN

Table of Contents

Introduction

Dataset

Steps in the Notebook

1. Import Libraries

2. Load the Dataset

3. Visualize the Dataset

4. Preprocess the Dataset

5. Create the Neural Network

6. Compile the Model

7. Visualize the Model

8. Train the Network

9. Evaluate the Model

10. Visualize Training Results

11. Save the Model

Results

Usage

Saved Model

Files

README.md

Latest commit

History

README.md

File metadata and controls

Digit Recognition using CNN

Table of Contents

Introduction

Dataset

Steps in the Notebook

1. Import Libraries

2. Load the Dataset

3. Visualize the Dataset

4. Preprocess the Dataset

5. Create the Neural Network

6. Compile the Model

7. Visualize the Model

8. Train the Network

9. Evaluate the Model

10. Visualize Training Results

11. Save the Model

Results

Usage

Saved Model