Skip to content

This is one of those projects i worked hard to understand and replicate with the dataset I had. Also, this is my first project repo!!!!. It is kind of a late submission (completed 2 weeks ago) .

Notifications You must be signed in to change notification settings

BharathwajManoharan/Sign_Language_Recognition

Repository files navigation

Sign-Language-Recognition-Project

A sign language interpreter using live video feed from the camera.

Technologies and Tools

  • Python
  • TensorFlow
  • Keras
  • OpenCV

Installation

  • To set up the environment, run the following command in the command prompt:

    pyton -m pip r install_packages.txt

For GPU support, use install_packages_gpu.txt instead.

Process

  • Set the hand histogram for creating gestures by running set_hand_histogram.py.
  • Save the histogram in the code folder.
  • Label the captured gestures using OpenCV and store them in a database by running create_gestures.py.
  • Add different variations to the captured gestures by flipping all the images using Rotate_images.py.
  • Split all the captured gestures into training, validation, and test sets by running load_images.py.
  • View all the gestures using display_gestures.py.
  • Train the model using Keras by running cnn_model_train.py.
  • Run final.py to open the gesture recognition window, which will use your webcam to interpret the trained American Sign Language gestures.

Code Examples

# Model Training using CNN

import numpy as np
import pickle
import cv2, os
from glob import glob
from keras import optimizers
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import Dropout
from keras.layers import Flatten
from keras.layers.convolutional import Conv2D
from keras.layers.convolutional import MaxPooling2D
from keras.utils import np_utils
from keras.callbacks import ModelCheckpoint
from keras import backend as K
K.set_image_dim_ordering('tf')

os.environ['TF_CPP_MIN_LOG_LEVEL'] = '3'

def get_image_size():
	img = cv2.imread('gestures/1/100.jpg', 0)
	return img.shape

def get_num_of_classes():
	return len(glob('gestures/*'))

image_x, image_y = get_image_size()

def cnn_model():
	num_of_classes = get_num_of_classes()
	model = Sequential()
	model.add(Conv2D(16, (2,2), input_shape=(image_x, image_y, 1), activation='relu'))
	model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2), padding='same'))
	model.add(Conv2D(32, (3,3), activation='relu'))
	model.add(MaxPooling2D(pool_size=(3, 3), strides=(3, 3), padding='same'))
	model.add(Conv2D(64, (5,5), activation='relu'))
	model.add(MaxPooling2D(pool_size=(5, 5), strides=(5, 5), padding='same'))
	model.add(Flatten())
	model.add(Dense(128, activation='relu'))
	model.add(Dropout(0.2))
	model.add(Dense(num_of_classes, activation='softmax'))
	sgd = optimizers.SGD(lr=1e-2)
	model.compile(loss='categorical_crossentropy', optimizer=sgd, metrics=['accuracy'])
	filepath="cnn_model_keras2.h5"
	checkpoint1 = ModelCheckpoint(filepath, monitor='val_acc', verbose=1, save_best_only=True, mode='max')
	callbacks_list = [checkpoint1]
	#from keras.utils import plot_model
	#plot_model(model, to_file='model.png', show_shapes=True)
	return model, callbacks_list

def train():
	with open("train_images", "rb") as f:
		train_images = np.array(pickle.load(f))
	with open("train_labels", "rb") as f:
		train_labels = np.array(pickle.load(f), dtype=np.int32)

	with open("val_images", "rb") as f:
		val_images = np.array(pickle.load(f))
	with open("val_labels", "rb") as f:
		val_labels = np.array(pickle.load(f), dtype=np.int32)

	train_images = np.reshape(train_images, (train_images.shape[0], image_x, image_y, 1))
	val_images = np.reshape(val_images, (val_images.shape[0], image_x, image_y, 1))
	train_labels = np_utils.to_categorical(train_labels)
	val_labels = np_utils.to_categorical(val_labels)

	print(val_labels.shape)

	model, callbacks_list = cnn_model()
	model.summary()
	model.fit(train_images, train_labels, validation_data=(val_images, val_labels), epochs=15, batch_size=500, callbacks=callbacks_list)
	scores = model.evaluate(val_images, val_labels, verbose=0)
	print("CNN Error: %.2f%%" % (100-scores[1]*100))
	#model.save('cnn_model_keras2.h5')

train()
K.clear_session();

Features

Our model was able to predict the 31 characters in the ASL with a prediction accuracy > 84.19% ( I topped this score in the latest model in sync intern project repo ).

Features that can be added:

  • Deploy the project on cloud and create an API for using it.
  • Increase the vocabulary of our model
  • Incorporate feedback mechanism to make the model more robust
  • Add more sign languages

Status

Project is: finished.

About

This is one of those projects i worked hard to understand and replicate with the dataset I had. Also, this is my first project repo!!!!. It is kind of a late submission (completed 2 weeks ago) .

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages