Giter VIP home page Giter VIP logo

indiansign-language-detector's Introduction

INDIAN SIGN LANGUAGE DETECTOR

INTRODUCTION & HOW DOES SIGN LANGUAGE WORK?

  • People who are hearing impaired communication with the help of gesture-energetic movements of the hand accompanied by a living facial expression
  • This language is also used at places where listening is not possible for ex... under water drivers, asutronuts….

DATA SUMMARY:

  • This dataset contain total 4972 jpg images in 24 classes A to y. as well as all classes data is imbalanced so we perform oversampling to balance the classes.

TASK: MULTICLASS CLASSIFICATION

WE DEVICE THIS PROJECT INTO MULTIPLE STEPS

  • Make subset of training, validation and testing as well as perform oversampling to balance the all classes
  • Prepare training, validation and testing set
  • Get all classes labels
  • Convert [Train, validation, Test] set into (X_train,y_train)(X_valid,y_valid),(X_test,y_test) and convert categorical lebel into binary metrix
  • Visualise training images
  • Used Resnet50, VGG16 & EfficientnetB3 pre-trained model
  • Model Compilation
  • Model Training
  • Model Evaluation
  • Model Saving
  • Prediction on test data

LODING DATA / PREPARING DATA

  • Make a subset of data into three parts train, test, and validation and balance the data of each class with the help of splifolder library.

DATA PROCESSING

[PREPARE TRAINING, TESTING, VALIDATION DATA]

  • Found 3586 images belonging to 24 classes.

  • Found 1390 images belonging to 24 classes.

  • Found 1365 images belonging to 24 classes.

  • After perform oversampling 3586 samples for training, 1390 for validation and 1365 samples goes for testing

GET ALL CLASSES LABELS:

{'A': 0,'B': 1,'C': 2,'D': 3,'E': 4,'F': 5,'G': 6,'H': 7,'I': 8, 'K': 9,'L': 10,'M': 11,'N': 12,'O': 13,'P': 14,'Q': 15,'R': 16, 'S': 17,'T': 18,'U': 19,'V': 20,'W': 21,'X': 22,'Y': 23}

Split Training, Validation and Testing Data Into (X_train,X_test),(X_valid,y_valid),(X_test,y_test) & Convert Labels Into Binary Matrix

  • Total 24 classes is present in this data we need to seprate the labels as well as encode the categorical integer into binary matrix

Visualise The Training Images:

image

BUILD ARCHITECTURE OF RESNET50, VGG16, EFFICIENTNET-B3:

  1. RESNET50:
  • In resnet50 model we train the existing weights to get a better Result.
  • Use dropout to supress overfitting problem as well as use batch Normalization
  • At last output layer use softmax activation function because the Task is multiclass classification.
  1. VGG16:
  • In VGG16 we don’t train the existing weights
  • Not use any dropout and batch normalization to check model Performance
  1. EFFICIENTNET-B3
  • In this model also we train initial weights to get better result.
  • Use Dropout and batch normalization to supress overfittingProblem
  • Add regularize at dense layer to reduce the losses

MODEL COMPILATION & TRAINING & EVALUATION:

  1. RESNET50:
  • Here we use categorical cross-entropy loss as well as Adamax optimizer.
  • Set learning rate and monitor validation loss and train Model on 300 epoch.
  • validation loss is decreases after every epoch as well as validation accuracy is increases.

EVALUATION:

  • 4/4 [==============================] - 1s 129ms/step - loss: 0.0067 - accuracy: 1.0000 Train loss & Accuracy: [0.00668429397046566, 1.0]
  • 4/4 [==============================] - 1s 128ms/step - loss: 0.3905 - accuracy: 0.8828 Testing loss & Accuracy [0.3905099034309387, 0.8828125]
  1. VGG16:
  • Here we use categorical cross-entropy loss as well as Adamax optimizer.
  • Set learning rate and monitor validation loss and train Model on 300 epoch.
  • VGG16 model perform well on training data but in validation side accuracy is extreme less, as well as validation loss is also high

EVALUATION:

  • 4/4 [==============================] - 1s 197ms/step - loss: 0.0771 - accuracy: 1.0000 VGG16 Training Loss & Accuracy [0.07713527977466583, 1.0]
  • 4/4 [==============================] - 1s 198ms/step - loss: 1.7651 - accuracy: 0.4766 VGG16 Testing Loss & Accuracy [1.765055537223816, 0.4765625]
  1. EFFICIENTNET-B3:
  • Here we use categorical cross-entropy loss as well as Adamax optimizer.
  • Set learning rate and monitor validation loss and train Model on 300 epoch.
  • VGG16 model perform well on training data but loss is very high or validation side accuracy is around 75% , as well as validation loss is also high

EVALUATION:

  • 4/4 [==============================] - 1s 149ms/step - loss: 3.4699 - accuracy: 1.0000 Training Loss & Accuracy [3.4699418544769287, 1.0]
  • 4/4 [==============================] - 1s 147ms/step - loss: 4.0854 - accuracy: 0.9062 Testing Loss & Accuracy [4.085390090942383, 0.90625]

MODEL SAVING:

• Save model in training time with the help of model check point with .h5 extension

PREDICTION & TESTING:

  • Resnet50 model predict correctly on test image Not perform any testing on VGG16 because model not perform Well.
  • Efficientnet B3 mode do some wrong prediction because of loss is very high.

CONCLUSION:

  • Resnet50 model perform well on training as well as testing data and both training and testing loss is less. as well as model predict correct result VGG16 model is not perform good on testing data
  • EfficientnetB3 model training & Testing accuracy is good but training and testing loss is high, that's why model predict wrong output From above models we are select Resnet50 model

DEPLOY MODEL USING FLASK FRAMWORK

LIBRARY USED:

  • Tensorflow
  • Keras
  • Matplotlib
  • Glob
  • Numpy
  • Splifolder
  • Resnet50
  • VGG16
  • Efficient-net B3

TOOL USED:

  • Jupyter Notebook
  • Pycharm

indiansign-language-detector's People

Contributors

pawarmukesh avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.