Giter VIP home page Giter VIP logo

isl_interpreter's Introduction

Sign Shakti

A real-time automated interpreter for Indian/Pakistani Sign Language.

Created by Param Damle, Eli Kin, and Flavien Moise for HooHacks 2024.

Shakti means "power" or "ability" in Hindi. With our solution, we hope to bridge gaps between communities regardless of communication style and community of origin, and empower learners of all walks of life to connect with the wider world through technology.

Usage

Sign Shakti deploys in a React app. Clone this repository, and in the main folder run

npm start

Allow webcam usage and you should begin to see live prediction of ISL letters and numbers. For more info on ISL gestures, click here.

Build it Yourself!

Download the gestures dataset as "pathikreet_dataset". Also download the "frame_images_DB" dataset from YT Faces (11GB, this may take a while). All sources are linked below.

Run the cells in data_generation.ipynb to create sample images with transformation. You can specify the number of total samples (different source images of the sign for letter A, for example) and the number of iterations (for a given sample image, how many different transformed versions will we augment to the dataset?).

To train the model, run isl_model.ipynb and define your training parameters (duration, etc). There are two cells that will generate a command line command you will have to run outside the Python shell:

  1. The training command
  2. The export command. Use this to export the trained model as the SavedModel format (.pb) and continue to the next step.

You will need to use this script to convert the SavedModel type to a JSON for TensorFlow.JS. Once you have the model weights as .JSON, run the app using npm start and watch magic happen!

Sources

This project was heavily inspired by Nicholas Renotte's series on building a TensorFlow object detection model and deploying it on a React webapp.

The datasets used include the ISL Gesture dataset by Kaggle user Pathikreet Chowdhury and the YouTube Faces dataset published by Professor Lior Wolf.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.