Giter VIP home page Giter VIP logo

samnaveenkumaroff / prodigy-ml-04 Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 92 KB

A hand gesture recognition model that can accurately identify and classify different hand gestures from image or video data, enabling intuitive human-computer interaction and gesture- based control systems.

License: MIT License

Python 0.79% Jupyter Notebook 99.21%
control-systems handgesture-recognition jupyter-notebook machine-learning python computer-interaction gesture-control

prodigy-ml-04's Introduction

Prodigy-ML-04


Crafted With Love by Sam Naveenkumar .V

MIT License

Overview

Welcome to Prodigy-ML-04, a comprehensive hand gesture recognition model designed to accurately identify and classify different hand gestures from image or video data. This project aims to enable intuitive human-computer interaction and gesture-based control systems, making technology more accessible and interactive.

Features

  • Accurate Gesture Recognition: Leverage a convolutional neural network (CNN) to classify hand gestures with high accuracy.
  • Real-time Predictions: Integrate with webcam or video feed to provide real-time gesture recognition.
  • Easy to Use: Simple and clean code structure for easy understanding and modification.
  • Extensible: Easily extend the model to include more gestures or improve accuracy with additional data.

Dataset

This project uses the Leap Gesture Database from Kaggle, which can be found here.

Getting Started

Prerequisites

  • A Google account to use Google Colab.
  • Basic understanding of Python and machine learning concepts.

Using the Colab Notebook

  1. Open the Colab Notebook:

  2. Setup Kaggle API Key:

    To download the dataset, you'll need a Kaggle API key. Follow these steps:

    • Sign in to your Kaggle account.
    • Go to Account and select Create New API Token to download the kaggle.json file.
    • Upload the kaggle.json file in the Colab notebook.
  3. Run the Notebook:

    Execute each cell in the notebook sequentially. The notebook includes:

    • Downloading the Dataset: Automatically download and unzip the dataset from Kaggle.
    • Training the Model: Train the hand gesture recognition model using the provided dataset.
    • Real-time Predictions: Use the webcam to make real-time gesture predictions or upload an image/video for prediction.
  4. Saving and Loading the Model:

    The trained model will be saved to your Google Drive. You can load the saved model for future predictions without retraining.

Model Architecture

The model uses a convolutional neural network (CNN) with the following architecture:

  • Conv2D: 32 filters, 3x3 kernel, ReLU activation
  • MaxPooling2D: 2x2 pool size
  • Conv2D: 64 filters, 3x3 kernel, ReLU activation
  • MaxPooling2D: 2x2 pool size
  • Conv2D: 128 filters, 3x3 kernel, ReLU activation
  • MaxPooling2D: 2x2 pool size
  • Flatten
  • Dense: 128 units, ReLU activation
  • Dropout: 50%
  • Dense: 8 units, Softmax activation (for 8 gesture classes)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request or open an Issue to improve this project.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgements

  • The dataset used in this project is provided by Leap Gesture Database on Kaggle.
  • Special thanks to the contributors and the open-source community for their valuable resources and support.

prodigy-ml-04's People

Contributors

samnaveenkumaroff avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.