sauravraghuvanshi / udacity-computer-vision-nanodegree-program

This repository contains all my exercises and projects from the Udacity Computer Vision Nanodegree Program.

Jupyter Notebook 97.45% Python 0.40% Lua 0.05% MATLAB 0.14% C++ 0.05% C 0.04% Makefile 0.01% HTML 1.87%
udacity-nanodegree udacity python3 pytorch lstm-neural-networks cnn-for-visual-recognition cnn-rnn landmark-detection facial-keypoint-detection facial-detection

udacity-computer-vision-nanodegree-program's Introduction

Computer Vision Nanodegree

This repository contains my exercises and projects for the Computer-Vision-Nanodegree at Udacity.

Created by Saurav Raghuvanshi

Project 1: Facial Keypoint Detection

In this project, I build a facial keypoint detection system. The system consists of a face detector that uses Haar Cascades and a Convolutional Neural Network (CNN) that predicts the facial keypoints in the detected faces. The system takes in any image containing faces and predicts the locations of 68 distinguishing keypoints on each face.
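
One step of such a pipeline is mapping the keypoints the CNN predicts on a resized face crop back onto the original image. The sketch below is illustrative only and is not taken from the project: it assumes the face box is an `(x, y, w, h)` rectangle (the form OpenCV's Haar cascade detector returns) and that the CNN outputs pixel coordinates in a square crop of side `input_size`. The function name and the value 224 are hypothetical.

```python
import numpy as np

def keypoints_to_image_coords(keypoints, face_box, input_size=224):
    """Map keypoints from the resized face crop back to the full image.

    keypoints  : array-like of shape (68, 2), (x, y) pixels in the resized crop
    face_box   : (x, y, w, h) of the detected face in the original image
    input_size : side length the crop was resized to (an assumption here)
    """
    x, y, w, h = face_box
    keypoints = np.asarray(keypoints, dtype=float)
    # Undo the resize, then shift by the top-left corner of the face box.
    scale = np.array([w / input_size, h / input_size])
    return keypoints * scale + np.array([x, y])

# Usage: 68 dummy keypoints placed at the centre of the crop.
box = (50, 80, 112, 112)
crop_keypoints = np.full((68, 2), 112.0)
image_keypoints = keypoints_to_image_coords(crop_keypoints, box)
```

If the network instead emits normalized coordinates, the same idea applies with the normalization undone first.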

Some results from my facial keypoint detection system:

The Udacity repository for this project: P1_Facial_Keypoints

Project 2: Image Captioning

In this project, I design and train a CNN-RNN (Convolutional Neural Network - Recurrent Neural Network) model for automatically generating image captions. The network is trained on the Microsoft Common Objects in COntext (MS COCO) dataset. The image captioning model is displayed below.

Image Captioning Model Image source
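
The README does not say how captions are decoded at inference time. A common and simple choice is greedy decoding, sketched below in framework-agnostic Python. Here `step` is a hypothetical stand-in for one RNN step (it takes the previous token id and the recurrent state and returns word scores plus the new state), and `init_state` stands in for whatever the CNN encoder produces. None of these names come from the project.

```python
from typing import Any, Callable, List, Sequence, Tuple

def greedy_caption(
    step: Callable[[int, Any], Tuple[Sequence[float], Any]],
    init_state: Any,
    start_id: int,
    end_id: int,
    max_len: int = 20,
) -> List[int]:
    """Generate token ids one at a time, always taking the top-scoring word."""
    tokens: List[int] = []
    token, state = start_id, init_state
    for _ in range(max_len):
        scores, state = step(token, state)
        # Pick the index of the highest score (argmax over the vocabulary).
        token = max(range(len(scores)), key=scores.__getitem__)
        if token == end_id:
            break
        tokens.append(token)
    return tokens

def toy_step(token, state):
    # Hypothetical stand-in for the RNN: always prefers token + 1 (capped at 4).
    scores = [0.0] * 5
    scores[min(token + 1, 4)] = 1.0
    return scores, state

caption_ids = greedy_caption(toy_step, None, start_id=0, end_id=4)
```

In a real model the returned ids would be mapped back to words through the vocabulary. Beam search is a frequent alternative when greedy output is too repetitive.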

One good and one not-so-good sample produced by my model:

sample_171
sample_193

The Udacity repository for this project: CVND---Image-Captioning-Project

Project 3: Landmark Detection

In this project, I implement SLAM (Simultaneous Localization and Mapping) for a 2-dimensional world. Sensor and motion data gathered by a simulated robot are used to create a map of an environment. SLAM gives us a way to track the location of a robot in the world in real time and to identify the locations of landmarks such as buildings, trees, and rocks.
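
The README does not name the SLAM variant used. As a minimal sketch, assuming the Graph SLAM formulation, every motion and every landmark sighting becomes a linear constraint in an information matrix `omega` and vector `xi`, and solving the resulting system yields the best estimate of all poses and landmarks at once. The function name, argument layout, and noise parameters below are illustrative, not the project's.

```python
import numpy as np

def graph_slam(motions, measurements, num_landmarks, start=(0.0, 0.0),
               motion_noise=1.0, measurement_noise=1.0):
    """Estimate robot poses and landmark positions in a 2D world.

    motions      : list of (dx, dy), one per time step
    measurements : one list per pose of (landmark_index, dx, dy) tuples,
                   where (dx, dy) is the landmark offset seen from that pose
    Returns (poses, landmarks) with shapes (N, 2) and (num_landmarks, 2).
    """
    n_poses = len(motions) + 1
    size = n_poses + num_landmarks
    omega = np.zeros((size, size))
    xi = np.zeros((size, 2))  # one column for x, one for y

    # Anchor the first pose at the known start position.
    omega[0, 0] = 1.0
    xi[0] = start

    def add_constraint(i, j, delta, noise):
        # Encodes "position_j - position_i = delta" with weight 1 / noise.
        w = 1.0 / noise
        delta = np.asarray(delta, dtype=float)
        omega[i, i] += w
        omega[j, j] += w
        omega[i, j] -= w
        omega[j, i] -= w
        xi[i] -= w * delta
        xi[j] += w * delta

    for t, sightings in enumerate(measurements):
        for lm, dx, dy in sightings:
            add_constraint(t, n_poses + lm, (dx, dy), measurement_noise)

    for t, move in enumerate(motions):
        add_constraint(t, t + 1, move, motion_noise)

    mu = np.linalg.solve(omega, xi)
    return mu[:n_poses], mu[n_poses:]

# Usage: the robot moves right twice and sees landmark 0 from every pose.
poses, landmarks = graph_slam(
    motions=[(1, 0), (1, 0)],
    measurements=[[(0, 2, 1)], [(0, 1, 1)], [(0, 0, 1)]],
    num_landmarks=1,
)
```

Every landmark must be observed at least once. Otherwise `omega` is singular and the solve fails.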

The Udacity repository for this project: Project_Landmark Detection

My Certificate of Completion

udacity-computer-vision-nanodegree-program's People

Contributors

chaoticblack, deepeshgarg09, sauravraghuvanshi, tarunjain1st, tarushi98, vidhi-mody

udacity-computer-vision-nanodegree-program's Issues

Trained model

Hi, thanks for your amazing work! Would you like to release your already trained model?

What is COCO?

Aim

Update the README with a description of COCO and add some related links.

Hint

Use the official COCO web page to learn about it.

Add a DEVELOPERS.md

This file generally includes the following:

  1. How to install the project
  2. How to set up the project locally
  3. How to run the project after setup
