Giter VIP home page Giter VIP logo

aka-vm / hello-cv Goto Github PK

View Code? Open in Web Editor NEW
1.0 2.0 0.0 71.2 MB

This repository performs Computer-Vision tasks on multiple Image Datasets using CNN based Networks.

License: MIT License

Jupyter Notebook 99.97% Python 0.03% Shell 0.01%
cifar-10 computer-vision deep-learning image-classification keras mnist-digits-classification python3 stanford-dogs-dataset tensorflow2 transfer-learning

hello-cv's Introduction

Hello CV

Not just a hello-world Repository with entery-level code!

In this repo I have worked to develop various Computer Vision taskes. The code uses both Pytorch & Tensorflow.

TODOs for running the code

  1. Create paths.py file (use paths.py.template as a template)
  2. Download the datasets.

Datasets

All the used datasets well-known around the deeplearning community. They include -

Data Preprocessing

  • For large datasets I have used Image Iterator, This uses less memory.

  • Image augmentation is also used to augment the images.

Image Classification Results

S. No Dataset Best Accuracy Best Model Real-Life Test
1 MNIST(Digit Recognizer) 99.3% Simple Sequential CNN
2 CIFAR-10 89.94% VGG-Like Model 5 images
3 Stanford Dogs 84.3% InceptionResNetV2 4 images

Other Computer Vision Tasks

S. No Application Name Refered Literature/Implimentation Implimented Using Metric Score Visuals
1 Neural Style Transfer A Neural Algorithm of Artistic Style,
NST With Two Style
Pytorch gif
2 Dog Breed Detection (YOLOv8) Joseph Chet's Publications, YOLOv8 Implimentation Pytorch, Ultralytics mAP50-95 0.79 Alt text



Check the branches. I have created a new branch everytime I added a new type of model.

NOTE: For Some reason different hardware give different results, I used two Machines, Macbook Air M1(8gb) and Intel i7 11700k | RTX3070. RTX3070 machine gave better results with a good margin. I even used Kaggle and Jarvis Labsto train some of the models.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.