Giter VIP home page Giter VIP logo

computer-vision-course's Introduction

Computer Vision, Syllabus Fall 2019

General Information

Teachers:

CMS

Course Description: Computer Vision has become ubiquitous in our society, with applications in search, image understanding, apps, mapping, medicine, drones, and self-driving cars. Core to many of these applications are image classification, localization, detection and other visual tasks. Recent developments in neural network approaches have greatly advanced the performance of these state-of-the-art visual recognition systems. This course will give students an understanding and practical experience with many important deep CNN models applied for specific tasks of classification, segmentation, detection, recognition and restoration.

Goals: The main goal of the course is to discover the most important problems in modern Computer Vision, and approach them with powerful CNN architectures. Students will be able to implement, train, validate and debug their own neural networks and gain a detailed understanding of cutting-edge research in computer vision.

Course Format

  • Lectures + Seminars
  • Every Tuesday (Sep 17, 2019 –– Dec 24, 2019)
  • 8:30 — 11:30 @ Sheptytsky Center, Room 308

Schedule and Syllabus

Сlass Hours Self work hours Course Materials
Module 1. Introduction
(17 Sep; 1 Oct)
9 18 Deep Learning for Computer Vision
[Assignment #1]
Intro to CNN [slides]
CNNs for Image Classification [slides]
CNNs for OCR
Module 2. Image Segmentation
(Oct, 11—14)
9 18
Problem Statement
CNNs for Semantic Segmentation
[slides]
Retina Blood Vessel Segmentation [code & data]
Deep Automatic Portrait Matting [Assignment #2]
Module 3. Generative Models
(23 Nov; Nov, 29—30)
9 18
Autoencoders [slides]
VAE, GANs [slides]
CNNs for Image Restoration [slides]
[code] [data] [more data]
[Assignment #3]
Module 4. Geometry in CV
(14 Dec; 17 Dec; 24 Dec)
9 18
Camera Models and Calibration [slides] [code]
Homography Estimation, Correspondence Problem [slides] [code]
Image Stitching [code]
[Assignment #4]
*optional: Instance-level Recognition papers: FPN, Focal Loss,
R-CNN, Fast R-CNN, Faster R-CNN,
YOLO, YOLO9000, YOLOv3,
SSD,
Mask R-CNN
*optional: Face Recognition papers:
DeepFace, FaceNet, Deep Face Recognition
Total 36 72

Course Policy

  • No plagiarism and other violation of academic integrity is allowed. Be sure to obey the academic code of honour of UCU.
  • A completed assignment should be submitted to cms as an archive (named as Name-Surname-CV-HW#) or a link to the repository.

Grading Policy

  • Assignment #1: 25%
  • Assignment #2: 25%
  • Assignment #3: 25%
  • Assignment #4: 25%

Recommended Materials

main:

  • cs231n (course notes | video lectures)
  • Multiple View Geometry in Computer Vision by Hartley and Zisserman (book)
  • Elements of Geometry in Computer Vision by Pajdla (book)

optional:

  • Computer vision: algorithms and applications by Szeliski (book)
  • Computer vision: a modern approach by Forsyth and Ponce (book)
  • cs131 (course notes)
  • EPFL Deep Learning (course)
  • CS294-158 Deep Unsupervised Learning (course)

computer-vision-course's People

Contributors

burnmyletters avatar t-martyniuk avatar ylochman avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.