Solution writeup to the comma.ai Driver Monitoring Challenge

Input

4x 60s 20hz video

Ouput

Annotated face tracking video
Head pose feature vector

Dependencies

numpy
sklearn
skimage
cv2

Layout

Core
- Frame preprocessing
- Facial detection
- Facial landmark identification
- Geometric orientation
- Rendering
- Main
Support
- Configuration
- SVM preprocessing
- SVM training
- File utilities
- Numpy utilities
- Image utilities
Data
- Trained SVM
- Input
  - Video files
- Intermediate
  - Preprocessing (optional)
  - imagesToFaces
  - videosToFrames
- Output
  - Annotated video
  - Head pose estimation feature vectors
- Haar cascade classifier
Spike
- Random excursions
Tests
- TBD

Method

video -> video preprocess + dataset preprocess -> train svm -> face detection -> retrain svm -> face detection -> find landmarks -> calculate geometry -> render

Pipeline

read frame -> frame preprocess -> face detection -> find landmarks -> calculate geometry -> render

SVM Preprocessing

HEVC video dataset -> frames
Yale faces dataset -> cropped

SVM Training

Cropped yale faces -> positive samples
256 object categories dataset -> negative samples
Annotate samples
Train linear SVM
Save SVM model
(After sliding window): Retain SVM with hard-negative mining
Save new SVM model

Face Detection

Sliding window over image pyramid
Non-maximum suppression

Face Alignment and Head Pose

Facial landmark alignment
2D-3D point mapping
Compute head orientation

Render Tracking and Pose

Future:

Pupil detection
- CDF
- Feature Extraction and Normalization
Gaze Classification and Decision Pruning

Method:

Using comma ai dataset: Take in hevc video Extract frames from 60s of 20hz video (~1200)

Using yale faces dataset: Convert to jpg and grayscale Crop the images using builtin haar cascades uniform resize write to disk generate (~165) positive samples for SVM using skimage hog descriptor

Using 256_object_categories dataset: generate (~30600) negative samples for SVM using skimage hog in batches of 1000 saving to disk

arrange data correctly + add labels train svm with the positive and negative samples save trained svm

sliding window image pyramid non-maximum suppression hard negative mining retrain

find face find eyes geometric transformation for facial plane generate vector

acarcher / monitoring Goto Github PK

monitoring's Introduction

Solution writeup to the comma.ai Driver Monitoring Challenge

Input

Ouput

Dependencies

Layout

Method

Pipeline

SVM Preprocessing

SVM Training

Face Detection

Face Alignment and Head Pose

Render Tracking and Pose

Future:

monitoring's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent