

comma.ai speed challenge

The aim of the challenge is to predict the speed of a car from the pre-recorded video frames provided as training and test videos.

Challenge Link

Challenge Description

Description by comma.ai

Welcome to the comma.ai 2017 Programming Challenge!

Basically, your goal is to predict the speed of a car from a video.

data/train.mp4 is a video of driving containing 20400 frames. Video is shot at 20 fps.
data/train.txt contains the speed of the car at each frame, one speed on each line.

data/test.mp4 is a different driving video containing 10798 frames. Video is shot at 20 fps.
Your deliverable is test.txt

We will evaluate your test.txt using mean squared error. <10 is good. <5 is better. <3 is heart.
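As a rough illustration of the evaluation described above, the sketch below computes the mean squared error between predicted and true speeds and formats predictions one per line, as test.txt requires. The function names and the six-decimal output format are assumptions for illustration, not part of the challenge specification.

```python
def mean_squared_error(predicted, actual):
    """Average of squared differences between two equal-length sequences."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)


def format_speeds(speeds):
    """Render speeds as text with one value per line (decimal places assumed)."""
    return "\n".join(f"{s:.6f}" for s in speeds) + "\n"


def parse_speeds(text):
    """Parse text containing one speed per line, skipping blank lines."""
    return [float(line) for line in text.splitlines() if line.strip()]


# Made-up numbers, only to show the arithmetic: (1 + 0 + 4) / 3
predicted = [10.0, 12.0, 15.0]
actual = [11.0, 12.0, 13.0]
mse = mean_squared_error(predicted, actual)

# Round trip through the one-speed-per-line text format
round_trip = parse_speeds(format_speeds(predicted))
```

The same check can be run on the validation split before submitting, since the true speeds for data/test.mp4 are not provided.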

Model Employed

Applied transfer learning to the Inflated 3D (I3D) Inception architecture (without its original classification layer), pretrained on the ImageNet and Kinetics datasets. The pretrained weights were used to initialise the model, which was then trained end-to-end.

Requirements

  • keras
  • python-opencv (cv2)
  • python3

Data Preparation

The training video is split into training and validation videos using a 70:30 split. The videos are then chopped into example clips of 40 frames each and stored in a compressed NumPy file (npz). A clip can contain frames that overlap with frames in another clip.
To prepare the data, use the command below

sh prepare_data.sh
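The splitting and clipping described above might look roughly like the following sketch. It is not the content of prepare_data.sh: the chronological split, the stride of 20 frames, the mean-speed label per clip, and the file name are all assumptions made for illustration. Only the 70:30 ratio, the 40-frame clip length, the overlap, and the npz format come from the description.

```python
import os
import tempfile

import numpy as np

CLIP_LEN = 40  # frames per clip, as stated above
STRIDE = 20    # assumed step between clip starts; any value < CLIP_LEN gives overlap


def split_frames(n_frames, train_percent=70):
    """Return (train_indices, val_indices) for a chronological split (assumed)."""
    cut = n_frames * train_percent // 100
    return np.arange(0, cut), np.arange(cut, n_frames)


def make_clips(frames, speeds, clip_len=CLIP_LEN, stride=STRIDE):
    """Cut frames of shape (N, H, W, C) into clips of clip_len frames each.

    Each clip is labelled with the mean speed over its frames (an assumption).
    """
    if len(frames) < clip_len:
        raise ValueError("need at least clip_len frames")
    starts = range(0, len(frames) - clip_len + 1, stride)
    clips = np.stack([frames[s:s + clip_len] for s in starts])
    labels = np.array([speeds[s:s + clip_len].mean() for s in starts])
    return clips, labels


# Tiny synthetic stand-in for the decoded video: 200 frames of 8x8 RGB.
frames = np.zeros((200, 8, 8, 3), dtype=np.uint8)
speeds = np.arange(200, dtype=np.float64)

train_idx, val_idx = split_frames(len(frames))
train_clips, train_speeds = make_clips(frames[train_idx], speeds[train_idx])
val_clips, val_speeds = make_clips(frames[val_idx], speeds[val_idx])

# Store the clips in a compressed npz file and read them back.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "train_clips.npz")  # hypothetical file name
    np.savez_compressed(path, clips=train_clips, speeds=train_speeds)
    with np.load(path) as data:
        loaded_shape = data["clips"].shape
```

With a stride smaller than the clip length, consecutive clips share frames, which yields more training examples from the same video at the cost of a larger file.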

Data preparation took about 53 minutes on my local machine, with the bulk of the time (about 49 minutes) spent converting the RGB video to an optical flow video.

Training

To train the RGB model, use the command below

python3 rgb_train.py

To train the Optical Flow model, use the command below

python3 flow_train.py
