comma10k

It's 10,000 PNGs of real driving captured from the comma fleet, semantically labeled by the public. It's MIT licensed, with no academic-only restrictions or anything.

Learn more from the blog post, or on the comma.ai discord in the #comma-pencil channel.

Run

./viewer.py
to see them with the mask overlay.

Directories

 imgs/  -- The PNG image files
 masks/ -- PNG segmentation masks (update these!)
 imgs2/  -- New PNG image files paired with fisheye PNGs
 masks2/ -- PNG segmentation masks (update these!)
 imgsd/  -- Driver camera PNG image files from Comma3
 masksd/ -- PNG segmentation masks (update these!)
 segs/  -- Probability outputs from our internal segnet (unreleased, too big)

Categories of internal segnet

 1 - #402020 - road (all parts, anywhere nobody would look at you funny for driving)
 2 - #ff0000 - lane markings (don't include non-lane markings like turn arrows and crosswalks)
 3 - #808060 - undrivable
 4 - #00ff66 - movable (vehicles and people/animals)
 5 - #cc00ff - my car (and anything inside it, including wires, mounts, etc. No reflections)
 6 - #00ccff - movable in my car (people inside the car, imgsd only)
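
The category table above can be written down as a lookup from mask color to class number. This is a minimal sketch, assuming masks are compared as exact RGB triples; the names `CATEGORIES`, `hex_to_rgb`, and `COLOR_TO_CLASS` are made up for illustration and are not part of the repository.

```python
# Category table copied from this README: class number -> (hex color, short name).
CATEGORIES = {
    1: ("#402020", "road"),
    2: ("#ff0000", "lane markings"),
    3: ("#808060", "undrivable"),
    4: ("#00ff66", "movable"),
    5: ("#cc00ff", "my car"),
    6: ("#00ccff", "movable in my car"),  # imgsd only
}


def hex_to_rgb(hex_color):
    """Convert '#rrggbb' to an (r, g, b) tuple of ints in 0..255."""
    h = hex_color.lstrip("#")
    return tuple(int(h[i:i + 2], 16) for i in (0, 2, 4))


# Reverse lookup (hypothetical helper): exact RGB triple -> class number.
COLOR_TO_CLASS = {hex_to_rgb(color): num for num, (color, _) in CATEGORIES.items()}
```

With this table, `COLOR_TO_CLASS[hex_to_rgb("#00ff66")]` gives the class number for "movable". Any pixel whose color is missing from the table belongs to none of the categories.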

How can I help?

  1. Visit the Google Spreadsheet (request access to edit it), put your Discord username in the "labeler" column for the mask(s) you'll be working on, and change the status to "In Progress." If you're new, please start with just one mask so we can give you feedback before you label more; this saves you from redoing several masks because of the same mistake. UPDATE: The original imgs set is complete, but a new imgs2 set has been added and is still unfinished. It contains "e" and "f" versions of the same image number. Check the "imgs2 series" tab in the spreadsheet to see what's available.

    UPDATE 2: Interior images have been added to the imgsd folder. These are the current priority.

  2. Spend some time studying already merged masks to see how things are labeled. You could use the comma10kviewer web tool to easily do this.

  3. Watch the Beginner Tutorial youtube video below.

  4. Start labelling! Useful label tools:

    • img-labeler: Only compatible with Chrome and Edge. Other browsers like Brave, Firefox, and Opera don't work properly, even if they are Chromium-based. It must be used with browser zoom and monitor scaling disabled; otherwise it will save masks at the wrong resolution. Hardware acceleration has also been identified as a possible cause of img-labeler incorrectly saving masks with anti-aliasing. It can be disabled at chrome://settings/system.
      UPDATE: Img-labeler has been updated to support the new imgs2 set. If, for example, you would like to work on image 00074_e, simply type 74e in the image number box. Type 74f for image 00074_f.

    • An external image manipulation tool such as GIMP/Krita (Free) or Adobe Photoshop (Paid). If you choose to use an external tool, please ensure your color mode is set to 8-bit and that antialiasing doesn't change the colors on the edges of your mask.

  5. Fork this repository to your account using the "Fork" button in the top right

  6. Create a new branch from the master branch, and use your labelling tool of choice to label some images

  7. Open a pull request from your new branch to the master branch in the official repository to submit your changes!

  8. Visit the #comma-pencil channel on the comma.ai Discord for the latest news and chat about the project.
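
Step 4 warns that img-labeler can save a mask at the wrong resolution when browser zoom or monitor scaling is enabled. One way to catch this before opening a pull request is to compare the pixel dimensions of an image and its mask. The sketch below uses only the Python standard library and reads the width and height from the PNG header; `png_size` and `same_resolution` are hypothetical helpers, not scripts from the repository.

```python
import struct

# Every PNG file starts with this 8-byte signature, followed by the IHDR chunk.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_size(data):
    """Return (width, height) read from the IHDR chunk at the start of PNG bytes."""
    if len(data) < 24 or data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG file")
    # Width and height are two big-endian unsigned 32-bit integers.
    return struct.unpack(">II", data[16:24])


def same_resolution(img_path, mask_path):
    """True if the image and its mask have identical pixel dimensions."""
    with open(img_path, "rb") as f:
        img_size = png_size(f.read(24))
    with open(mask_path, "rb") as f:
        mask_size = png_size(f.read(24))
    return img_size == mask_size
```

A call might look like `same_resolution("imgs/NAME.png", "masks/NAME.png")`, where `NAME.png` is a placeholder and the assumption is that a mask shares its file name with the image it labels.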

Image Viewing Tools

TODO:

  1. comma10kviewer (not available at the moment)
  2. comma10kreviewer

Beginner Tutorial

The Goal

Publication

comma10k is still a work in progress. For now, just cite the GitHub link. Once we reach 10k images, we'll release a paper, a train/test split, and a benchmark model.

For now, we are validating on images ending with "9.png" and are seeing a categorical cross entropy loss of 0.051. Can you beat this?
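
The validation rule and the metric can be sketched in a few lines. The README does not say how the loss is averaged or which logarithm is used, so this example assumes the natural log and a mean over pixels; `is_validation` and `categorical_cross_entropy` are illustrative names, not code from the project.

```python
import math


def is_validation(filename):
    """Validation images are those whose file name ends with '9.png'."""
    return filename.endswith("9.png")


def categorical_cross_entropy(probs, labels, eps=1e-12):
    """Mean of -log(p[true class]) over pixels (assumed averaging scheme).

    probs  -- list of per-pixel probability lists, one entry per class
    labels -- list of true class indices (0-based) for the same pixels
    """
    total = 0.0
    for p, y in zip(probs, labels):
        # Clamp to eps so a predicted probability of 0 does not produce log(0).
        total += -math.log(max(p[y], eps))
    return total / len(labels)
```

Under this rule, roughly one image in ten lands in the validation set, since the split depends only on the last digit before the extension.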

It has since been beaten by YassineYousfi's "comma10k-baseline", with a CCE loss of 0.045!

Can you beat that?

comma10k's People

Contributors

aleskus, alexandresato, bigbalthazor, cedricske, doktorsleepelss, energee, erikbernheim, geohot, hassanhibbert, hubblesphere, jzluo, kdizzy07, kevinharbin, kogepang, lbj2000, meh-meh, mobula3967, mutusmurum, n-nico, nelsonjchen, pjlao307, saber422, scottadair, snevah, spektor56, thedude130, tracminhvu, venomrushct, virtuallychris, vyyyy

comma10k's Issues

793

Because this one is crazy!

CommaPencil bugs + 1 feature request

Tested this on Windows 10 and found a few bugs:

Bugs:

  1. When you zoom in and the image is bigger than the viewable area, you can only scroll vertically, not horizontally, so you can't edit anything that's cut off on the left side of the image.
  2. The generated image suffers from the BAD COLORS problem because the edges are anti-aliased.

Feature request:

It would make tagging easier if it also had a polyfill function (Photoshop has this) where you click the corner points of a polygon and it fills the shape with the color. This makes tagging more precise than having to be very accurate with the mouse (unless you have a Wacom tablet to do this with).

Images with BAD COLORS

0011_ce0ea5158a0e1080_2018-09-20--12-28-17_4_1034.png HAS 13550 pixels with BAD COLORS
0018_55d35794f4955cd1_2018-05-03--21-04-31_54_175.png HAS 761 pixels with BAD COLORS
0272_55d35794f4955cd1_2018-05-08--18-37-21_13_630.png HAS 4637 pixels with BAD COLORS
0010_dad4fa0b6f4978ea_2018-09-07--02-42-25_21_161.png HAS 371 pixels with BAD COLORS
0014_c4289221cf16a21c_2018-09-17--19-49-26_23_349.png HAS 2120 pixels with BAD COLORS
0013_f550e313a9262051_2018-05-21--17-33-38_26_590.png HAS 512 pixels with BAD COLORS
0666_d2ea74adaff2660d_2018-07-17--07-31-05_5_277.png HAS 5270 pixels with BAD COLORS
0015_6c7ffc65dfef4e01_2018-09-15--12-28-30_84_312.png HAS 2426 pixels with BAD COLORS
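
A report like the one above could be produced by counting the pixels whose color is not one of the category colors. The page does not define "BAD COLORS" precisely, so that definition is an assumption, as are the names `ALLOWED_COLORS` and `count_bad_pixels`. Decoding a PNG into pixels needs an image library and is left out; the function takes any iterable of RGB (or RGBA) tuples.

```python
# The six category colors from the README, as exact RGB triples.
ALLOWED_COLORS = {
    (0x40, 0x20, 0x20),  # road
    (0xFF, 0x00, 0x00),  # lane markings
    (0x80, 0x80, 0x60),  # undrivable
    (0x00, 0xFF, 0x66),  # movable
    (0xCC, 0x00, 0xFF),  # my car
    (0x00, 0xCC, 0xFF),  # movable in my car
}


def count_bad_pixels(pixels, allowed=ALLOWED_COLORS):
    """Count pixels whose RGB value is not an exact category color.

    Anti-aliased edges blend two category colors, so the blended
    pixels match neither and are counted here.
    """
    # Slice to the first three channels so RGBA tuples are accepted as well.
    return sum(1 for p in pixels if tuple(p[:3]) not in allowed)
```

A mask is clean under this definition when `count_bad_pixels` returns 0 for all of its pixels.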
