Giter VIP home page Giter VIP logo

handwritten-digit-dataset's Introduction

Handwritten-Digit-Dataset

CodeFactor

A collection of 107,730 28x28 PNG files of digits from 0-9 for use in machine learning.

Note: these images are not from MNIST and are original. They were created with various data augmentation techniques, which were performed on the digits.

The dataset can be found here, and is updated there. You can download it here as well. The dataset consists of 10 folders, one for each digit, and the digits are named '<digit>\<file-number>.png', where <digit> is the digit it depicts, and <file-number>, a number ranging from 0 to the total number of files in the folder - 1.

Creating your own dataset

You can create your own dataset here, do note that it takes a while for the data to download (~10-15 minutes for 29160 files, I find focusing on the tab makes the progress faster).

Steps:

  1. Draw the digit using the canvas box at the center of the screen using your mouse.
  2. Click on the button with the corresponding digit to which you drew. You can click on the 'clear' button to clear the canvas, or refresh the page to remove unsatisfactory data (e.g., the image is cut off at the bottom).
  3. When you have finished drawing all digits from 0-9 once (more won't work), click 'done', and wait for your data to download and save.

License

The dataset and code is licensed under the CC BY-SA 4.0 license, which gives permission for:

  • Sharing: distributing & copying the dataset in any format.
  • Adaptation/modification: change, add ons.
  • Commercial use

As long as:

  • Attribution: credit is given to the creator, providing a link to the license and page where the dataset was originally found.
  • Changes to the dataset are specified.
  • Your version of this dataset is distributed under the same license.

A more rigorous version can be found here.

handwritten-digit-dataset's People

Contributors

code-factor avatar jc-progjava avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

Forkers

malbers-main

handwritten-digit-dataset's Issues

File 1\310.png

Image file 1\310.png is missing, it should be created in the next batch update.

Create CSV version of data

CSV (Comma-separated values) files are easier for programs to parse. Most developers probably won't be able to conveniently convert PNG pictures into greyscale arrays.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.