Giter VIP home page Giter VIP logo

shubham99bisht / expense-tracker Goto Github PK

View Code? Open in Web Editor NEW
10.0 3.0 7.0 77.01 MB

Key Information Extraction from Scanned Receipts: The aim of this project is to extract texts of a number of key fields from given receipts, and save the texts for each receipt image in a JSON file.

Python 4.19% CSS 63.24% HTML 13.85% JavaScript 18.73%
bi-lstm ocr uploading-invoices hackoberfest2020 hacktoberfest deep-learning web-application flask python3

expense-tracker's Introduction

Expense Tracker

Team Name: GroupTwo
Team: Shubham Bisht
Shadrak Gurupnor
Mukund Choudhary

Problem Statement:

Key Information Extraction from Scanned Receipts: The aim of this task is to extract texts of a number of key fields from given receipts, and save the texts for each receipt image in a json file.

    Step 1: Extracting all the text from given Invoice Image. (Text Region Detection, OCR)
    Step 2: Recognising Key information from the text like Store Name, Address, Total Amount etc using Bi-directional LSTM based approach

Usage Guide

This repository contains three directories. Inside each folder there are documentations of the method we adopted and guide of usage.

  • Folder 1 - Bill Label Tool: A GUI tool for efficiently labelling various entities in huge dataset of invoice images.
  • Folder 2 - ML: Bi-LSTM Approach for Character-wise classification of invoice text into key entities.
  • Folder 3 - Web UI: A user interface for uploading Invoices and receiving results in a HTML form

  • Folder 4 - Android UI: Android interface for uploading Invoices, receiving results and to watch previous Bill uploads. Find the Android Repo here

Approach

  • Task 1 - Scanned Receipt OCR: Tesseract OCR
  • Task 2 - Key Information Extraction: Character-wise classification with Bi-LSTM

For the information extraction task, each image in the dataset is annotated with a text file with format shown below:

{
  "company": "STARBUCKS STORE #10208",
  "address": "11302 EUCLID AVENUE, CLEVELAND, OH (216) 229-0749",
  "date": "14/03/2015",
  "invoice id":"5628391",
  "total": "4.95",
  "items": "WhiteMochaV"
}

Results

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.