Giter VIP home page Giter VIP logo

android-chinese-menu-recognition's Introduction

Android-Chinese-Menu-Recognition

A mobile application using Chinese OCR

I. INTRODUCTION

The motivation of Android Chinese Menu Recognition application is to help tourists navigate in a foreign language environment. The application we developed enables the users to get text translate as ease as a button click. The camera captures the dishes name from the Chinese menu and returns the translated English food name as well as the corresponding picture of that dish. The system we developed includes automatic text detection, rotation(midtern), segmentation, OCR (optical character recognition), and text translation. Although the current version of our application is limited to translation from Chinese to English, it can be easily extended into a much wider range of language sets.

II ENVIRONMENT SETUP

1)We work on Ubuntu 10.04 operation system

2)Install proper IDE, such as Eclipse Helios for java.

3)Download the Android SDK r16 package and Android NDK r7 package. You would also need proper PATH variables added. The detailed information can be found at http://developer.android.com/sdk/index.html

4)Install the ADT Plugin for Eclipse. The detailed information can also be found at http://developer.android.com/sdk/eclipse-adt.html

5)Open Eclipse Windows-->SDK Manager, then add Android platforms and other packages to your SDK(Our phone device is HTC desire, so we use API 10)

6)We use Tesseract Android Tools called Tess Two project as library. They are based on the Tesseract OCR Engine and Leptonica image processing libraries. To build this project you should using the command below after you downloaded it: cd /tess-two ndk-build android update project --path ant release

7)Then import the Tess Two project as a library in Eclipse. Remember to Fix project properties when first import it.

8)Configure your own created project to use the tess-two project as a library project.

9)Download OpenCV in andoid because we will use it in our project for grayscale and edge detection. the detailed setup information can be found at http://opencv.itseez.com/index.html . We use the opencv-2.3.1

  1. The same as step 7, we import the OpenCV project as a library of your own project in Eclipse.

  2. Download chi_sim.traineddata package of tesseract at http://code.google.com/p/tesseract-ocr/downloads/list. It is simplfied Chinese trained data. Create a tessdata folder under assets in your project. Unzip it and put the trained data file named 'chi_sim.traineddata ' to tessdata folder.

III. SYSTEM FLOW

In this paper, we propose a text detection / recognition / translation algorithm that consists of following steps:

  1. Grayscale the picture

  2. Binarization

  3. Canny edge detection

  4. Image Rotation(midtern)

  5. Segmentation

  6. Optical character recognition

  7. Text translation

  8. Display of the translation and correponding pictures

android-chinese-menu-recognition's People

Contributors

nianwen avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.