Giter VIP home page Giter VIP logo

chinese-ocr's Introduction

基于tensorflow、keras/pytorch实现对自然场景的文字检测及端到端的OCR中文文字识别

实现功能

  • 文字方向检测:VGG
  • 文本检测:CTPN
  • 不定长OCR识别:CRNN + CTC

环境部署

sh setup.sh

模型训练

训练keras版本的crnn

cd train/keras-train & python train.py

训练pytorch版本的crnn

cd train/pytorch-train & python train.py

文字方向检测

在VGG16模型的基础上进行迁移学习,训练文字方向检测(0、90、180、270度)分类模型,详细代码参考angle/predict.py文件,训练图片100000张,准确率95.10%。

模型下载地址:

文字检测

支持CPU、GPU环境,一键部署,文本检测训练参考:https://github.com/eragonruan/text-detection-ctpn

OCR端到端识别

采用GRU + CTC端到端识别技术,实现不分隔识别不定长文字

提供keras与pytorch版本的训练代码,在理解keras的基础上,可以切换到pytorch版本,此版本更稳定

识别结果展示

文字检测及OCR识别结果

倾斜文字

参考

  1. crnn:https://github.com/meijieru/crnn.pytorch

  2. keras-crnn:https://www.zhihu.com/question/59645822

  3. ctpn:https://github.com/eragonruan/text-detection-ctpn, https://github.com/tianzhi0549/CTPN

chinese-ocr's People

Contributors

wenlihaoyu avatar ycg09 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.