Giter VIP home page Giter VIP logo

pytorchocr-1's Introduction

基于pytorch的OCR库


最近跟新:

  • 2020.09.18 更新文本检测说明文档
  • 2020.09.12 更新DB,pse,pan,sast,crnn训练测试代码和预训练模型

目前已完成:


接下来计划:

  • 模型转onnx及调用测试
  • 模型压缩(剪枝)
  • 模型压缩(量化)
  • 模型蒸馏
  • tensorrt部署
  • 训练通用化ocr模型
  • 结合chinese_lite进行部署
  • 手机端部署

检测模型效果(实验中)

训练只在ICDAR2015文本检测公开数据集上,算法效果如下:

模型 骨干网络 precision recall Hmean 下载链接
DB ResNet50_7*7 85.88% 79.10% 82.35% 下载链接(code:fxw6)
DB ResNet50_3*3 86.51% 80.59% 83.44% 下载链接(code:fxw6)
DB MobileNetV3 82.89% 75.83% 79.20% 下载链接(code:fxw6)
SAST ResNet50_7*7 85.72% 78.38% 81.89% 下载链接(code:fxw6)
SAST ResNet50_3*3 86.67% 76.74% 81.40% 下载链接(code:fxw6)
PSE ResNet50_7*7 84.10% 80.01% 82.01% 下载链接(code:fxw6)
PSE ResNet50_3*3 82.56% 78.91% 80.69% 下载链接(code:fxw6)
PAN ResNet18_7*7 81.80% 77.08% 79.37% 下载链接(code:fxw6)
PAN ResNet18_3*3 83.78% 75.15% 79.23% 下载链接(code:fxw6)

模型压缩剪枝效果

这里使用mobilev3作为backbone,在icdar2015上测试结果,未压缩模型初始大小为2.4M.

  1. 对backbone进行压缩
模型 pruned method ratio model size(M) precision recall Hmean
DB no 0 2.4 84.04% 75.34% 79.46%
DB backbone 0.5 1.9 83.74% 73.18% 78.10%
DB backbone 0.6 1.58 84.46% 69.90% 76.50%
  1. 对整个模型进行压缩
模型 pruned method ratio model size(M) precision recall Hmean
DB no 0 2.4 85.70% 74.77% 79.86%
DB total 0.6 1.42 82.97% 75.10% 78.84%
DB total 0.65 1.15 85.14% 72.84% 78.51%

模型蒸馏

模型 teacher student model size(M) precision recall Hmean improve(%)
DB no mobilev3 2.4 85.70% 74.77% 79.86% -
DB resnet50 mobilev3 2.4 86.37% 77.22% 81.54% 1.68
DB no mobilev3 1.42 82.97% 75.10% 78.84% -
DB resnet50 mobilev3 1.42 85.88% 76.16% 80.73% 1.89
DB no mobilev3 1.15 85.14% 72.84% 78.51% -
DB resnet50 mobilev3 1.15 85.60% 74.72% 79.79% 1.28

文档教程


文本检测效果


参考

pytorchocr-1's People

Contributors

badbadbadboy avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.