alisen39 / trwebocr

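An open-source, easy-to-use offline Chinese OCR whose recognition accuracy rivals that of the major vendors. It also provides an easy-to-use web page and a web API, so it is convenient both for people's everyday work and for other programs to call.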

License: Apache License 2.0

Python 57.68% Dockerfile 1.71% JavaScript 4.76% HTML 2.01% Vue 33.78% Shell 0.06%

docker ocr ocr-recognition python python3 web

trwebocr's Issues

Frontend error: Uncaught (in promise) TypeError: Cannot read property 'msg' of undefined
at Index.vue:199
Backend output: 2020-05-08 13:11:15,776 - backend.webInterface.tr_run - ERROR - {"code": 400, "msg": "\u4ea7\u751f\u4e86\u4e00\u70b9\u9519\u8bef\uff0c\u8bf7\u68c0\u67e5\u65e5\u5fd7", "err": "274"}
ERROR:backend.webInterface.tr_run:{"code": 400, "msg": "\u4ea7\u751f\u4e86\u4e00\u70b9\u9519\u8bef\uff0c\u8bf7\u68c0\u67e5\u65e5\u5fd7", "err": "274"}
2020-05-08 13:11:36,915 - backend.webInterface.tr_run - ERROR - {"code": 400, "msg": "\u4ea7\u751f\u4e86\u4e00\u70b9\u9519\u8bef\uff0c\u8bf7\u68c0\u67e5\u65e5\u5fd7", "err": "274"}
ERROR:backend.webInterface.tr_run:{"code": 400, "msg": "\u4ea7\u751f\u4e86\u4e00\u70b9\u9519\u8bef\uff0c\u8bf7\u68c0\u67e5\u65e5\u5fd7", "err": "274"}
2020-05-08 13:11:45,388 - backend.webInterface.tr_run - ERROR - {"code": 400, "msg": "\u4ea7\u751f\u4e86\u4e00\u70b9\u9519\u8bef\uff0c\u8bf7\u68c0\u67e5\u65e5\u5fd7", "err": "274"}
ERROR:backend.webInterface.tr_run:{"code": 400, "msg": "\u4ea7\u751f\u4e86\u4e00\u70b9\u9519\u8bef\uff0c\u8bf7\u68c0\u67e5\u65e5\u5fd7", "err": "274"}
2020-05-08 13:13:00,152 - backend.webInterface.tr_run - ERROR - {"code": 400, "msg": "\u4ea7\u751f\u4e86\u4e00\u70b9\u9519\u8bef\uff0c\u8bf7\u68c0\u67e5\u65e5\u5fd7", "err": "274"}
ERROR:backend.webInterface.tr_run:{"code": 400, "msg": "\u4ea7\u751f\u4e86\u4e00\u70b9\u9519\u8bef\uff0c\u8bf7\u68c0\u67e5\u65e5\u5fd7", "err": "274"}
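
The backend log lines above carry a JSON payload whose `msg` field is written with `\u` escapes. A minimal Python sketch for decoding one such payload to read the message; the variable names are illustrative, and the payload is copied from the log above:

```python
import json

# Payload copied from the backend log line above (raw string keeps the \u escapes literal).
logged = r'{"code": 400, "msg": "\u4ea7\u751f\u4e86\u4e00\u70b9\u9519\u8bef\uff0c\u8bf7\u68c0\u67e5\u65e5\u5fd7", "err": "274"}'

# json.loads turns the \uXXXX escapes back into readable characters.
payload = json.loads(logged)

print(payload["code"])  # status code reported by the backend
print(payload["msg"])   # human-readable message, in Chinese
print(payload["err"])   # the "err" field from the log
```

The decoded `msg` reads 产生了一点错误，请检查日志, that is, "An error occurred, please check the log", so the payload itself does not say what failed; the `err` value "274" is the only other detail logged.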

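Poor English recognition?

There are no spaces between the recognized English words
Recognition seems rather slow
Letters are misrecognized: an uppercase "I" is recognized as a lowercase "l" (L)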

Error when running with Docker

After pulling the Docker image and running docker run, this error keeps appearing.
`2020-10-27 07:42:45,851 INFO success: trwebocr entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)

2020-10-27 07:42:48,861 INFO exited: trwebocr (exit status 1; not expected)

2020-10-27 07:42:49,864 INFO spawned: 'trwebocr' with pid 1385`

Docker run

docker run -itd -p 8089:8089 --name trwebocr trwebocr:latest fails to run.
It should be: docker run -itd -p 8089:8089 --name trwebocr mmmz/trwebocr:latest
Otherwise the image cannot be found.

Can the same image produce different results? What would cause that?

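Cooperation and exchange

Hello, we are an artificial intelligence company under a central state-owned enterprise (中译语通科技股份有限公司). We mainly work on research and development in big data, smart cities, machine translation, knowledge graphs, speech recognition, OCR, and related technologies. I am the technical lead here. I came across your open-source system on GitHub and am very interested in it, and I hope to discuss it with you further.

You can add me on WeChat: 18611586751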

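How can the detection boxes on the frontend page be drawn on the original image instead? The image on the right is a bit small

Traditional Chinese characters cannot be recognized

As shown in the screenshot, Traditional Chinese characters are not recognized at all

How to deploy with multiple processes or multiple threads

How can this model be deployed with multiple threads or multiple processes to speed up batch calls?

How to start it offline?

It fails to start without a network connection. How can it be started? Does it need authorization? Could the author please get in touch?

After a fresh install, the log directory is missing and the service cannot start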

FileNotFoundError: [Errno 2] No such file or directory: '/home/test/trwebocr/TrWebOCR-master/backend/logs/log.log'

mkdir backend/logs
python backend/main.py

Service started successfully
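
The workaround above creates the missing directory by hand. The same idea in Python, as a minimal sketch: create the log directory before attaching a file handler. The helper name `make_file_logger`, the logger name, and the way the path is passed in are assumptions for illustration, not the project's actual code.

```python
import logging
import os


def make_file_logger(log_path, name="trwebocr-example"):
    """Return a logger writing to log_path, creating its directory first."""
    # abspath guards against a bare file name, whose dirname would be empty.
    log_dir = os.path.dirname(os.path.abspath(log_path))
    # exist_ok=True makes this a no-op when the directory is already there.
    os.makedirs(log_dir, exist_ok=True)

    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    handler = logging.FileHandler(log_path, encoding="utf-8")
    handler.setFormatter(
        logging.Formatter("%(asctime)s - %(name)s - %(levelname)s - %(message)s")
    )
    logger.addHandler(handler)
    return logger
```

Calling `make_file_logger("backend/logs/log.log")` from the project root would then succeed on a fresh checkout, because the `backend/logs` directory is created on demand instead of being expected to exist.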

Installation failed

Traceback (most recent call last):
  File "backend/main.py", line 11, in <module>
    from backend.webInterface import tr_run
  File "/Users/fizz/the-world/TrWebOCR/backend/webInterface/tr_run.py", line 8, in <module>
    import tr
  File "/Users/fizz/the-world/TrWebOCR/backend/tr/__init__.py", line 2, in <module>
    from .tr import *
  File "/Users/fizz/the-world/TrWebOCR/backend/tr/tr.py", line 24, in <module>
    _libc = ctypes.cdll.LoadLibrary(os.path.join(_BASEDIR, 'libtr.so'))
  File "/Library/Frameworks/Python.framework/Versions/3.7/lib/python3.7/ctypes/__init__.py", line 442, in LoadLibrary
    return self._dlltype(name)
  File "/Library/Frameworks/Python.framework/Versions/3.7/lib/python3.7/ctypes/__init__.py", line 364, in __init__
    self._handle = _dlopen(self._name, mode)
OSError: dlopen(/Users/fizz/the-world/TrWebOCR/backend/tr/libtr.so, 6): no suitable image found. Did find:
/Users/fizz/the-world/TrWebOCR/backend/tr/libtr.so: unknown file type, first eight bytes: 0x7F 0x45 0x4C 0x46 0x02 0x01 0x01 0x03
/Users/fizz/the-world/TrWebOCR/backend/tr/libtr.so: unknown file type, first eight bytes: 0x7F 0x45 0x4C 0x46 0x02 0x01 0x01 0x03
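
The "first eight bytes" reported by dlopen begin with 0x7F 0x45 0x4C 0x46, which is the byte sequence `\x7fELF`, the magic number of an ELF binary. macOS loads Mach-O libraries, so this suggests the bundled `libtr.so` was built for Linux. A minimal sketch for checking a library file's format the same way; the function name is hypothetical:

```python
def identify_binary_format(path):
    """Guess the container format of a binary file from its leading bytes."""
    with open(path, "rb") as f:
        magic = f.read(4)

    # ELF: 0x7F followed by the ASCII letters "ELF".
    if magic == b"\x7fELF":
        return "ELF"
    # Mach-O: 64-bit and 32-bit little-endian magics, plus the fat/universal header.
    # (The fat header bytes are shared with Java class files.)
    if magic in (b"\xcf\xfa\xed\xfe", b"\xce\xfa\xed\xfe", b"\xca\xfe\xba\xbe"):
        return "Mach-O"
    # PE (Windows DLL/EXE) files start with the DOS stub signature "MZ".
    if magic[:2] == b"MZ":
        return "PE"
    return "unknown"
```

Running `identify_binary_format("backend/tr/libtr.so")` on the file from the traceback would be expected to return `"ELF"`, given the bytes shown in the error message.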

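Recognition of "-" is not very good

In real-world testing, a "-" in front of an amount or inside text is sometimes not recognized. Changing the scaling ratio improved things, but there are still other images where it is missed.
How can this be improved?
Thanks

I started it successfully on a Linux virtual machine. How do I access it through the web page?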

@alisen39

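Which version uses Tr1.5?

I need to run it in an environment without network access. The new version has to verify Tr and so cannot be used. Which version uses the old Tr and can run directly without a network?

Calling the API from Postman returns 405: Method Not Allowed

Hi,
Postman returns this error:

<title>405: Method Not Allowed</title> 405: Method Not Allowed How can I fix this? I am using x-www-form-urlencoded mode and calling with the POST method

Documentation on the recognition algorithm

Hello, is there an introduction to the algorithm behind this project? I would like to study it

How do I use the GPU version?

Request: add HTTP cross-origin (CORS) support

Docker container won't run on a Raspberry Pi with Ubuntu 18.04

Why won't the Docker container run on my Raspberry Pi 3B?

405 error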

import base64
import json

import requests

url3 = 'http://192.168.1.102:8089/api/tr-run'
# Base64-encode the raw file bytes; cv2.imread() returns a decoded pixel array, not the PNG file itself.
with open('a.png', 'rb') as f:
    image = base64.b64encode(f.read())
data = {'img': image.decode('utf-8'), 'compress': 0}
json_mod = json.dumps(data)
res = requests.post(url=url3, data=json_mod)
print(res)
Why does this code return a 405 error?
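
One thing worth checking, offered as an assumption rather than a confirmed diagnosis: a 405 response means the URL was routed to a handler that does not accept POST, so the exact path matters, including whether it ends in a slash. The sketch below builds (without sending) a form-encoded POST using only the standard library. The trailing slash in `/api/tr-run/`, the form encoding, and the helper name `build_ocr_request` are assumptions; the `img` and `compress` field names come from the code above.

```python
import base64
import urllib.parse
import urllib.request


def build_ocr_request(base_url, image_bytes, compress=0):
    """Build a form-encoded POST request carrying a base64 image (not sent here)."""
    # Assumption: the endpoint path ends with a slash; adjust if the server differs.
    url = base_url.rstrip("/") + "/api/tr-run/"
    fields = {
        "img": base64.b64encode(image_bytes).decode("ascii"),
        "compress": compress,
    }
    # A bytes body with no explicit Content-Type is sent by urllib as
    # application/x-www-form-urlencoded.
    body = urllib.parse.urlencode(fields).encode("ascii")
    return urllib.request.Request(url, data=body, method="POST")
```

To send it, read the image as bytes and pass the result to `urllib.request.urlopen`, for example `urllib.request.urlopen(build_ocr_request('http://192.168.1.102:8089', open('a.png', 'rb').read()))`. If the server still answers 405, comparing the path against the route registered in the backend is the next step.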

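Why does it keep getting killed automatically?

In my testing, the service starts, but after being used for a while it gets killed automatically
The Docker container was started on CentOS 8

It seems that memory is not released after each use

Deadlock when calling from multiple Python processes

The backend runs 4 processes under supervisor. When Python uses a process pool of 4 processes to call the tr.run function, it never returns; with 1 process it returns normally. What could the problem be? Thanks.

How can this run on a LAN? no server available

How can this be run on a LAN?
Without an internet connection it reports no server available; once connected to the internet it works.

Which address does it contact for verification at startup? We need to open external network access specifically for it

Which address does it contact for verification at startup? We need to open external network access specifically for it. If authorization from the tr author is required, how can we reach the author?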

Cannot start when deployed with Docker on Ubuntu

code_Segmentation fault (core dumped)

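How can the Docker image use GPU acceleration for recognition?

I want to build an API for batch-converting PDF to Word, but recognition is a bit slow, and it is even slower on images with many targets. Please tell me how to enable GPU acceleration; I would be very grateful

Crashes when the file is longer than 30000px, at roughly over 2MB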

tr

Error when running on a Raspberry Pi. How can this be solved? ModuleNotFoundError: No module named 'cv2'

root@raspberrypi:~/TrWebOCR-master# python backend/main.py
Traceback (most recent call last):
  File "backend/main.py", line 10, in <module>
    from backend.webInterface import tr_run
  File "/root/TrWebOCR-master/backend/webInterface/tr_run.py", line 8, in <module>
    import tr
  File "/root/TrWebOCR-master/backend/tr/__init__.py", line 2, in <module>
    from .tr import *
  File "/root/TrWebOCR-master/backend/tr/tr.py", line 6, in <module>
    import cv2
ModuleNotFoundError: No module named 'cv2'
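
The traceback above stops at `import cv2`, which means the OpenCV Python bindings are not installed for the interpreter being used. A minimal preflight sketch that reports which modules cannot be imported before the service is started. The module list is an assumption drawn from the packages mentioned elsewhere on this page (`cv2`, `numpy`, `PIL`, `tornado`), and the function name is hypothetical.

```python
import importlib.util


def find_missing_modules(module_names):
    """Return the top-level names from module_names that cannot be located for import."""
    return [name for name in module_names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing_modules(["cv2", "numpy", "PIL", "tornado"])
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```

The `cv2` module is provided by the `opencv-python` package, so a report listing `cv2` points at that package being absent from the current environment.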

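How to specify the language?

Is there a setting to specify which language to recognize?

Can the GPU be used, and how is it enabled?

The CPU is a bit slow; one image takes several seconds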

docker image not working

mmmz/trwebocr
itplanes01/trwebocr

missing file

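On CentOS 8, installation requires editing requirements.txt to remove the pinned version numbers

The package versions currently installed are listed below and work correctly
pip list

Package        Version
-------------- -------------------
certifi        2020.4.5.1
libtorch       1.2.0.1
numpy          1.18.4
opencv-python  4.2.0.34
Pillow         7.1.2
pip            20.0.2
setuptools     46.1.3.post20200330
tornado        6.0.4
tr             1.0.0.3
wheel          0.34.2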

It won't run on Win10. Is that because libtr.dll is missing?

Traceback (most recent call last):
  File "backend/main.py", line 10, in <module>
    from backend.webInterface import tr_run
  File "c:\Projects\TrWebOCR\TrWebOCR-master\backend\webInterface\tr_run.py", line 8, in <module>
    import tr
  File "c:\Projects\TrWebOCR\TrWebOCR-master\backend\tr\__init__.py", line 2, in <module>
    from .tr import *
  File "c:\Projects\TrWebOCR\TrWebOCR-master\backend\tr\tr.py", line 22, in <module>
    _libc = ctypes.cdll.LoadLibrary(os.path.join(_BASEDIR, 'libtr.dll'))
  File "C:\ProgramData\Miniconda3\lib\ctypes\__init__.py", line 442, in LoadLibrary
    return self._dlltype(name)
  File "C:\ProgramData\Miniconda3\lib\ctypes\__init__.py", line 364, in __init__
    self._handle = _dlopen(self._name, mode)
OSError: [WinError 126] The specified module could not be found