Comments (5)
Can you give a more detailed example? Based solely on what you mentioned in your question, there is a scenario where a key points to multiple values in your data, right?
If so, you need to check if your GT is correctly associated with the KV relationship, and briefly calculate the proportion of this scenario in the entire dataset, and try to increase it as much as possible.
from paddleocr.
Please look into this document, for ex,
patient name is key and pamela wood is a answer
from paddleocr.
Did you use the official model for inference?
Have you used the data from the current document for fine-tuning the model?
from paddleocr.
Yes i am using this official paddleocr model for english.
Now that is a default model, i can't finetune the model.
Can you please share some ideas or anything about this problem
from paddleocr.
You can refer to the following document to fine tune the official model to fit your data, including data preparation, starting training, and so on.
Chinese document address:
https://github.com/PaddlePaddle/PaddleOCR/blob/main/doc/doc_ch/kie.md
English document address:
https://github.com/PaddlePaddle/PaddleOCR/blob/main/doc/doc_en/kie_en.md
from paddleocr.
Related Issues (20)
- PaddleOCR not working in a multiprocessing scenario HOT 5
- Could not load library libcublasLt.so.12. Error: libcublasLt.so.12: cannot open shared object file: No such file or directory HOT 1
- 版面分析文本块未识别 HOT 2
- 有没有更多印章数据集下载链接?能分享一下吗?谢谢 HOT 1
- Predict SER, RE KIE with my ocr HOT 3
- rec_svtrnet_ch.yml 配置训练的图片,识别报错 HOT 1
- 运行示例代码报错:FatalError: `Segmentation fault` is detected by the operating system. HOT 3
- resnet34 rec模型如何修改ctcloss以改善识别结果
- Android PP-OCRv3 HOT 2
- 按照示例Python代码。进行PDF类型的版面识别报错,但是使用命令行正常 HOT 4
- paddle2onnx现已不支持--input_shape_dict="{'x':[-1,3,x,x}"参数使用 HOT 4
- 打开TensorRT不使用动态Shape的情况下 识别速度越来越慢 HOT 2
- 运行PaddleOCR示例代码报错 TypeError: TextSystem.call() takes from 2 to 3 positional arguments but 4 were given HOT 2
- Cannot run "only" text detection HOT 1
- 自己数据集训练效果很差
- Support to load model in a local directory
- 使用Pyinstaller將PPOCRLabel轉exe後執行發生Missing string id : keyDialogTip
- 使用pyinstaller打包后的程序无法运行
- [branch 2.6.1 & 2.7.1]PyMuPDF install failed(pip install -r requirements.txt) HOT 3
- Crash when using PaddleOCR with CPU on Google Colab HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from paddleocr.