Giter VIP home page Giter VIP logo

Comments (5)

yuming-long avatar yuming-long commented on June 11, 2024

Hi there!
You should be able to specify OCR agent with env OCR_AGENT from link.

Please set it like export OCR_AGENT="unstructured.partition.utils.ocr_models.paddle_ocr.OCRAgentPaddle" from the constant definition here.

from unstructured.

Coniferish avatar Coniferish commented on June 11, 2024

Hi @Timotheevin! We need to update our documentation, but you can specify which agent you want to use or even provide your own using an OCR_AGENT environment variable.
Here's the commit where this was added: https://github.com/Unstructured-IO/unstructured/pull/2462/files

from unstructured.

peixin-lin avatar peixin-lin commented on June 11, 2024

Hi there! You should be able to specify OCR agent with env OCR_AGENT from link.

Please set it like export OCR_AGENT="unstructured.partition.utils.ocr_models.paddle_ocr.OCRAgentPaddle" from the constant definition here.

Hi, may I ask which version I should use to enable this feature?

from unstructured.

Coniferish avatar Coniferish commented on June 11, 2024

Hi @peixin-lin
It looks like that was introduced in version 0.12.4, so any version after that should be fine.

from unstructured.

vlavorini avatar vlavorini commented on June 11, 2024

Hello,
I tried to set os.environ['OCR_AGENT'] = '"unstructured.partition.utils.ocr_models.paddle_ocr.OCRAgentPaddle"'

But I get this error:

ValueError: Environment variable OCR_AGENT must be set to an existing OCR agent module, not unstructured.partition.utils.ocr_models.paddle_ocr.OCRAgentPaddle.

but that is exactly how the env variable should be set, or am I wrong?

from unstructured.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.