Related Issues (20)
- bug/partition_pdf removes spaces from the text
- Documentation for Ingestion of wikipedia
- bug: TesseractError: Estimating resolution as X HOT 1
- feat/ add a PPTX Picture shape sub-partitioner HOT 1
- Documentation for Partitioning table for email has wrong class type HOT 1
- Got `NameError: name 'sort_page_elements' is not defined` when tried to extract tables. HOT 5
- Doc/Docx with Checkboxes
- bug/Execution speed is very slow in AWS Lambda environment HOT 1
- bug/Execution speed is very slow in AWS LAMBDA environment HOT 10
- `infer_table_structure` in `partition_pdf` function causes CUDA RuntimeError
- `infer_table_structure` lead `Failed to initialize the model`
- chore: Update unstructured-client
- Clarify `orig_elements` documentation HOT 4
- ValueError: Detected a JSON file that does not conform to the Unstructured schema. partition_json currently only processes serialized Unstructured output.
- bug/KeyError with PDF partition fast strategy element ID — in old_to_new_mapping[parent_id] HOT 3
- feat/partition_metadata HOT 1
- bug/partition_pdf doesn't recognize given input parameter HOT 4
- 启动时能禁止nltk连网检查更新package吗?
- Text Extraction Issue: Greek Language PDFs Rendered with Incorrect Alphabet HOT 3
- feat/docx-field-codes
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from unstructured.