Comments (4)
As I wrote in another issue you should as well consider compile time regular expressions. I found this paper which contains some benchmark data compared to re2 - where performance seems to be equal or just a little worse. However, CTRE is available as 3 .hpp files and does support unicode vs a whole bunch of files for re2.
from podofo.
Ctre does not do dynamic regex, and we need those in text extraction. I am unsure we should carry a second dependency just to support a single static re. If there were more, I could change my mind, but not now.
from podofo.
To provide more context, I asked ctre dev if it supports ECMAScript and if it supports dynamic patterns. She said no to both questions and she recommended re2 instead.
from podofo.
Yeah it might have been beneficial but if re2 is better suited why not.
from podofo.
Related Issues (20)
- Required TrueType table missing HOT 5
- [podofo-0.10.0]Stack-Overflow HOT 1
- [podofo-0.10.0]a SIGSEGV error occurs HOT 4
- compilation difficulties on macOS HOT 2
- There exists heap-buffer-overflow at podofo-0.10.0/src/podofo/main/PdfXRefStreamParserObject.cpp:180:75 in readXRefStreamEntry HOT 3
- Heap-use-after-free in podofo 0.10.0(main/PdfEncrypt.h:352:47 in PoDoFo::PdfEncrypt::IsMetadataEncrypted()) HOT 3
- Heap-buffer-overflow in podofo 0.10.0(main/PdfEncrypt.cpp in PoDoFo::PdfEncryptAESV3::PdfEncryptAESV3) HOT 1
- Heap-buffer-overflow in podofo 0.10.0(main/PdfEncrypt.cpp:1132:5 in PoDoFo::PdfEncryptRC4::PdfEncryptRC4) HOT 2
- Crash on working with outlines when merging files. HOT 1
- [REG] Unable to set metadata 0.10.0
- The new PdfMetadata class in 0.10.0 breaks various use cases HOT 10
- Infinite loop in PdfDocument::InsertDocumentPageAt HOT 5
- Regression: Fails to operate with font the svn version works just fine HOT 5
- Where is PoDoFoBrowser for PoDoFo 0.10.0? HOT 1
- API changes confirmed for new version 0.10.0 HOT 6
- When extracting text using podofo and poppler, podofo is much slower HOT 1
- error while loading shared libraries (SVN version) HOT 1
- pkg-config not find library version
- PDFDocument:AppendDocumentPages performance regression HOT 33
- Tracking podofo in downstream package ecosystem HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from podofo.