Comments (8)
You can crank up the heap space (default: 1GB), eg. to 20GB, by prepending your command line using JAVA_OPTIONS=-Xmx20g
.
from hdt-java.
Make sure your configuration uses the "Big" dictionary (https://github.com/rdfhdt/hdt-java/blob/master/hdt-java-core/src/main/java/org/rdfhdt/hdt/dictionary/impl/FourSectionDictionaryBig.java) , should be enabled by default though. (https://github.com/rdfhdt/hdt-java/blob/master/hdt-java-core/hdt.cfg)
I would recommend using the CPP version though, it's more efficient, faster and more stable.
from hdt-java.
I would recommend using the CPP version though, it's more efficient, faster and more stable.
I got the impression that the CPP version was outdated? You said "You probably need to generate the HDT file with java too; both libraries might no longer be compatible."
Plus, looks like the CPP version doesn't compile?
from hdt-java.
The CPP version definitely has more development going on at the moment, that's why it probably diverted from the java version. I haven't checked, so the HDT's could still be compatible. It does compile for most people AFAIK.
Back to your issue, I honestly don't know. I wasn't involved in the development. But I can imagine that large datasets are a problem for the current implementation, that's why https://github.com/rdfhdt/hdt-mr came about. I could also be an issue with your Garbage Collection configuration.
from hdt-java.
I haven't checked, so the HDT's could still be compatible
regardless if a .hdt
was created with C++ or Java, shouldn't it be the exact same format?
from hdt-java.
in theory yes, in practice not that easy without dedicated development teams :/
from hdt-java.
Sorry I didn't want to imply that you are not doing enough, I just thought that HDT was more or less a well defined standard and for this reason shouldn't change.
from hdt-java.
No worries, you didn't imply anything. I just wanted to explain we want the same thing, but are dependent on charity :)
from hdt-java.
Related Issues (20)
- OPS indexing HOT 3
- Dependency org.apache.commons:commons-compress, leading to CVE problem HOT 1
- Increment HDT and index version to 3.0.0 HOT 6
- Release version 2.2 HOT 2
- Setup Github Action for Maven release HOT 18
- IllegalFormatException or IllegalArgumentException while reading RDF with B-Nodes in two-pass mode
- the ByteStringUtil.longestCommonPrefix(...) method isn't working between non ascii String and internal CharSequence
- Support query of multiple HDT files from CLI HOT 3
- Can't use Big version of the sequence HOT 1
- Fuseki integration seems broken since 3.0.0 HOT 4
- Byte strings aren't able to compare UTF32 strings
- Unsafe memory access HOT 2
- System.out.println() output HOT 5
- Question about Bitmap Triples Iterator ZFOQ implementation HOT 5
- Required array length 2147483639 + 11 is too large HOT 4
- Filtering issue HOT 3
- Problem loading wikidata: java.lang.OutOfMemoryError: Requested array size exceeds VM limit HOT 6
- Dependency resolution of hdt-java-core 3.0.5 fails HOT 6
- Using quad support with Jena/Fuseki
- W3C SPARQL 1.0 i18n normalization-02 test case fails HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hdt-java.