Comments (8)
100% agree, tar.xz is perfect!
from gtfs-bench.
The output compression format (zip, tar.bz2 or tar.xz) can be an option in the user interface. If the output is just one format, it should be zip, as it's the most widely used and compatible.
What do you think @dachafra?
On the other hand, pugz only handles gzip, and gzip and zip are different formats. So pugz is discarded and I have to look for an alternative...
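To illustrate why a gzip-only tool cannot produce zip output, here is a minimal sketch using only Python's standard library (Python and the sample payload are my choices for illustration, nothing in the project depends on them). It compresses the same bytes both ways and looks at the leading magic bytes: gzip is a single compressed stream, while zip is an archive container with its own per-file headers.

```python
import gzip
import io
import zipfile

# Made-up sample data, only for illustration.
payload = b"stop_id,stop_name\n1,Central\n" * 100

# gzip: a single compressed stream that starts with the magic bytes 1f 8b.
gz_bytes = gzip.compress(payload)

# zip: an archive container; each member gets a local file header
# that starts with the signature "PK\x03\x04".
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w", compression=zipfile.ZIP_DEFLATED) as zf:
    zf.writestr("stops.txt", payload)
zip_bytes = buf.getvalue()

print(gz_bytes[:2].hex())
print(zip_bytes[:4])
```

Both formats commonly use DEFLATE for the compressed data, but the framing around it differs, so a parallel gzip tool does not help when the required output is a zip file.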
Honestly, I wouldn't delegate the choice of compression format to the user.
Let's prioritize how fast the resources can be obtained over the compression format.
btw, I found this: https://github.com/dcwatson/deflate
btw, I found this: https://github.com/dcwatson/deflate
But that only covers the gzip and zip compression algorithm. The interesting question is whether there is a parallel implementation, and for the moment it seems there is none.
We can move to tar.xz, which is compatible with 7-zip on Windows. It is the best option in speed, size and compatibility.
Some numbers on a machine with 24 computing threads.
Using zip [compression level 9]:
real 5m58.036s
user 5m55.845s
sys 0m2.000s
Using tar + pxz [compression level 1]:
real 0m30.468s
user 8m39.010s
sys 0m18.165s
For a similar output size:
-rw-r--r-- 1 root root 611M Jun 14 11:25 result.tar.xz
-rw-r--r-- 1 root root 652M Jun 14 11:15 result.zip
Better compression ratios can be achieved, but I think the problem is not the size but the compression time.
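For anyone who wants to repeat this kind of comparison on their own data, here is a minimal sketch using only Python's standard library. It is not how the numbers above were produced: those came from the zip and tar + pxz command-line tools, and Python's lzma support is single-threaded, so this will not reproduce the parallel speedup of pxz. The function names (`pack_zip`, `pack_tar_xz`, `timed`) are hypothetical helpers introduced for illustration.

```python
import tarfile
import time
import zipfile
from pathlib import Path


def pack_zip(src_dir: Path, out_file: Path, level: int = 9) -> None:
    """Write every file under src_dir into a deflate-compressed zip archive."""
    with zipfile.ZipFile(out_file, "w", compression=zipfile.ZIP_DEFLATED,
                         compresslevel=level) as zf:
        for path in sorted(src_dir.rglob("*")):
            if path.is_file():
                # Store paths relative to src_dir inside the archive.
                zf.write(path, str(path.relative_to(src_dir)))


def pack_tar_xz(src_dir: Path, out_file: Path, preset: int = 1) -> None:
    """Write src_dir into a tar archive compressed with xz at the given preset."""
    with tarfile.open(out_file, "w:xz", preset=preset) as tf:
        # Members are stored under the directory's own name.
        tf.add(src_dir, arcname=src_dir.name)


def timed(func, *args) -> float:
    """Run func(*args) and return the elapsed wall-clock time in seconds."""
    start = time.perf_counter()
    func(*args)
    return time.perf_counter() - start
```

Something like `timed(pack_tar_xz, Path("result"), Path("result.tar.xz"))` against `timed(pack_zip, Path("result"), Path("result.zip"))` gives a rough single-threaded comparison (the `result` directory name is an assumption). The output files should be written outside the source directory so they are not packed into themselves.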
No problem with the size! Awesome results, perfect :-D
tested and working! closing issue :-D