Comments (4)
Hi there, checking in here: is there any update on making the data files available in an S3 bucket? I'd really appreciate it, especially for the 1e9 case, which seems to be problematic to generate (see #110).
Thank you
cc: @jangorecki
from db-benchmark.
OK @jangorecki, will do. Thanks for your great contributions to this project.
We could also make the 50 GB file accessible in S3 as multiple gzipped files that users could download and reassemble on their local machines. That'd let users download the parts in parallel from S3 and avoid the massive-file problem. Thoughts @jangorecki / @ncclementi?
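To make the split-and-reassemble idea concrete, here is a minimal sketch of what it might look like locally. The function names, the `.partNNNNN.gz` naming scheme, and the chunk size are all assumptions made for illustration, not anything the project ships; the S3 upload and the parallel download are left out entirely.

```python
import gzip
import shutil
from pathlib import Path


def split_to_gzip_parts(src: Path, out_dir: Path, chunk_bytes: int) -> list[Path]:
    """Split src into fixed-size chunks and gzip each chunk into its own file.

    Hypothetical helper: the part naming scheme is an assumption.
    Note that each chunk is held in memory while it is being written.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    parts = []
    with src.open("rb") as f:
        index = 0
        while True:
            chunk = f.read(chunk_bytes)
            if not chunk:
                break
            # Zero-padded index so that sorting by name restores the order.
            part = out_dir / f"{src.name}.part{index:05d}.gz"
            with gzip.open(part, "wb") as gz:
                gz.write(chunk)
            parts.append(part)
            index += 1
    return parts


def reassemble(parts: list[Path], dest: Path) -> None:
    """Decompress the parts in name order and append them to dest."""
    with dest.open("wb") as out:
        for part in sorted(parts):
            with gzip.open(part, "rb") as gz:
                shutil.copyfileobj(gz, out)
```

Because every part is an independent gzip file, the parts could be fetched in any order (or concurrently) and only need to be sorted by name before reassembly.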
Hi, you need to contact h2o support. I am no longer a maintainer of the project.