Comments (3)
Is there any update on this issue? Will there be some sort of multiprocess support in the future?
If not, will there be some guides on how to utilize multiprocessing in the best way using this package?
from datasets.
The current plan is to let the user select the shards they want with: #121
Not sure when I'll have time to implement this in a close future though.
from datasets.
For update, the Split API has been updated to be more performant and only read required shards, rather than the whole dataset.
This should significantly increase performances.
For automatic multiprocessing, I'm closing this issue in favor of #1426
from datasets.
Related Issues (20)
- [data request] smallnorb HOT 2
- Multi-threaded compression? HOT 1
- checksum updated
- Exception ignored in: <function AtomicFunction.__del__ at 0x71926a728940> HOT 13
- canot load EMNIST dataset HOT 8
- HTTP Error 301 HOT 1
- Example serializer doesn't properly raise exception HOT 2
- [data request] <emnist>
- Error when processing speech_commands dataset HOT 1
- [data request] <poker>
- tfds failed to load open-x-embodiement dataset HOT 2
- Cannot build hugging face datasets
- [data request] figshare brain tumor dataset HOT 3
- NonMatchingChecksumError while loading the dataset plant_leaves HOT 3
- Invalid UTF8 bytes in default TAGS.txt
- [data request] malaria HOT 2
- Installation issues HOT 1
- NotImplementedError: While importing/Loading tfds plant_leaves dataset HOT 2
- RecursionError using tfds.load to import tensorflow-dataset (Mac) HOT 2
- image classification HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from datasets.