ai-dataloader's People

Contributors

azhicham, dependabot[bot], tudyx

Forkers

azhicham

ai-dataloader's Issues

Error handling

Hello,

It would be really nice to add error handling while fetching, e.g. for a Dataset that reads its data from file I/O or HTTP.
Currently we can only unwrap/expect.
One solution could be to allow fetch to return a Result, and then collate only if all the Results are Ok.
The iterator would then return a Result<Batch, Self::E>.
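
The idea above can be sketched in plain Rust. This is only an illustration of the proposal, not ai-dataloader's API: the `TryDataset` trait, `try_fetch_batch`, `try_batches` and the toy `Lines` dataset are all hypothetical names introduced here. The sketch relies on the standard-library behaviour that collecting an iterator of `Result` into a `Result<Vec<_>, _>` stops at the first `Err`.

```rust
use std::fmt::Debug;

/// Hypothetical fallible dataset: fetching a sample may fail.
/// (Assumption for illustration, not a trait from ai-dataloader.)
pub trait TryDataset {
    type Sample;
    type Error: Debug;

    fn try_get(&self, index: usize) -> Result<Self::Sample, Self::Error>;
}

/// Fetch every index of one batch, returning the first error encountered.
pub fn try_fetch_batch<D: TryDataset>(
    dataset: &D,
    indices: &[usize],
) -> Result<Vec<D::Sample>, D::Error> {
    // Collecting an iterator of `Result` short-circuits on the first `Err`.
    indices.iter().map(|&i| dataset.try_get(i)).collect()
}

/// Iterate over batches; each item is `Ok(batch)` or the error that broke it.
/// `batch_size` must be non-zero, otherwise `chunks` panics.
pub fn try_batches<'a, D: TryDataset>(
    dataset: &'a D,
    indices: &'a [usize],
    batch_size: usize,
) -> impl Iterator<Item = Result<Vec<D::Sample>, D::Error>> + 'a {
    indices
        .chunks(batch_size)
        .map(move |chunk| try_fetch_batch(dataset, chunk))
}

/// Toy dataset standing in for an IO-backed one: parsing a line can fail.
pub struct Lines(pub Vec<&'static str>);

impl TryDataset for Lines {
    type Sample = i32;
    type Error = std::num::ParseIntError;

    fn try_get(&self, index: usize) -> Result<i32, Self::Error> {
        self.0[index].trim().parse::<i32>()
    }
}
```

With this shape, the caller decides per batch whether to skip, retry or abort, instead of the fetch panicking inside the loader.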

Python wrapper

Hello,

I was wondering if we could create a Python wrapper for this library using PyO3/maturin and ship it on PyPI.
This way Python users could also benefit from this crate, i.e. no more process forking but real multithreading.

Add proper benchmark

Right now we have benchmarks that compare ai-dataloader with PyTorch, but none that compare ai-dataloader with itself across changes.
It would be very useful to have some, so that choices can be based on real data. For instance, it would help with #17, #13 and #1.

There are some benchmarks using the criterion library but, at least on my computer, they are too noisy to be useful. It would be worth improving them or trying a different approach.

The rustls benchmarks seem to be of high quality and could be a good source of inspiration.

Pin memory for tch-rs

Hello,
I think it would be great to pass an option (pin_memory) to the dataloader to enable pinned memory in the TorchCollate case. WDYT?

Multi-threaded dataloading with tch-rs

Hello,

Thank you for your awesome work!
As you may know, there is no dataloader feature in tch-rs. It would be really cool to have one through ai-dataloader, and maybe multi-threading support in a later step.

Thank you :)

Opt-in feature

In order to reduce compile time and complexity, it would be nice to make tch-rs and ndarray optional features.
WDYT?

How to create a custom dataset?

I was wondering how to build a DataLoader-usable dataset whose samples consist of multiple tch::Tensor. The error I encountered was that Collate< (Tensor), (Tensor), (Tensor), (Tensor) > is not implemented for DefaultCollate.
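
One possible direction is a custom collate function for the tuple sample type. The sketch below is self-contained and deliberately does not use ai-dataloader or tch: the `Collate` trait here is a simplified, hypothetical stand-in for the crate's trait, `TupleCollate` is an invented name, and `Vec<f32>` stands in for `tch::Tensor`. It only shows the general technique of turning a batch of 4-tuples into four per-field columns. With real tensors, each column would presumably then be stacked into a single tensor.

```rust
/// Hypothetical, simplified collate trait (an assumption for illustration;
/// the real trait in ai-dataloader may differ).
pub trait Collate<T> {
    type Output;
    fn collate(&self, batch: Vec<T>) -> Self::Output;
}

/// Stand-in for `tch::Tensor`, so the example has no external dependency.
pub type Sample = Vec<f32>;

/// Custom collate function for samples made of four fields.
pub struct TupleCollate;

impl Collate<(Sample, Sample, Sample, Sample)> for TupleCollate {
    // One column per tuple field, each holding that field for every sample.
    type Output = (Vec<Sample>, Vec<Sample>, Vec<Sample>, Vec<Sample>);

    fn collate(&self, batch: Vec<(Sample, Sample, Sample, Sample)>) -> Self::Output {
        let n = batch.len();
        let mut out: Self::Output = (
            Vec::with_capacity(n),
            Vec::with_capacity(n),
            Vec::with_capacity(n),
            Vec::with_capacity(n),
        );
        // "Unzip" the batch: a row of fields becomes an entry in each column.
        for (a, b, c, d) in batch {
            out.0.push(a);
            out.1.push(b);
            out.2.push(c);
            out.3.push(d);
        }
        out
    }
}
```

Calling `TupleCollate.collate(batch)` on a `Vec` of two samples yields four columns of length two, in the original sample order.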
