Comments (24)
Sounds good! Thank you!
from datasets.
That's great @tabshaikh, thank you!
Yes, I think we should have both using "heavy" configuration, but you can start with just one.
I'll assign the issue to you as soon as you accept the collaborator invite!
As this is a good first issue, I would like to take this up.
There are two versions of this dataset: 32x32 images and 64x64 images. Should both be done?
@rsepassi Invite accepted. I should have a PR soon :)
@rsepassi I would love to collaborate with @tabshaikh and make a pull request for Downsampled ImageNet. Please assign it to me also.
Hi @anupam-tripathi, thanks for your interest! Are you already working directly with @tabshaikh? If not, let's give him a chance to get a PR in. If he'd like the help, then please do work together to get something in!
No, I have not joined him yet, but I will contact him directly.
@anupam-tripathi I would love to collaborate with you, but the PR is almost done, with only a few changes left to make. Hopefully I have added the dataset correctly.
@rsepassi I will open the PR by the middle of next week, as I will be at an ML hackathon over the weekend. I also have some questions, which I will ask in the draft PR.
@anupam-tripathi Let us collaborate on adding a big dataset instead. It would be great to have a teammate for that.
Ya, surely I will prove to be a good teammate.
@rodrigob @rsepassi The link http://image-net.org/small/download.php does not contain the whole downsampled ImageNet dataset, nor does it contain the labels.
I also found https://patrykchrabaszcz.github.io/Imagenet32/, which has the dataset details, and http://image-net.org/download-images, which contains the whole dataset.
I could not work out which ImageNet dev kit contains the labels mentioned at https://patrykchrabaszcz.github.io/Imagenet32/.
Also, the data requires a login and comes as pickle files, each of which unpickles into a dictionary.
Can you help me with how to proceed? :)
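For reference, reading a pickled batch like that could look roughly like the sketch below. This is only an illustration under assumptions: the key names `data` and `labels`, and the channel-first layout of each flattened row (all red values, then green, then blue), are guesses about the file format rather than something confirmed in this thread, and `load_batch` and `row_to_pixels` are hypothetical helper names. It uses a tiny synthetic 2x2 "image" instead of a real download.

```python
import io
import pickle


def load_batch(fileobj):
    """Unpickle one batch and return (rows, labels)."""
    # Only unpickle files that come from a source you trust.
    batch = pickle.load(fileobj)
    # ASSUMPTION: the dictionary uses the keys "data" and "labels".
    return batch["data"], batch["labels"]


def row_to_pixels(row, size):
    """Turn one flat row into a size x size grid of (r, g, b) tuples.

    ASSUMPTION: the row is channel-first, i.e. the full red plane,
    then the full green plane, then the full blue plane.
    """
    plane = size * size
    if len(row) != 3 * plane:
        raise ValueError(f"expected {3 * plane} values, got {len(row)}")
    red, green, blue = row[:plane], row[plane:2 * plane], row[2 * plane:]
    return [
        [(red[y * size + x], green[y * size + x], blue[y * size + x])
         for x in range(size)]
        for y in range(size)
    ]


# Synthetic stand-in for a real batch file: one 2x2 image with label 7.
fake_batch = {"data": [list(range(12))], "labels": [7]}
buffer = io.BytesIO(pickle.dumps(fake_batch))

rows, labels = load_batch(buffer)
image = row_to_pixels(rows[0], size=2)
print(labels[0], image[0][0], image[1][1])
```

With real 32x32 data the same idea applies with `size=32` and rows of 3072 values; in practice a NumPy reshape and transpose would do this far more efficiently than nested lists.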
@tabshaikh - Yes, this dataset has only a subset of the subsampled ImageNet images and does NOT have labels.
This is on purpose: it was used for autoregressive models that generate the output images rather than predict the class.
Please download from the official link rather than from unofficial mirrors.
@cyfra okay cool
The idea of this ticket was to create a smaller version of imagenet that is small enough so that most people can prototype and experiment without having to worry about download time or disk-space.
I would suggest going for the 32x32 and 64x64 versions, ideally with labels so that supervised training (à la MNIST and CIFAR) can also be used.
@rodrigob okay thanks :)
Any thoughts on doing the 128x128 version while you're at it?
Joel, looks like there are only 2 versions listed, 32 and 64: http://image-net.org/small/download.php
@joel-shor Do you have a link to 128x128?
@rsepassi No, there are actually 4 versions (8x8, 16x16, 32x32, 64x64), available at http://image-net.org/download-images. The link you pointed to is incomplete, as it has no labels.
@joel-shor Can you point me to the 128x128 version? :)
Sorry for the delay. The 128x128 imagenet, which is used in a number of state-of-the-art GANs (such as Self Attention GAN), can be found here: https://github.com/openai/improved-gan/blob/master/imagenet/convert_imagenet_to_records.py
If you were able to turn this into a TFDS dataset, you would be a hero!
@tabshaikh Have you moved on from this?
This has been added in #613. Closing this now.