ejlb / google-open-image-download Goto Github PK
View Code? Open in Web Editor NEWA parallel download util for Google's open image dataset
A parallel download util for Google's open image dataset
Thanks for the fix; I see what you did with the UTF-8 chars. However, the new code gives me the following error: AttributeError: 'dict' object has no attribute 'iteritems'. See details below.
python download.py --timeout 10 --sub-dirs 100 --min-dim -1 "S:\Google OpenImages\images_2016_08\train\images.csv" "S:\Google OpenImages\images_2016_08\train\images"
8044 @ 2016-10-06 09:11:05,259 (266) download - DEBUG - Namespace(consumers=5, force=False, input='S:\Google OpenImages\images_2016_08\train\images.csv', min_dim=-1, output='S:\Google OpenImages\images_2016_08\train\images', queue_size=1000, sub_dirs=100, timeout=10.0)
Process Process-1:
Traceback (most recent call last):
File "C:\Python35\lib\multiprocessing\process.py", line 249, in _bootstrap
self.run()
File "C:\Python35\lib\multiprocessing\process.py", line 93, in run
self._target(_self._args, *_self._kwargs)
File "S:\Google OpenImages\download\download.py", line 145, in producer
for row in unicode_dict_reader(f):
File "S:\Google OpenImages\download\download.py", line 58, in unicode_dict_reader
yield {key: unicode(value, 'utf-8') for key, value in row.iteritems()}
AttributeError: 'dict' object has no attribute 'iteritems'
Thanks for this script! I am running it on Windows and it seems to dump out when it hits some of the foreign characters in the description (?) column. Consequently, I am only able to download about 90 image files. Any way to make it decode those characters? Or this there another issue?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.