Comments (7)
Closing upon successful completion (maciejkula/spotlight#62).
from goodbooks-10k.
Sure thing! What do you need from me?
from goodbooks-10k.
I would write all the code on my end, but I was wondering if you'd be up to do some more data packaging.
For bandwidth and speed of loading, I use h5py files for my datasets. Would you be willing to re-package the data in that format, and maybe include them as Github release files? (Something along the lines of the downloads here).
from goodbooks-10k.
That's a good idea, I will make a release. Please send me some example code for packing CSV into HDF5.
from goodbooks-10k.
Awesome!
I have some code here (warning, not end-user code, might be fairly horrible). It basically boils down to stuffing numpy arrays into the file with the appropriate names.
from goodbooks-10k.
I made a release: https://github.com/zygmuntz/goodbooks-10k/releases/tag/v1.0
As regards HDF5, I created a file with integer-only data: ratings, to_read and book_tags. Books and tags contain numbers mixed with strings so I skipped them.
from goodbooks-10k.
Great, I'll have a look soon and let you know how I get on.
from goodbooks-10k.
Related Issues (9)
- ISBN "0679781587" doesn't appear in BX-BOOK.csv
- Preferred citation style HOT 5
- Is the books_xml.tar.gz corrupt? - I get a data stream error with gzip HOT 1
- Rating timestamp? HOT 1
- Error in ratingmat[current_user, ] : subscript out of bounds HOT 4
- Please Update the dataset HOT 3
- ISBN-13 wrong
- Scripts for the dataset
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from goodbooks-10k.