Comments (9)
Hi, thank you for raising this. The current dataset on GCP should now have the correct shapes and number of samples. Could you check that this is the case? If you are using examples/run.py, you will need to remove your local dataset copy (stored by default in /tmp/CLRS30) to force a download.
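Removing the local copy can be sketched as follows. This is a minimal helper, not part of the library: the function name `clear_dataset_cache` is hypothetical, and the default path is simply the /tmp/CLRS30 location mentioned above.

```python
import shutil
from pathlib import Path


def clear_dataset_cache(folder="/tmp/CLRS30"):
    """Delete the local dataset copy so the next run has to download it again.

    Hypothetical helper: `folder` defaults to the path quoted in this thread.
    Returns True if a folder was removed, False if there was nothing to remove.
    """
    path = Path(folder)
    if not path.exists():
        return False
    shutil.rmtree(path)  # removes the folder and everything inside it
    return True
```

Calling `clear_dataset_cache()` once before rerunning the example script should be enough, assuming the script still uses the default dataset location.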
Thanks for your reply. The issue still seems to exist. I double-checked, and only 32 data points are loaded into the validation/test datasets (take the minimum algorithm, for instance). The content of dataset_info.json also suggests that there are only 32 data points in the dataset.
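A check like this could be scripted roughly as below. It is a sketch under an assumption about the file layout: that dataset_info.json follows the usual TensorFlow Datasets structure, with a top-level "splits" list whose entries carry "name" and "shardLengths". The function name `split_sizes` is hypothetical.

```python
import json
from pathlib import Path


def split_sizes(info_path):
    """Return {split_name: number_of_examples} read from a dataset_info.json.

    Assumption: each entry of the top-level "splits" list has a "name" and a
    "shardLengths" list of per-shard example counts (often stored as strings).
    """
    info = json.loads(Path(info_path).read_text())
    sizes = {}
    for split in info.get("splits", []):
        shard_lengths = split.get("shardLengths", [])
        sizes[split["name"]] = sum(int(n) for n in shard_lengths)
    return sizes
```

If the file uses a different layout, the keys above would need adjusting; the point is only to compare the reported split sizes with what is actually loaded.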
Thank you for following up so quickly! It was indeed a bug that had gone under the radar. It should be fixed now, and the dataset has been re-generated. Could you take a look at this new version?
Yup, looks perfect. Thanks for the fix.
Just wanted to let you know that in your run.py, only one batch (=32) of test data (and of validation data in each iteration) is being evaluated. A full batch leads to high GPU memory consumption for test data, and averaging over batches would not give accurate results (e.g., the F1 metric for masks).
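To illustrate why averaging per-batch F1 scores can differ from F1 computed over all test samples, here is a small self-contained sketch. It does not use the library's own evaluation code; the helper names and the toy batches are made up for illustration, and an F1 of 0.0 for a batch with no positives at all is an assumed convention.

```python
def counts(preds, truths):
    """Count true positives, false positives and false negatives for 0/1 masks."""
    tp = sum(1 for p, t in zip(preds, truths) if p == 1 and t == 1)
    fp = sum(1 for p, t in zip(preds, truths) if p == 1 and t == 0)
    fn = sum(1 for p, t in zip(preds, truths) if p == 0 and t == 1)
    return tp, fp, fn


def f1_from_counts(tp, fp, fn):
    """F1 = 2*TP / (2*TP + FP + FN); returns 0.0 when the denominator is zero."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def batch_averaged_f1(batches):
    """Compute F1 separately on each batch, then take the plain mean."""
    scores = [f1_from_counts(*counts(p, t)) for p, t in batches]
    return sum(scores) / len(scores)


def aggregated_f1(batches):
    """Accumulate TP/FP/FN over all batches, then compute F1 once."""
    tp = fp = fn = 0
    for preds, truths in batches:
        b_tp, b_fp, b_fn = counts(preds, truths)
        tp, fp, fn = tp + b_tp, fp + b_fp, fn + b_fn
    return f1_from_counts(tp, fp, fn)


# Toy (predictions, ground truth) batches, invented for this example.
batches = [
    ([1, 1, 1, 1], [1, 1, 1, 1]),  # TP=4, FP=0, FN=0, so F1 = 1.0
    ([1, 0, 0, 0], [0, 1, 0, 0]),  # TP=0, FP=1, FN=1, so F1 = 0.0
]

print(batch_averaged_f1(batches))  # 0.5
print(aggregated_f1(batches))      # 0.8
```

The two numbers disagree because F1 is a ratio of counts, so it has to be computed from counts accumulated over the whole test set rather than averaged batch by batch.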
Hi Sadegh, thank you so much for spotting this!
Indeed, the example script needs to be patched, and we will do this shortly after the Easter holiday.
The issue you point out about properly aggregating batches at test time is very important, especially when datasets are manually generated, and we will make sure to address it! For the specific CLRS-30 dataset we generated, though, this would only really impact tasks with graph mask outputs (e.g. segments-intersect); in all other mask output settings we do not need to go beyond 32 trajectories, and for types other than mask I think simple averaging across batches will be sufficient.
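The point about simple averaging can be checked with a small sketch, assuming the metric is a per-sample mean such as accuracy (the helper names and toy batches below are illustrative only, not the library's evaluation code). For equal-sized batches the plain mean of batch scores equals the score over all samples; for unequal batches, weighting by batch size restores the equality.

```python
def accuracy(preds, truths):
    """Fraction of positions where prediction and ground truth agree."""
    return sum(p == t for p, t in zip(preds, truths)) / len(truths)


def simple_mean_accuracy(batches):
    """Plain mean of per-batch accuracies (exact only for equal batch sizes)."""
    return sum(accuracy(p, t) for p, t in batches) / len(batches)


def weighted_mean_accuracy(batches):
    """Per-batch accuracies weighted by batch size (exact for any batch sizes)."""
    total = sum(len(t) for _, t in batches)
    return sum(accuracy(p, t) * len(t) for p, t in batches) / total


def global_accuracy(batches):
    """Accuracy over all samples, obtained by concatenating the batches."""
    preds = [x for p, _ in batches for x in p]
    truths = [x for _, t in batches for x in t]
    return accuracy(preds, truths)


# Two equal-sized toy batches of (predictions, ground truth).
equal = [([1, 0, 1, 1], [1, 1, 1, 1]), ([0, 0, 0, 0], [0, 1, 1, 1])]
# A smaller last batch, as happens when the test set is not a multiple of the batch size.
unequal = [([1, 0, 1, 1], [1, 1, 1, 1]), ([0, 0], [0, 1])]
```

With `equal`, `simple_mean_accuracy` and `global_accuracy` agree; with `unequal`, only `weighted_mean_accuracy` matches `global_accuracy`.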
Lastly, thank you very much for your general interest in the library and all the great pointers you've had! We'd be delighted to add you to our paper's acknowledgements when it is ready for release :)
Thank you for the detailed explanation!
Hi Sadegh, the issue of evaluating/testing on multiple batches should now be resolved by #70.
Please let us know if any issues remain! :)
Great! Thanks for the fix :)