This repo contains a list of all video IDs in the YouTube-8M training set, along with the scripts used to produce that list.
Note: the scripts are neither clean nor robust, but I believe they work.
If you're here, you're probably just looking for the list of YouTube-8M IDs. If you really want to re-download everything, run:
python download_ids.py
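
If you only need the list, here is a minimal sketch of reading it in Python. It assumes the list is plain text with one video ID per line, which this README does not state, so adjust it to the actual file. The helper name `load_video_ids` and the sample IDs are made up for illustration.

```python
from typing import Iterable, List


def load_video_ids(lines: Iterable[str]) -> List[str]:
    """Return unique, non-empty video IDs in first-seen order.

    Assumes one ID per line (an assumption, not confirmed by this repo).
    """
    seen = set()
    ids = []
    for line in lines:
        video_id = line.strip()  # drop the newline and stray whitespace
        if video_id and video_id not in seen:
            seen.add(video_id)
            ids.append(video_id)
    return ids


# Placeholder IDs for illustration only; these are not real entries.
sample = ["abc123\n", "\n", "def456\n", "abc123\n"]
print(load_video_ids(sample))
```

Because the helper accepts any iterable of lines, an open file object can be passed to it directly, so the whole list never has to be read into a single string first.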