Comments (6)
Hi @lucasjinreal,
I appreciate your interest in our work. The videos used to create VCG+112K are the same as those used in Video-ChatGPT.
Alternatively, you can download these videos from the HuggingFace page of our video annotation pipeline.
Please let me know if you face any issues. Good luck, and thank you!
from videogpt-plus.
Hi. I saw that the Google link you provided is about 160GB, which is very big. Do you have a split version which can be used directly for training?
from videogpt-plus.
Does activity_net.video.tar.gz contain the videos used for training?
from videogpt-plus.
Hi @lucasjinreal,
Thank you for your interest in our work. Please note that the videos provided through the Google Drive link and the videos in activity_net.video.tar.gz are the same; however, activity_net.video.tar.gz is the compressed version.
In short, you can use activity_net.video.tar.gz. Thank you, and good luck!
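For anyone unpacking the archive mentioned above, here is a minimal Python sketch of one way to extract it and list the video files it contains. The internal layout of activity_net.video.tar.gz is not described in this thread, so the function name `extract_videos`, the destination directory, and the set of video extensions are all assumptions for illustration, not part of the VideoGPT+ code.

```python
import tarfile
from pathlib import Path

# Assumed set of video extensions; adjust to whatever the archive actually holds.
VIDEO_EXTENSIONS = {".mp4", ".mkv", ".webm", ".avi"}


def extract_videos(archive_path, dest_dir):
    """Extract a .tar.gz archive and return the sorted paths of its video files.

    Hypothetical helper: the archive layout is unknown, so the destination
    directory is searched recursively for files with a video extension.
    """
    dest = Path(dest_dir).resolve()
    dest.mkdir(parents=True, exist_ok=True)

    with tarfile.open(archive_path, "r:gz") as tar:
        members = tar.getmembers()
        # Refuse any member whose path would resolve outside the destination.
        for member in members:
            target = (dest / member.name).resolve()
            if target != dest and dest not in target.parents:
                raise ValueError(f"Unsafe path in archive: {member.name}")
        tar.extractall(dest, members=members)

    return sorted(
        p for p in dest.rglob("*")
        if p.is_file() and p.suffix.lower() in VIDEO_EXTENSIONS
    )
```

A call such as `extract_videos("activity_net.video.tar.gz", "data/activity_net")` would then return the list of extracted video paths; the destination path here is only an example.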
from videogpt-plus.
@mmaaz60 Hi, are the videos identical once uncompressed? Or have the videos themselves been compressed, for example resized?
from videogpt-plus.
Hi @lucasjinreal,
My apologies, I missed your message. Yes, the videos in activity_net.video.tar.gz are stored at a lower resolution and a lower frame rate. However, in our experiments, this did not affect the final performance.
If you want to use the original videos, you can download them using the Google Drive link I shared here. Good luck, and thank you.
from videogpt-plus.