Comments (5)
Hi, thanks for your question! The score range for all dimensions are 0 to 1.
For samples of different scores, at different dimensions, you can refer to section G of supplementary materials: https://arxiv.org/pdf/2311.17982, where for each dimension we provided some samples at varying scores.
from vbench.
Thanks Ziqi for your helpful answer. May I ask why we have
dynamic degree as false,
while subject consistency & imaging_quality much larger than 1?
from vbench.
Hi, I assume you are asking about the scores for individual videos in the generated eval_results.json
file.
For dynamic_degree
, each video undergoes binary classification, with true
referring to dynamic
, while false
referring to static
. The final score for the dynamic_degree
dimension is defined as the percentage of videos classified as dynamic
.
For other dimensions that have values larger than 1, it could be due to these two reasons: (1) The individual videos' raw score is in the range of 0-100. (2) The individual video's raw score hasn't been divided by the frame count yet.
We retain these raw scores for individual videos in case users need them for debugging. However, you should refer to the final aggregated score for each dimension to assess the model's performance in that particular dimension.
from vbench.
Hi, I assume you are asking about the scores for individual videos in the generated
eval_results.json
file.For
dynamic_degree
, each video undergoes binary classification, withtrue
referring todynamic
, whilefalse
referring tostatic
. The final score for thedynamic_degree
dimension is defined as the percentage of videos classified asdynamic
.For other dimensions that have values larger than 1, it could be due to these two reasons: (1) The individual videos' raw score is in the range of 0-100. (2) The individual video's raw score hasn't been divided by the frame count yet.
We retain these raw scores for individual videos in case users need them for debugging. However, you should refer to the final aggregated score for each dimension to assess the model's performance in that particular dimension.
how can I get the original classification probability of dynamic_degree?
from vbench.
Hi, it's not probability-based classification, but based on threshold.
from vbench.
Related Issues (20)
- LeaderBoard sampled videos
- run inference on customized video and prompt? HOT 2
- Request for detailed instructions for submitting evaluation results to the leaderboard HOT 13
- the score meaning HOT 2
- AssertionError: dimensions : {'background_consistency'} not supported for custom input HOT 1
- Code for generating Figure 6 HOT 2
- Difference between "i2v_subject" and "subject_consistency" dimensions HOT 6
- KeyError: 'imaging_quality_preprocessing_mode' HOT 4
- Division by 0 when directory only contains a single video HOT 1
- Pika, Gen2 latest results HOT 4
- When evaluate aesthetic_quality, 'CUDA out of memory' always occur for multiple videos HOT 1
- failed to get the output result HOT 1
- Vbench Leaderboard HOT 2
- Where is the VLM-tuning model (VideoChat finetuned using human annotation)? HOT 2
- Great works. A question about detailed inference settings for animatediffv2. HOT 9
- How to calculate total score? HOT 1
- Can we withdraw the results in leaderboard? HOT 5
- How to solve the problem "av.error.FileNotFoundError: [Errno 2] No such file or directory” HOT 4
- unzip missing HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vbench.