Giter VIP home page Giter VIP logo

mrcmv's Introduction

HoK400 and CFM400

The Datasets Consisting of PROFESSIONALLY Edited Short Videos Accompanied with Background Music and Voice-Overs collected from public video platforms.

1. Introduction

HoK400 and CFM400 are two datasets consisting of short videos edited by professional teams collected from public video platforms. HoK400 and CFM400 contain 427 and 401 short videos related to the globally popular games "King of Honors" (alias "Arena of Valor") and "CrossFire Mobile" respectively. Because the main components of a game video are only those stylized 3D models with some transformations and variations generated by the 3D engines, the videos are fine-grained. The HoK400 dataset is split into 265 and 162 videos for training and testing respectively, while the CFM400 dataset is split into 270 and 131 videos for training and testing respectively. The data augmentation can be implemented by randomly clipping the videos in the training datasets. As the video lengths are different, we use the beginning 32s clips of the videos in the test datasets for performance evaluation. If the video length is less than 32s, we simply repeat the video until it is longer than 32s.

The datasets can evaluate the music retrieval performance for short fine-grained videos with or without the popular voice-over modality. By releasing HoK400 and CFM400, we hope the datasets can promote the progress of the video-music retrieval task in the research community.

2. How to download

The urls of the datasets are stored in the following txt files.

  • HoK400: 'training_part_HoK400.txt' and 'test_part_HoK400.txt'

  • CFM400: 'training_part_CFM400.txt' and 'test_part_CFM400.txt'

Convenient download.

python video_download_script.py --url_list txt_file --saving_path ./saving_dir

You may install you-get before the download starts.

FFmpeg can separate the video and the audio tracks, and the source separator Spleeter [link] can separate the BGM and the voice-overs.

python separating_script --video_path ./video_dir --temporary_audio_saving_path ./temporary_audio_dir --spleeter_res_path  ./separation_dir

3. Citation

{
  author     = {Tingtian Li, Zixun Sun, Haoruo Zhang, Jin Li, Ziming Wu, Hui Zhan, Yipeng Yu, Hengcan Shi},
  title      = {Deep Music Retrieval for Fine-Grained Videos by Exploiting Cross-Modal-Encoded Voice-Overs},
  conference = {ACM SIGIR},
  year       = {2021},
}

4. The implementation of the video-music retrieval algorithm MRCMV

Codes are coming soon..

mrcmv's People

Contributors

pixel-333 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.