Giter VIP home page Giter VIP logo

codecfake's Introduction

The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio

ALM-based deepfake audio

This is the official repo of our work titled The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio, which was available on arxiv at https://arxiv.org/abs/2405.04880.

πŸ“š Codecfake Dataset

Due to platform restrictions on the size of zenodo repositories, we have divided the Codecfake dataset into various subsets as shown in the table below:

Codecfake dataset Description Link
training set (part 1 of 3) & label train_split.zip & train_split.z01 - train_split.z06 https://zenodo.org/records/11171708
training set (part 2 of 3) train_split.z07 - train_split.z14 https://zenodo.org/records/11171720
training set (part 3 of 3) train_split.z15 - train_split.z19 https://zenodo.org/records/11171724
development set dev_split.zip & dev_split.z01 - dev_split.z02 https://zenodo.org/records/11169872
test set (part 1 of 2) Codec test: C1.zip - C6.cip & ALM test: A1.zip - A3.zip https://zenodo.org/records/11169781
test set (part 2 of 2) Codec unseen test: C7.zip https://zenodo.org/records/11125029

πŸ›‘οΈ Countermeasure

1. Data prepraring

Upon downloading the Codecfake datasets, please arrange them in accordance with the directory structure outlined below:

β”œβ”€β”€ Codecfake
β”‚   β”œβ”€β”€ label
β”‚   β”‚   └── *.txt
β”‚   β”œβ”€β”€ train
β”‚   β”‚   └── *.wav (740,747 samples)
β”‚   β”œβ”€β”€ dev
β”‚   β”‚   └── *.wav (92,596 samples)
β”‚   β”œβ”€β”€ test
β”‚   β”‚   └── C1
β”‚   β”‚   β”‚   └── *.wav (26,456 samples)
β”‚   β”‚   └── C2
β”‚   β”‚   β”‚   └── *.wav (26,456 samples)
β”‚   β”‚   └── C3
β”‚   β”‚   β”‚   └── *.wav (26,456 samples)
β”‚   β”‚   └── C4
β”‚   β”‚   β”‚   └── *.wav (26,456 samples)
β”‚   β”‚   └── C5
β”‚   β”‚   β”‚   └── *.wav (26,456 samples)
β”‚   β”‚   └── C6
β”‚   β”‚   β”‚   └── *.wav (26,456 samples)
β”‚   β”‚   └── C7
β”‚   β”‚   β”‚   └── *.wav (145,505 samples)
β”‚   β”‚   └── A1
β”‚   β”‚   β”‚   └── *.wav (8,902 samplesοΌ‰
β”‚   β”‚   └── A2
β”‚   β”‚   β”‚   └── *.wav (8,902 samplesοΌ‰
β”‚   β”‚   └── A3
β”‚   β”‚   β”‚   └── *.wav (99,112 samplesοΌ‰

If you want to co-training with ASVspoof2019, Please download the training, development and evaluation set from ASVspoof2019 LA Database first.

2. Offline Feature Extraction

python preprocess.py 

Please ensure the data and label position are correct. If you need to adjust, please modify in ./preprocess.py.

After preprocess, the hidden states of wav2vec2 will be saved in /data2/xyk/codecfake/preprocess_xls-r-5.

3. Train

Training on different task:

python main_train.py -t 19LA 
python main_train.py -t codecfake
python main_train.py -t co-train 

Before running the main_train.py, please change the path_to_features according to the location of pre-processed featrue on your machine.

If training is slow, consider adjusting the num_worker parameter in conjunction with the number of CPU cores. The default is set to 8. If performance remains slow, you may explore multi-GPU training in args.

4. Test

Testing on different datasets was performed using the best pre-trained model, which is saved in ./models/try/anti-spoofing_feat_model.pt:

python generate_score.py -t 19LA
python generate_score.py -t ITW
python generate_score.py -t codecfake

The result will be saved in ./result.

python evaluate_score.py 

You will get the final test EER.

5. Pre-trained model

We provide pre-trained models and score results as mentioned in our paper, you can use our pre-trained models testing on other condition. The inference step can refer section 4.

Vocoder-trained ADD models:

./pretrained_model/vocoder_mellcnn/anti-spoofing_feat_model.pt
./pretrained_model/vocoder_w2v2lcnn/anti-spoofing_feat_model.pt
./pretrained_model/vocoder_w2v2aasist/anti-spoofing_feat_model.pt

Codec-trained ADD models:

./pretrained_model/codec_mellcnn/anti-spoofing_feat_model.pt
./pretrained_model/codec_w2v2lcnn/anti-spoofing_feat_model.pt
./pretrained_model/codec_w2v2aasist/anti-spoofing_feat_model.pt

Co-trained ADD model:

./pretrained_model/cotrain_w2v2aasist/anti-spoofing_feat_model.pt

πŸ“ Citation

If you find our dataset or countermeasure is useful to your research, please cite it as follows:

@article{xie2024codecfake,
  title={The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio},
  author={Xie, Yuankun and Lu, Yi and Fu, Ruibo and Wen, Zhengqi and Wang, Zhiyong and Tao, Jianhua and Qi, Xin and Wang, Xiaopeng and Liu, Yukun and Cheng, Haonan and others},
  journal={arXiv preprint arXiv:2405.04880},
  year={2024}
}

codecfake's People

Contributors

xieyuankun avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.