Comments (5)
Hello,
Thanks for your note:
-
Now you should see requirements.txt inside code/ folder. It was due to
.gitignore
that all the text files were ignored automatically in git push -
We are still trying to improve our model as we speak. If you observe, the pre-trained model performs very well on the training and validation set (i.e. above 0.9 f1-score) but not so much good on test set or any other arbitrary audio file. This shows that the pre-trained model is overfitting on the algorithms that were used to produce fake voices in train and validation set.
We would like to encourage other users to improve upon this model by performing hyperparam search to get a more generalized model. A couple of things to try are:
a) Try to use regularization techniques
b) Try to add more audio data to train and possibly validation set
c) Try different model architectures (i.e. using conv1d on raw audio instead of spectrograms, or using LSTMs etc.)
d) Analyze the pre-trained model on different audio clips to find patterns in the spectrograms where the model seems to overfit.
c) The classes are highly imbalanced in the original dataset: 90% fake and 10% real audio clips. So, the model generally has more tendency to predict audio signals to be fake. I took some steps to overcome these by augmenting the data by using nlpaug
library (see in utils.py
) but it can be further improved by adding more dataset.
You should not get the bash: scp_spectrogram.sh: No such file or directory
error anymore since I removed that line. It was only there for some remote debugging processes
You can try pre-trained model without Atlas and that doesn't change anything.
Please see my suggestions in order to get a better model and let me know your thoughts.
Thanks!
from fake-voice-detection.
Hi Team,
I am little skeptical about whether I can run the code on Windows 10 PRO version or not.
Please share your thoughts.
Thanks,
Kinjal
from fake-voice-detection.
Hi Kinjal, thanks for your question. Yes you should be able to run the code on Windows 10 but you might need to modify some terminal commands as the windows commands are little bit different than Mac and Linux. But the code itself should be platform independent, if you run into problems, please let us know what error you are getting.
from fake-voice-detection.
from fake-voice-detection.
Hello,
I tried to install the requirements.txt file but shows an error in the line " import numpy as np" saying no module found as "numpy" .
Kindly help me resolve this issue.
Regards,
Harshit
from fake-voice-detection.
Related Issues (14)
- All real/authentic audios in real subfolder are classified as 'fake' with the pre-trained model HOT 7
- Inference on pre-trained model HOT 1
- Is there any way to run the model on google colab?? HOT 1
- Importing model using Tf and Keras Error HOT 1
- Training my own dataset HOT 3
- Should this still work? HOT 1
- The link in download_ data.sh has expired. Could you please update it? HOT 2
- Process getting Killed HOT 5
- NVIDIA or Intel GPU, which one is applicable to run the code ? HOT 1
- Getting error while starting atlas server HOT 4
- Error in requirements.txt Packages HOT 1
- Can not able to run "bash download_data.sh" in Windows 10 Pro version HOT 1
- What's the maximum length of the audio array? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fake-voice-detection.