SpeechGenderRecognition

This model recognizes speaker gender from real call recordings. It is a Keras implementation of a CNN and LSTM network that classifies short audio clips and uses those results to predict the label of a long recording.
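
The README does not say how the predictions on short clips are combined into one label for the long recording. A minimal sketch, assuming a plain majority vote over clips that are not noise; the function name `aggregate_call_label` and the voting rule are hypothetical and not taken from the repository:

```python
from collections import Counter


def aggregate_call_label(clip_labels, ignore=("noise",)):
    """Combine per-clip labels into one label for the long recording.

    Assumption: a simple majority vote over clips that are not noise.
    The actual project may combine clip predictions differently.
    """
    votes = Counter(label for label in clip_labels if label not in ignore)
    if not votes:
        # Every clip was noise, or there were no clips at all.
        return "noise"
    # most_common(1) returns a one-item list: [(label, count)].
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    print(aggregate_call_label(["female", "noise", "female", "male"]))
```

A vote over discrete labels is the simplest choice; averaging per-clip class probabilities would be another reasonable option if the model exposes them.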

Dependencies

  • Python 3.6+
  • Keras 2.3
  • SciPy, NumPy, pandas, pyAudioAnalysis, pydub, h5py
  • webrtcvad 2.0.10
  • scikit-learn

Data

Generate short audio clips by running:

generate_sample.py

It splits each long audio file into short clips using VAD (Voice Activity Detection). You then need to label the clips and sort them into three folders: female, male, and noise.
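
How generate_sample.py cuts the audio is not shown here. As a rough sketch of the general idea, the code below groups consecutive voiced frames into clips. The helper names, the 30 ms frame length, and the `min_frames` threshold are assumptions for illustration, and the speech detector is passed in as a callable so the sketch runs without extra packages:

```python
def split_frames(pcm, sample_rate=16000, frame_ms=30):
    """Yield fixed-size frames of 16-bit mono PCM bytes; the short tail is dropped."""
    frame_bytes = int(sample_rate * frame_ms / 1000) * 2  # 2 bytes per sample
    for start in range(0, len(pcm) - frame_bytes + 1, frame_bytes):
        yield pcm[start:start + frame_bytes]


def voiced_segments(pcm, is_speech, sample_rate=16000, frame_ms=30, min_frames=3):
    """Return one byte string per run of consecutive voiced frames.

    is_speech(frame, sample_rate) -> bool is supplied by the caller.
    Runs shorter than min_frames (a hypothetical threshold) are discarded.
    """
    segments, current = [], []
    for frame in split_frames(pcm, sample_rate, frame_ms):
        if is_speech(frame, sample_rate):
            current.append(frame)
            continue
        # A non-voiced frame closes the current run.
        if len(current) >= min_frames:
            segments.append(b"".join(current))
        current = []
    # Flush a run that reaches the end of the audio.
    if len(current) >= min_frames:
        segments.append(b"".join(current))
    return segments
```

With the webrtcvad package installed, `webrtcvad.Vad(2).is_speech` could be passed as `is_speech`; it expects 10, 20, or 30 ms frames of 16-bit mono PCM at 8, 16, 32, or 48 kHz. Each returned segment could then be written out as its own wav file for labeling.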

Training data layout:

├── ...
├── data
│   ├── long_audio          # wav files before VAD
│   ├── model               # saved model
│   └── short_audio         # wav files after VAD
│       ├── female          # wav files with label female
│       ├── male            # wav files with label male
│       └── noise           # wav files with label noise
└── ...
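
Since the label of each clip is given by the folder it sits in, the layout can be read back into (file, label) pairs. A minimal sketch, assuming the three folder names shown in the tree; the function name `list_labeled_clips` is hypothetical and not part of the repository:

```python
from pathlib import Path

LABELS = ("female", "male", "noise")  # folder names from the data layout


def list_labeled_clips(short_audio_dir):
    """Return (wav_path, label) pairs found under short_audio_dir.

    Raises FileNotFoundError if one of the three label folders is missing,
    which catches a mislabeled or incomplete data directory early.
    """
    root = Path(short_audio_dir)
    pairs = []
    for label in LABELS:
        folder = root / label
        if not folder.is_dir():
            raise FileNotFoundError(f"missing label folder: {folder}")
        # Only .wav files count as clips; other files are ignored.
        pairs.extend((path, label) for path in sorted(folder.glob("*.wav")))
    return pairs
```

Calling `list_labeled_clips("data/short_audio")` before training is a quick way to confirm that every clip has been sorted into a folder.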

Train

  • Set train=true in:
	main.py
  • On the first run it creates the feature and label files:
	x.npy y.npy label.txt
  • If you train on your own data, delete these files first so that they are regenerated.
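
Deleting the cached files can be scripted. A small sketch, assuming the three files sit in the directory that main.py runs from (the README does not say where they are written); `clear_feature_cache` is a hypothetical helper name:

```python
from pathlib import Path

CACHE_FILES = ("x.npy", "y.npy", "label.txt")  # file names listed in the README


def clear_feature_cache(directory="."):
    """Delete cached feature/label files and return the names actually removed.

    Assumption: the cache lives in `directory`; adjust if main.py writes elsewhere.
    """
    removed = []
    for name in CACHE_FILES:
        path = Path(directory) / name
        if path.is_file():
            path.unlink()
            removed.append(name)
    return removed


if __name__ == "__main__":
    print("removed:", clear_feature_cache())
```

Files that do not exist are skipped, so the helper is safe to call before every training run on new data.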

Predict

  • Set train=false and set model_path to the saved model in:
	main.py

Accuracy

gender   precision   recall
female   0.896       0.89
male     0.909       0.871
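
For reference, precision and recall follow the usual definitions from true positives, false positives, and false negatives. The sketch below states them in code and derives an F1 score from the reported figures; F1 is not reported in the README, and the function names are illustrative only:

```python
def precision_recall(tp, fp, fn):
    """Return (precision, recall) from raw counts; 0.0 when a ratio is undefined."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# Figures copied from the accuracy table; F1 is derived here, not reported.
REPORTED = {"female": (0.896, 0.89), "male": (0.909, 0.871)}

if __name__ == "__main__":
    for gender, (p, r) in REPORTED.items():
        print(f"{gender}: F1 = {f1_score(p, r):.3f}")
```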

Thanks
