speechbrain_dsr_proj-3's Introduction

Multi-microphone Signal Processing and Speech Recognition

Overview

Multi-microphone speech recognition is one of the most challenging problems in spoken language understanding. Previously, speech scientists relied on traditional methods of computing and preprocessing features, which were then fed to the recognition models. In this project, we experimented with three models: CRDNN (a combination of convolutional, recurrent, and fully connected networks), DenseNet, and ResNet, in combination with signal processing techniques such as beamforming for preprocessing the multi-microphone signal.
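To give a rough idea of what beamforming does to a multi-microphone signal, here is a minimal delay-and-sum sketch in plain Python. It is an illustration only, not the code used in this project: the function name `delay_and_sum`, the integer sample delays, and the toy signals are all assumptions made for the example, and the project's experiments may use a different beamformer.

```python
def delay_and_sum(channels, delays):
    """Align each channel by its integer sample delay and average them.

    channels: list of equal-length lists of samples, one per microphone.
    delays:   per-channel arrival delay in samples (non-negative integers),
              relative to the earliest microphone (hypothetical input format).
    """
    if len(channels) != len(delays):
        raise ValueError("need exactly one delay per channel")
    n = len(channels[0])
    out = [0.0] * n
    for signal, delay in zip(channels, delays):
        for t in range(n - delay):
            # Advance the channel by its delay so all copies line up in time.
            out[t] += signal[t + delay]
    # Averaging keeps the aligned source at its original amplitude.
    return [value / len(channels) for value in out]


# Toy example: the same pulse reaches microphone 2 one sample later.
mic1 = [0.0, 1.0, 0.0, 0.0]
mic2 = [0.0, 0.0, 1.0, 0.0]
enhanced = delay_and_sum([mic1, mic2], delays=[0, 1])
print(enhanced)
```

Once the channels are aligned, the target speech adds up coherently while uncorrelated noise tends to average out, which is why a step like this is commonly placed before the recognition model.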

This project uses the TIMIT Multi-mic dataset, a modified version of the standard TIMIT Acoustic-Phonetic Continuous Speech Corpus.

Training the models

Instructions for running the experiments can be found in the README.md files of the respective folders.

Results

https://share.streamlit.io/prabodhw96/log_streamlit/app.py

Metric: Phoneme Error Rate (PER)
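PER is conventionally computed as the number of substitutions, deletions, and insertions needed to turn the predicted phoneme sequence into the reference, divided by the reference length. The sketch below shows that computation in plain Python, assuming this standard definition; the helper names `edit_distance` and `phoneme_error_rate` and the example phoneme sequences are hypothetical and not taken from the project code.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def phoneme_error_rate(ref, hyp):
    """PER in percent: edit distance normalized by reference length."""
    if not ref:
        raise ValueError("reference sequence must not be empty")
    return 100.0 * edit_distance(ref, hyp) / len(ref)


# Made-up example: one substituted phoneme and one dropped final silence.
ref = "sil dh ax k ae t sil".split()
hyp = "sil dh ax k ah t".split()
per = phoneme_error_rate(ref, hyp)
print(f"PER: {per:.2f}%")
```

Lower is better: a PER of 0% means the predicted phoneme sequence matches the reference exactly.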

About SpeechBrain

Citing SpeechBrain

Please cite SpeechBrain if you use it for your research or business.

@misc{speechbrain,
  title={{SpeechBrain}: A General-Purpose Speech Toolkit},
  author={Mirco Ravanelli and Titouan Parcollet and Peter Plantinga and Aku Rouhe and Samuele Cornell and Loren Lugosch and Cem Subakan and Nauman Dawalatabad and Abdelwahab Heba and Jianyuan Zhong and Ju-Chieh Chou and Sung-Lin Yeh and Szu-Wei Fu and Chien-Feng Liao and Elena Rastorgueva and François Grondin and William Aris and Hwidong Na and Yan Gao and Renato De Mori and Yoshua Bengio},
  year={2021},
  eprint={2106.04624},
  archivePrefix={arXiv},
  primaryClass={eess.AS},
  note={arXiv:2106.04624}
}
