Giter VIP home page Giter VIP logo

webrtcvadsharp's Introduction

WebRtcVadSharp

A .NET Standard adapter for the WebRTC voice activity detection (VAD) component. The WebRTC VAD uses a Gaussian Mixture Model to detect speech, typically with better performance than the more common energy threshold model.

See below for a brief overview, or visit the wiki for more in-depth documentation.

Build Status Coverage Status NuGet Version

Installation

WebRtcVadSharp is available on NuGet:

Install-Package WebRtcVadSharp

This will install the .NET Standard adapter (WebRtcVadSharp.dll) and an unmanaged library (WebRtcVad.dll) containing the supporting WebRTC algorithms.

Usage

In the simplest case, you just need to instantiate a WebRtcVad object and supply it with a byte[] of raw audio.

bool DoesFrameContainSpeech(byte[] audioFrame)
{
  using var vad = new WebRtcVad();
  return vad.HasSpeech(audioFrame, SampleRate.Is8kHz, FrameLength.Is10ms);
}

ℹ️ Note that WebRtcVad implements IDisposable, so a using block is necessary.

ℹ️ This library (and WebRTC itself) only supports raw, 16-bit linear PCM audio, and will not work with WAV files or other container formats. For hints on converting your audio, see issue #6.

Configuration

The underlying VAD code can be configured along three axes:

  • Frame length: 10ms, 20ms and 30ms frames are supported.
  • Sample rate: 8kHz, 16kHz, 32kHz and 48kHz sample rates are supported.
  • Operating mode: four levels of "aggressiveness" are supported.

These options may be set via properties on the WebRtcVad object. More documentation on each is available in the wiki.

License

The code in this repository dual licensed. The .NET code and the DLL exports are covered by the MIT license, while the WebRTC code — imported from the WebRTC repository — is licensed under Google's WebRTC license.

webrtcvadsharp's People

Contributors

ladenedge avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.