Giter VIP home page Giter VIP logo

speechenhancement's Introduction

Speech Enhancement

Speech Enhancement 是一个基于深度学习模型的语音降噪和去混响工具。

  • 支持多种格式音频文件
  • 基于 CPU 推理的语音增强模型
  • 基于 EBU R128 标准对响度进行归一化处理
  • 分片并行处理,提高处理效率

安装

Docker

  1. 拉取Docker镜像

    dockerHub

    docker pull goomd/ghcr.io.tech-podcasts.speech_enhancement:latest

    Github Packages

    docker pull ghcr.io/tech-podcasts/speech_enhancement:latest

  2. 启动容器

    dockerHub

    docker run -p <主机端口>:8080 -d goomd/ghcr.io.tech-podcasts.speech_enhancement:latest

    Github Packages

    docker run -p <主机端口>:8080 -d speech_enhancement:latest

    其中,主机端口是指用户在主机上访问该服务的端口。

使用

你可以通过浏览器访问主机的端口,即可使用该服务。

我们的处理链默认音频格式为采样率44100Hz,采样深度16bit的wav文件,如果你的文件不是这个格式会进行转码操作,从而会增加一定的处理时间。

最终完成的文件将是一个wav文件,所以文件会比原文件大很多。以mp3为例,如果原文件大小为10M,那么处理后的文件大小将会是50M左右。

性能

在 4 核 8 线程的 CPU 上,处理 40 分钟的音频文件,耗时在5分钟左右。

参考

DeepFilterNet

ffmpeg

ffmpeg-normalize

gin

speechenhancement's People

Contributors

ganymedenil avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.