dongsig Goto Github PK

followers: 3.0 following: 14.0 repos: 108.0 gists: 0.0

Name: dyang

Type: User

Company: Tencent

Bio: Speech

Location: Shanghai

dyang's Projects

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding

pyaudioanalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

quality-net

Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)

real-esrgan

Real-ESRGAN aims at developing Practical Algorithms for General Image Restoration.

realtalk

Detects fake voices in YouTube videos with 94% accuracy and alerts the user to prevent misinformation [Hack the North 2019]

reaper

reconstructing_faces_from_voices

Implementation of the "Reconstructing Faces from Voices" paper.

rnnoise

Recurrent neural network for audio noise reduction

rt_auxiva_rls

signal-alignment

Algorithms to align 1D signals via cross correlation and likelihood maximization.

sliding_dtw

Keyword Spotting using Sliding DTW

sound-source-localization-algorithm_doa_estimation

关于语音信号声源定位DOA估计所用的一些传统算法

speaker-anti-spoofing-classifiers

Baselines and Classifiers for speaker anti-spoofing detection

speaker-identification-python

Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library

speakerprofiling

Estimating the Age, Height, and Gender of a speaker with their speech signal.

speech-aligner

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

speech-emotion-analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

speech-enhancement

Deep neural network based speech enhancement toolkit

speech-keyword-verification

Verifying Deep Keyword Spotting Detection with Acoustic Word Embeddings

speech_enhancement_dnn_nmf

Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF

spoken-keyword-spotting

In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keyword Spotting task.

spoof_speech_detection

Сlassification of the real speech and speech from device speakers

torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

tts

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

unified2021

A UNIFIED SPEECH ENHANCEMENT FRONT-END FOR ONLINE DEREVERBERATION, ACOUSTIC ECHO CANCELLATION, AND SOURCE SEPARATION

vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

vad-1

Voice Activity Detector

vad-python

Voice Activity Detector in Python

vgg-speaker-recognition

Utterance-level Aggregation For Speaker Recognition In The Wild

dongsig Goto Github PK

dyang's Projects

Recommend Projects

Recommend Topics

Recommend Org