entn-at Goto Github PK

followers: 107.0 following: 287.0 repos: 1.6K gists: 2.0

Name: Ewald Enzinger

Type: User

Bio: Ph.D. EE (UNSW Sydney). ML, speaker recognition, speech recognition, speech synthesis, forensic voice comparison

Twitter: entn_at

Location: Portland, Oregon

Blog: https://entn.at/

Ewald Enzinger's Projects

spec_augment

🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

specaugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

spectralcluster

Python re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"

speech-denoising-wavenet

A neural network for end-to-end speech denoising

speech-emotion-recognition

speech emotion recognition using a convolutional recurrent networks based on IEMOCAP

speech-language-processing

A curated list of speech and natural language processing resources

speech-representations

Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)

speech-resynthesis

An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

speech-separation

speech-to-text-russian

Проект для распознавания речи на русском языке на основе pykaldi.

speech-transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

speech-transformer-tf2.0

transformer for ASR-systerm (via tensorflow2.0)

speech_activity_detection

Unsupervised speech activity detection system.

speech_emotion_recognition_dnn-elm

Implementation of Speech Emotion Recognition using DNN-ELM

speechkitt

🗣 A flexible GUI for Speech Recognition

speechlab-aims-kaldi-aks

Kubernetes/Docker Load Balancing Project for NTU Speechlab / AISG AIMS Speech Recogition System

speechnas

SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification

speechprompt

An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks

speechset

Numpy-librosa implementation of Speech dataset pipeline

speechsplit

Unsupervised Speech Decomposition Via Triple Information Bottleneck

speechsplit-1

An implement of SPEECHSPLIT

speechvae

This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and Transformation".