Giter VIP home page Giter VIP logo

king's Projects

pychain icon pychain

PyTorch implementation of LF-MMI for End-to-end ASR

pyroomacoustics icon pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developped as a fast prototyping platform for beamforming algorithms in indoor scenarios.

python icon python

All Algorithms implemented in Python

pytorch-kaldi icon pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

pytorch-tdnn icon pytorch-tdnn

Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training

pytorch_mlp_for_asr icon pytorch_mlp_for_asr

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

ray icon ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

runx icon runx

Deep Learning Experiment Management

sadtalker_emo icon sadtalker_emo

和 emo 类似的 图片+ 音频 转 视频。 [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

scws icon scws

开源免费的简易中文分词系统,PHP分词的上乘之选!

sednn icon sednn

deep learning based speech enhancement using keras python, make it easy to use

segan icon segan

Speech Enhancement Generative Adversarial Network in TensorFlow

sherpa-onnx icon sherpa-onnx

各种功能集合:ASR\TTS: Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift

silero-vad icon silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

snownlp icon snownlp

Python library for processing Chinese text

sonic icon sonic

Simple library to speed up or slow down speech

sound-of-pixels icon sound-of-pixels

视频中的与对象关联的音频分割:Codebase for ECCV18 "The Sound of Pixels"

speechtasks-dataset icon speechtasks-dataset

This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent speech tool development, and speech applications.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.