ai-x-king's Projects

l3das22-task2

A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection

lhotse

Tools for handling speech data in machine learning projects.

librosa

Python library for audio and music analysis

lidbox

End-to-end spoken language identification out of the box.

lstm

Minimal, clean example of LSTM neural network training in Python, for learning purposes.

machine-learning-notes

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides), with video links.

mlapp_cn_code

Chinese translation of "Machine Learning: A Probabilistic Perspective" (Kevin P. Murphy), with Python implementations of the algorithms in the book.

modern-cpp-tutorial

📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/

mtfaa-net

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

nasbowl

[ICLR '21] Interpretable Neural Architecture Search using Bayesian Optimisation with Weisfeiler-Lehman Kernel (NAS-BOWL)

onnx

Open standard for machine learning interoperability

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

prml

Repository of notes, code, and notebooks in Python for the book "Pattern Recognition and Machine Learning" by Christopher Bishop

psst

Prosodic Speech Segmentation with Transformers

pyctcdecode

A fast and lightweight Python-based CTC beam search decoder for speech recognition.

pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

rvad

MATLAB and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

sherpa

Streaming and non-streaming ASR server for next-gen Kaldi
