luquesky Goto Github PK
Name: luquesky
Type: User
Name: luquesky
Type: User
A list of publically available audio data that anyone can download for ASR or other speech activities
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
📊 Easily apply 527 machine learning models trained on AudioSet.
Practical info for BISON BCN meeting Sept. 25th
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Audio fingerprinting and recognition in Python
A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
More than a hundred strange attractors
Speech Commands Recognition using end-to-end deep learning models in pytorch
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Proof of concept app that demonstrates use of KeenASR speech recognition framework
How to do Real Time Trigger Word Detection with Keras | DLology
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
Keyword Spotting for detecting a word in an audio file
A collection of resources to make a smart speaker
Keyword spotting on Arm Cortex-M Microcontrollers
A lightweight, simple-to-use, RNN wake word listener
Python interface to the WebRTC Voice Activity Detector
Python implementation for Reinforcement Learning: An Introduction
official implementation for the paper "Simplifying Graph Convolutional Networks"
Music Identification Program based on Shazam's methods
Generate speech data sets using the audios and transcriptions of YouTube videos.
Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
A TensorFlow implementation of DeepMind's WaveNet paper
Repo de mi tesis
🔊 A comprehensive list of open-source datasets for voice and sound computing (40+ datasets).
🗣️ A book and repo to get you started programming voice computing applications in Python - 10 chapters and 200+ scripts.
Runnable demo for Kaldi android
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.