wl3b10s's Projects
:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
A knowledge-grounded human-human dataset of open-domain conversations.
Audio super resolution using neural networks
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
A curated list of awesome Machine Learning frameworks, libraries and software.
Recurrent Neural Network - A curated list of resources dedicated to RNN
TensorFlow - A curated list of dedicated resources http://tensorflow.org
Using ARM Compute Library (NEON+GPU) to speed up Caffe; providing utilities to debug, profile and tune application performance
The GitHub open source software repository on interpreting super-resolution CNNs for sub-pixel motion compensation in video coding
PyTorch implementation of complex convolutional neural networks
Public facing notes page
2.5D visual sound dataset
Faster R-CNN
Mirror of git://source.ffmpeg.org/ffmpeg.git
:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'
"LipNet: End-to-End Sentence-level Lipreading" in PyTorch
Lip Reading in the Wild using ResNet and LSTMs in PyTorch
Torch code for using Residual Networks with LSTMs for Lipreading
Looking to listen at the cocktail party
Control adaptive filters with neural networks.
This research aims to deploy CNNs (Convolutional Neural Networks) on mobile devices with low complexity and high speed.
Models and examples built with TensorFlow
Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Visualizer for neural network, deep learning and machine learning models
Natural Language Processing Tasks and References