Jiahong Zhao's Projects
Collection of various MATLAB functions for spatial audio processing released by the 3D3A Lab at Princeton University
A toolkit for customizing the ambiX ambisonics-to-binaural renderer
《挑战程序设计竞赛》习题册攻略
The Decision Systems Lab (DSL) at UOW announces the DSL@UOW AI Meetup Group. The meetups will involve brief tutorials on topics such as: Machine Learning, Robotic Process Automation and Chatbots, Agents, Alphago and Game Tree Search, Automated Planning and so on. The tutorials will be interspersed with (and followed by) much discussion. Come talk about what interests you in AI, what you'd like to do with it, how to AI-enable your favourite system and what the future holds. WHEN: Every Thursday at 5:30pm. Sessions will go for an hour to 90 minutes. Your enthusiasm will determine how long we'll go for.... WHERE: Bldg 6 (SMART Building) Room 105, UOW Main Campus
数据结构和算法必知必会的50个代码实现
Ambisonics utilities to use with Matlab
Real-time object detection on Android using the YOLO network with TensorFlow
List of articles related to deep learning applied to music
挑战程序设计竞赛2 算法和数据结构 pdf及源码
This project was completed in junior year in April 2019. My level is limited. There may be bugs in the project. Welcome to propose solutions.
Tensorflow 2.0 implementation of the paper: A Fully Convolutional Neural Network for Speech Enhancement
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation"
Deep Neural Network for Speaker Separation
Real-time updates and information about key SARS-CoV-2 variants, plus the scripts that generate this information.
List of Computer Science courses with video lectures.
Tools for creating and manipulating computer vision datasets
YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06
Deep Recurrent Neural Networks for Source Separation
Real-time facial landmarks detection / 摄像头人脸检测并进行特征点标定
Code and Dataset for CVPR2020 "Dynamic Refinement Network for Oriented and Densely Packed Object Detection"
A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".
The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmented-reality (AR) -motivated multi-sensor egocentric world view.
embedded software
Splits for epic-sounds dataset