shammur Goto Github PK

followers: 17 following: 2 repos: 94 gists: 1

Name: Shammur Absar

Type: User

Company: QCRI

Bio: Interested in analyzing and understanding human conversation. Main focus: speech overlaps, turn-takings, speech discourse, code-switching, explainability.

Blog: http://shammur.one/

Shammur Absar's Projects

dialectid_siam

Dialect identification using Siamese network

dynamic-superb

The official repository of Dynamic-SUPERB.

dynamic_memory_networks_with_keras

Keras implementation of the dynamic memory networks from https://arxiv.org/pdf/1603.01417.pdf

e2e_lfmmi

E2E system with LF-MMI; word N-gram for Mandarin

eusipco2017

The phoneme classification code for EUSIPCO 2017 paper: Timbre Analysis of Music Audio Signals with Convolutional Neural Networks

factorizedhierarchicalvae

This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data"

fine-grained-sentiment

A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.

interactive-keras-captioning

Interactive multimedia captioning with Keras

itri-speech-recognition-dataset-generation

Automatic Speech Recognition Dataset Generation

iwslt22-dialect

IWSLT 2022 Dialectal Speech Translation Shared Task

jiwer

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

k6nele

An Android app that offers speech-to-text services and user interfaces to other apps

kaldi

This is the official location of the Kaldi project.

keras-resources

Directory of tutorials and open-source code repositories for working with Keras, the Python deep learning library

lium-diarization

Copy of Lium Speaker Diarization project with a new build script.

lre15_siam

Language identification using Siamese network based on i-vector

machine-learning-interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

models

Models and examples built with TensorFlow

Phoneme recognition using the pre-trained models Wav2vec2, HuBERT, and WavLM. This project compares three self-supervised models, Wav2vec (2019, 2020), HuBERT (2021), and WavLM (2022), each pretrained on a corpus of English speech. The models are used in various ways to perform phoneme recognition for different languages, with a network trained using the Connectionist Temporal Classification (CTC) algorithm.
