Name: Shammur Absar
Type: User
Company: QCRI
Bio: Interested in analyzing and understanding human conversation. Main focus: speech overlaps, turn-takings, speech discourse, code-switching, explainability.
Blog: http://shammur.one/
Shammur Absar 's Projects
Attention Bidirectional Video Recurrent Net
Notebook from my article of breaking down the Agglomerative Clustering. https://towardsdatascience.com/breaking-down-the-agglomerative-clustering-process-1c367f74c7c2
ALT research group publications
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
Country-level Arabic dialect identification (17 Arabic countries)
Arabic Dialectal Offensive Language dataset from social media comments on news post from facebook, twitter and youtube platforms
Categorize social media news post (short text) to multiple categorizes including politics, health, environment, sports among others
arabic offensive language detection model from social media comments and posts
Arabic NLP tool used to perform Text Search, POS tagging, Translation.
easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox
Audio degradation toolbox in python, with a command-line tool. It is useful to apply controlled degradations to audio: e.g. data augmentation, evaluation in noisy conditions, etc.
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
An experimental open-source attempt to make GPT-4 fully autonomous.
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Curated list of python software and packages related to scientific research in audio
TensorFlow code and pre-trained models for BERT
Charsiu: A neural phonetic aligner.
API for interacting with ChatGPT and GPT4 using Python and from Shell.
Compact Language Detector 2
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Dataset for training machine learning model for automatically generating psychiatric case notes from doctor-patient conversations.
Automatic Dialect Detection Repository