alefiury Goto Github PK

followers: 23.0 following: 121.0 repos: 31.0 gists: 0.0

Name: Alef Iury

Type: User

Company: Universidade Federal de Goiás

Bio: Machine learning researcher and computer science undergraduate student focusing mostly on automatic speech recognition, sound event detection and bioacoustics.

Location: Goiânia, Goiás - Brasil

About Me 👨🏻‍💻

💡 I enjoy learning new things and explore a variety of disciplines, attempting to tie it all together.
🇧🇷 🇺🇸 🇫🇷 🇯🇵 Passionate about learning new languages.
🎓 Computer science master's student at Federal University of Goiás (UFG).
🔎 Speech recognition researcher at Centro de Excelência em Inteligência Artificial (CEIA).
🔎 NLP researcher at Hub de Inteligência Artificial e Arquiteturas Cognitivas (HIAAC)

Languages

Frameworks

Alef Iury's Projects

add-background-noise-and-white-noise

audio-tagging-single-attention-cnn

This is an implementation in Pytorch of a Single Attention Convolutional Neural Network model for audio tagging and sound event detection.

audioset-download

This package aims at simplifying the download of the AudioSet dataset.

automatic-gender-classification

Implementation in Pytorch of Deep Learning models for Automatic Gender Recognition (AGR) for the paper: "A Comparison of Deep Learning Architectures for Automatic Gender Recognition from Audio Signals".

bees-tomato

This repository has all the codes used in the work: classification of bees

canonical-genetic-algorithm

checkout2

compilador-2021-2

confidenceintervals

Confidence interval computation for evaluation in machine learning using the bootstrapping approach

coqui-tts-mod

ddsp-svc-dynamic-loading

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

fast-audioset-download

Download audioset data super fastly with youtube-dl, ffmpeg and python multiprocessing

ft-w2v2-ser

Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition

genetic_algorithm_multidimensional_knapsack

hash-table-backward-shift-visualization

mars5-tts

MARS5 speech model (TTS) from CAMB.AI

multilingual_kws_pytorch

Unofficial PyTorch implementation of Few-Shot Keyword Spotting in Any Language. A model for few-shot keyword spotting in any language, trained with the Multilingual Spoken Words Corpus.

resolucoes-maratona-behind-the-code-2021

se-r-2022-ser-track

Code for the winning solution in the SE&R 2022 Challenge - SER track.

se-r_2022_challenge_wav2vec2

Code for the paper "Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge"

ser-evo

This repository implements a Neuroevolution approach to the task of spontaneous speech emotion recognition using wav2vec2 embeddings and Natural Evolution Strategies.