Giter VIP home page Giter VIP logo

et_vae's Introduction

Audio Effect Transfer Using VAEs

Overview

This project presents a machine learning system aimed at transferring audio effects from one monophonic audio source to another. Nowadays, applying effects to sound is a ubiquitous practice, enhancing the pleasantness or serving as a contrast to improve other sounds. This system is particularly valuable in the music industry, where sounds in songs often undergo the application of multiple effects.

You can find some model showcases in the playground.ipynb.

Key Features

  • Automated Audio Effects Transfer: The system efficiently replicates audio effects from one sound to another, overcoming the laborious process involved in manual replication.
  • Innovative Use of VAE: The backbone of the system is a Variational Autoencoder (VAE), which generates a latent space specifically tailored for manipulating audio effects.
  • Unique Loss Term: Our system includes a novel loss term that successfully separates the timbre, pitch, and various audio characteristics of the original sound from the effects applied to it.

Installation

Clone the repository and install the necessary dependencies.

git clone [repository-link]
cd [repository-name]
pip install -r requirements.txt

Dataset Generation

A python script is provided to generate a dataset.

Script Usage

The script generate_stft_specs.py in the ./scripts directory can be used to create datasets with different audio transformations. It accepts the following parameters:

python ./scripts/generate_stft_specs.py [dataset-type] [spectrogram-type]
  • dataset-type: Choose from test or valid to specify the type of dataset you wish to generate.
  • spectrogram-type: Specify the type of spectrogram. Options include stft (short-time Fourier transform), cqt (constant-Q transform), mel (Mel-scaled spectrogram), or hifi (high-fidelity).

Example

To generate a test dataset with Mel-scaled spectrograms:

python ./scripts/generate_stft_specs.py test mel

et_vae's People

Contributors

claussss avatar

Watchers

 avatar Pradyumann Singhal avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.