Giter VIP home page Giter VIP logo

vggish_audioclassifer_02456's Introduction

Acoustic Scene Classification using transfer learning on VGGish pre trained model

By Tobias Toft Christensen, Mikkel Heber Hahn Petersen, and Anders Hansen Warming.

This repository is related to a paper presenting an acoustic scene classification method, which uses transfer learning on a VGGish pre-trained model. Transfer learning is a method where knowledge from solving one problem is gained and stored, and can subsequently be used and applied to a related problem. The performance of this method is evaluated on the TUT Acoustic Scenes 2017 data set. A data set collected in Finland by Tampere University of technology. The data collection has received funding from the European Research Council and is part of a DCASE \textit{(Detection and Classification of Acoustic Scenes and Events)} 2017 challenge. The project is related to the DTU course 02456 Deep Learning.

The project are written in Python programming language and some of the scripts is formatted into Jupyter Notebooks. This repository contains a folder "tfRecordsReal" with the tfRecords for the training data, the validation data and the test data. Further more the resulting model is compiled and saved in the folder 3ClassModel and 15ClassModel, respectively for the 3 class and the 15 class classification problem. This repository does NOT contain the pre-trained VGGish model (ref: https://github.com/tensorflow/models/tree/master/research/audioset), but can be obtained from the link. The pre-trained VGGish model is due to the tfRecords, not needed to reproduce the project results.
Finally this repository contain a Jupyter Notebook file (AcousticSceneClassifier.ipynb) with our model.

Pre-trained models - Reproduce results

To reproduce the results illustrated in the Article the two trained models are saved, and can be restored. The two saved models are respectively the 3 class problem and the 15 class problem. Only test classification accuracy for the 3 class problem is illustrated in the Jupyter Notebook. The test for the 15 class problem is preformed the same way, just tuned to 15 classes. The code can be found in the repository under "restoreTest15Classes.py".

vggish_audioclassifer_02456's People

Contributors

tobiastoft91 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.