Giter VIP home page Giter VIP logo

dcase2017-baseline-system's Introduction

DCASE2017 Baseline system

Audio Research Group / Tampere University of Technology

Build Status Coverage Status

Authors

Name
Toni Heittola Baseline system, DCASE Framework, Documentation [email protected], http://www.cs.tut.fi/~heittolt/, https://github.com/toni-heittola
Aleksandr Diment Dataset synthesis (Task 2) [email protected], http://www.cs.tut.fi/~diment/
Annamaria Mesaros Documentation [email protected], http://www.cs.tut.fi/~mesaros/

Documentation

See https://tut-arg.github.io/DCASE2017-baseline-system/ for detailed instruction, manuals and tutorials.

Getting started

  1. Clone repository from Github or download latest release.
  2. Install requirements with command: pip install -r requirements.txt
  3. Run the application with default settings: python applications/task1.py

System description

This is the baseline system for the Detection and Classification of Acoustic Scenes and Events 2017 (DCASE2017) challenge tasks.

The baseline system is intended to lower the hurtle to participate the DCASE challenges. It provides an entry-level approach which is simple but relatively close to the state of the art systems to give reasonable performance for all the tasks. High-end performance is left for the challenge participants to find.

In the baseline, one single low-level approach is shared across the tasks by application specific extensions. Main idea of this is to show the parallelism in the tasks settings, and show how easily one can jump between tasks during the system development.

The main approach implemented in the baseline system:

  • Acoustic features: Log Mel-band energies extracted in 40ms windows with 20ms hop size.
  • Machine learning: neural network approach using multilayer perceptron (MLP) type of network (2 layers with 50 neurons each, and 20% dropout between layers).

Directory layout

.
├── applications            # Task specific applications (task1.py, task2.py, and task3.py) 
│   └── parameters          # Default parameters for the applications
├── dcase_framework         # DCASE Framework code
├── docker                  # Docker container to reproduce baseline results
├── docs                    # Docs in HTML format
├── documentation           # Documentation sources (Sphinx)  
├── examples                # Examples how to extend the DCASE Framework
├── tests                   # Unit tests
├── EULA.pdf                # End-user license agreement
├── README.md               # This file
└── requirements.txt        # External module dependencies 

Installation

The system is developed for Python 2.7 and Python 3.6. The baseline system is tested to work with Linux, Windows and MacOS operating systems.

To ensure that all external modules are installed, run command:

pip install -r requirements.txt

See more detailed instructions from documentation.

Changelog

1.2 / 2017-06-20

  • Introduced recognizer classes: SceneRecognizer, EventRecognizer
  • Introduced Keras callbacks: ProgressLoggerCallback, ProgressPlotterCallback, StopperCallback, StasherCallback
  • Added new learner classes: SceneClassifierKerasSequential, EventDetectorKerasSequential
    • Full support for Keras sequential API
    • Epoch-by-epoch processing, with external metric evaluation (using sed_eval)
    • DataProcessor class to embed feature processing and data processing chains
    • Feature generators to allow loading feature data from disk during training procedure
  • Added possibility to use constant and simple math equations in the keras based learner parameters

1.1.1 / 2017-06-06

  • feature_stacker parameter hash added to the data paths. This broke backward compatibility, previously stored models and results has to be re-calculated or moved to correct paths.

1.1 / 2017-04-02

  • Added Docker container to reproduce baseline results
  • Fixed training data ordering (might have been different on different computer setups)
  • Added Windows compatibility

1.0 / 2017-03-20

  • First public release

License

The DCASE Framework and the baseline system is released only for academic research under EULA.pdf from Tampere University of Technology.

dcase2017-baseline-system's People

Contributors

emrcak avatar toni-heittola avatar wetdog avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.