ms2176 / aaico-voice-processing-challenge-2024

This project forked from aaico/aaico-voice-processing-challenge-2024

TriCode's solution for the AAICO February 2024 Voice Processing Challenge: a Python-based audio command detection system that processes streaming audio and detects wake words and specific commands (BATTERY, OXYGEN, TEMPERATURE) using correlation techniques and multithreading.

Home Page: https://community.aaico.com/aaico-february-2024-hackathon/

Python 100.00%

Audio Command Detection System - TriCode

This project is our team's solution for the AAICO February 2024 Voice Processing Challenge. It implements an audio command detection system using Python, focusing on processing streaming audio data to detect specific commands.

Team Information

Challenge Overview

The challenge involves processing the 'audio_aaico_challenge.wav' file, simulating real-time streaming. The main tasks are:

  • Process audio frames of 512 samples each (32 ms per frame at the 16000 Hz sample rate)
  • Detect and label commands in the audio stream
  • Label samples as 0 (command detected) or 1 (no command)
  • Save results for evaluation
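
The framing and labelling tasks above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the helper names `iter_frames` and `label_command`, the synthetic one-second signal, and the "detected" frame range are all assumptions made for the example. Only the frame size, sample rate, and 0/1 label convention come from the challenge description.

```python
import numpy as np

SAMPLE_RATE = 16000  # Hz, from the challenge description
FRAME_SIZE = 512     # samples per frame, from the challenge description

def iter_frames(audio, frame_size=FRAME_SIZE):
    """Yield (start_index, frame) pairs; the last frame may be shorter."""
    for start in range(0, len(audio), frame_size):
        yield start, audio[start:start + frame_size]

def label_command(labels, start, end):
    """Mark samples in [start, end) as command (0). Hypothetical helper."""
    labels[start:end] = 0

# Synthetic stand-in for the challenge audio: one second of silence.
audio = np.zeros(SAMPLE_RATE, dtype=np.int16)
labels = np.ones(len(audio), dtype=np.int64)  # 1 = no command by default

frames = list(iter_frames(audio))
frame_ms = 1000 * FRAME_SIZE / SAMPLE_RATE  # duration of one full frame

# Pretend a command was detected in frames 10 to 14 (illustrative only).
label_command(labels, 10 * FRAME_SIZE, 15 * FRAME_SIZE)
```

Starting from an all-ones label array means that any sample the detector never touches is treated as "no command".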

Features

  • Simulates real-time audio streaming
  • Implements wake word detection and command identification
  • Uses multithreading for parallel processing
  • Saves detection results in 'results.pkl' for evaluation
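
One common way to combine simulated streaming with multithreading is a producer/consumer pair connected by a queue. The sketch below assumes that pattern; the README does not say how the threads are actually organised. The function names, the toy signal, and the loudness-based `is_loud` detector are placeholders invented for illustration.

```python
import queue
import threading

FRAME_SIZE = 512
SENTINEL = None  # signals the end of the stream

def emit_frames(samples, frame_queue):
    """Producer: push fixed-size frames, then a sentinel."""
    for start in range(0, len(samples), FRAME_SIZE):
        frame_queue.put((start, samples[start:start + FRAME_SIZE]))
    frame_queue.put(SENTINEL)

def process_frames(frame_queue, labels, is_command):
    """Consumer: label each frame's samples 0 (command) or 1 (no command)."""
    while True:
        item = frame_queue.get()
        if item is SENTINEL:
            break
        start, frame = item
        value = 0 if is_command(frame) else 1
        labels[start:start + len(frame)] = [value] * len(frame)

# Toy stream: quiet samples with one loud burst standing in for a command.
samples = [0] * 2048
samples[1024:1536] = [1000] * 512
labels = [1] * len(samples)

def is_loud(frame):
    """Placeholder detector (an assumption): a loud frame counts as a command."""
    return max(abs(s) for s in frame) > 500

frame_queue = queue.Queue()
producer = threading.Thread(target=emit_frames, args=(samples, frame_queue))
consumer = threading.Thread(target=process_frames,
                            args=(frame_queue, labels, is_loud))
producer.start()
consumer.start()
producer.join()
consumer.join()
```

Because the queue decouples the two threads, frame emission does not wait for the detector to finish the previous frame.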

Prerequisites

  • Python 3.9
  • Required libraries: librosa, numpy

Installation

  1. Clone this repository
  2. Install required libraries:
    pip install librosa numpy
    

Usage

  1. Ensure 'test_aaico_challenge.wav' is in the same directory as the script
  2. Run the script:
    python aaico_voice_processing_challenge.py
    
  3. The script processes the audio file and outputs frame processing times
  4. Results are saved in 'results.pkl'
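
The layout of 'results.pkl' is not described here, so the following sketch only shows the general pickle round trip. It assumes a flat list of per-sample labels and writes to a temporary directory to avoid overwriting a real results file.

```python
import os
import pickle
import tempfile

labels = [1, 1, 0, 0, 0, 1]  # made-up per-sample labels (0 = command)

# Write to a temporary location rather than the working directory.
path = os.path.join(tempfile.mkdtemp(), "results.pkl")
with open(path, "wb") as f:
    pickle.dump(labels, f)

# Load the file back and count the samples labelled as command.
with open(path, "rb") as f:
    loaded = pickle.load(f)

command_samples = loaded.count(0)
```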

Solution Approach

Our solution focuses on:

  1. Efficient audio frame processing
  2. Wake word detection using correlation techniques
  3. Command identification for specific keywords (BATTERY, OXYGEN, TEMPERATURE)
  4. Optimized multithreaded processing for real-time performance
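
The README does not spell out which correlation technique is used. As one plausible reading, the sketch below slides a reference template over a signal and scores each position with normalized cross-correlation. The function name, the sine-wave "wake word", the noise level, and the 0.8 threshold are all assumptions chosen for illustration.

```python
import numpy as np

def normalized_correlation_peak(signal, template):
    """Return (best_offset, best_score) of the template within the signal.

    The score is the normalized cross-correlation, in [-1, 1].
    """
    signal = np.asarray(signal, dtype=np.float64)
    template = np.asarray(template, dtype=np.float64)
    template = template - template.mean()
    t_norm = np.linalg.norm(template)
    n = len(template)
    best_offset, best_score = -1, -1.0
    for offset in range(len(signal) - n + 1):
        window = signal[offset:offset + n]
        window = window - window.mean()
        denom = np.linalg.norm(window) * t_norm
        if denom == 0:
            continue  # flat window or template: correlation undefined
        score = float(np.dot(window, template) / denom)
        if score > best_score:
            best_offset, best_score = offset, score
    return best_offset, best_score

# Toy data: a short 440 Hz tone buried in low-level noise at sample 700.
rng = np.random.default_rng(0)
template = np.sin(2 * np.pi * 440 * np.arange(256) / 16000)
signal = 0.01 * rng.standard_normal(2048)
signal[700:956] += template

offset, score = normalized_correlation_peak(signal, template)
detected = score > 0.8  # threshold is an illustrative assumption
```

Normalizing by the energy of the window and the template keeps the score independent of recording volume. The explicit loop is written for clarity; a real-time system would more likely use an FFT-based correlation.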

Evaluation

Results can be evaluated using the provided Colab notebook: AAICO Challenge Evaluation Colab

Note

This solution was developed specifically for the AAICO February 2024 Voice Processing Challenge and may require adjustments for other applications.

Acknowledgements

We thank the AAICO team for organizing this challenge and providing the opportunity to work on this exciting project.

Contributors

ms2176, theofagnoni
