
cnn_qbe_std

Description

Implementation of the work presented in "CNN based Query by Example Spoken Term Detection".

We have included example groundtruth files for the training and development sets.

The posterior features are extracted using the setup described in the paper "High-performance query-by-example spoken term detection on the SWS 2013 evaluation".

The input feature files for training and evaluation are Python dictionaries saved in a PyTorch-readable format. Each key is the name of a file listed in the groundtruth files, and each value is that file's feature matrix.
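As a minimal sketch of the dictionary layout described above, the helper below checks that every name taken from a groundtruth file has a feature matrix of the expected width. The function name check_features, the toy file names, and the frames-by-dimension orientation are assumptions made for illustration and are not part of the repository. The width of 152 mirrors the -input_size value used in the commands in this README. A real dictionary would presumably be obtained with torch.load; the toy data here uses plain lists so the sketch runs without PyTorch.

```python
def check_features(features, names, feature_dim=152):
    """Return a list of problems found; an empty list means all names are usable.

    `features` maps a file name to a matrix, assumed here to be laid out as
    frames x feature_dim. Works with nested lists, NumPy arrays, or tensors,
    since it only relies on len() and indexing.
    """
    problems = []
    for name in names:
        if name not in features:
            problems.append(f"{name}: missing from feature dictionary")
            continue
        matrix = features[name]
        if len(matrix) == 0:
            problems.append(f"{name}: no frames")
            continue
        # Only the first frame is inspected; rows are assumed to share one width.
        dim = len(matrix[0])
        if dim != feature_dim:
            problems.append(f"{name}: expected {feature_dim} columns, got {dim}")
    return problems


# Toy stand-in for a loaded feature dictionary (hypothetical names and sizes).
toy_features = {
    "query_001": [[0.0] * 152 for _ in range(3)],  # 3 frames, correct width
    "utt_001": [[0.0] * 100 for _ in range(5)],    # 5 frames, wrong width
}
toy_names = ["query_001", "utt_001", "utt_002"]    # utt_002 has no features

for problem in check_features(toy_features, toy_names):
    print(problem)
```

Running a check of this kind before training surfaces names that appear in a groundtruth file but have no matching features, or whose matrices do not match the model's input size.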

Training

python query_detection_dtw_cnn.py -optim adam -learning_rate 0.0001 -input_size 152 -batch_size 50 -layers 9 -depth 30 -dropout 0.2 -loss_threshold 0.1 -n_valid 50 -max_batch_dev 250 -max_batch_train 1000

Evaluation

python query_detection_dtw_cnn_evaluation.py -input_size 152 -depth 30 -load_model -modelpath cnn_qbe_std_model.pt -outdir outpath -query_list dev_queries_sample_list.txt -search_list search_utterances_sample_list.txt

Reference

@inproceedings{ram2018cnn,
  title={CNN based Query by Example Spoken Term Detection},
  author={Ram, Dhananjay and Miculicich, Lesly and Bourlard, Herv{\'e}},
  booktitle={Nineteenth Annual Conference of the International Speech Communication Association (INTERSPEECH)},
  year={2018}
}

Contact:

[email protected]
