Giter VIP home page Giter VIP logo

django-deepspeech-server's Introduction

django-deepspeech-server

This is Mozilla deepspeech server implemented in django. One can record sound in browser or upload compatible wav file and submit it to get corresponding text. It supports both HTTP/HTTPS and web sockets(ws).
Note: For good results using websockets, deepspeech server should have GPU for higher inference rate and SSD is better as it promotes fast disk I/O.

Acknowledgement

First of all, thanks to mozilla for such a awesome project. Speech to text is revolutionary technology that has huge scope in future and these type of open source efforts will definitely help nurture this tech. I have used wav-encoder to encode recorded sound in wav format and resampler to get 16000 Hz sample rate. Got some of my inspiration from deepspeech-server.

Installation

Download or clone this project. This project uses python3. To run this project you need to first install deepspeech. Check out deepspeech's README.md for details on how to install deepspeech on your machine.

Once deepspeech is installed, then run following command to install required dependencies of django-deepspeech-server:

pip3 install -r path/to/django-deepspeech-server/requirements.txt

Configuration

Enter path for your model, alphabet, lm and trie in speech-server-main/config/config.json file. Also make change to audiofiledir key in same config.json file, to match some valid path on your system. You can also limit audio length by setting audiofilelength to some time in seconds.

Go to directory where manage.py is located and start server:

python3 manage.py runserver

Go to your browser and browse to http://127.0.0.1:8000/dsserver. Alternatively, you can use use https server, using below command:

python3 manage.py runsslserver

Now you can access website over https (https://127.0.0.1:8000).

TODO

  • Support for web sockets.
  • Input file validation.
  • Real time inference.
  • Provide Google speech API like response, so that one only has to change websocket address.

License

MIT(see LICENSE)

django-deepspeech-server's People

Contributors

ashwan1 avatar cahya-wirawan avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.