Giter VIP home page Giter VIP logo

deepspeech.openvino's Introduction

DeepSpeech OpenVINO

  • Update: Currently, you can find the sample at OMZ which support Deep Speech 0.6.1 and 0.8.2, please find Speech Recognition Demo for detail.
  • Update: Add demo for Deep Speech Chinese version, please check the Demo for Deep Speech for detail.

This is the demo application which based on OpenVINO™ for Deep Speech algorithm, which make speech to text that are being performed on input speech audio. In this repository, not only provide an inference pipeline, but also provide an model downloader and accuracy checker for deep speech. Users can use accuracy checker to check the performance and use model downloader to download the deep speech model automatically. For more information, refer to the corresponding pages:

License

This repository is licensed under Mozilla Public License 2.0.

Reference

deepspeech.openvino's People

Contributors

fengyen-chang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

stefanruan

deepspeech.openvino's Issues

Unable to convert deepspeech-0.6.1 model

Using https://github.com/FengYen-Chang/open_model_zoo/tree/master/demos/python_demos/deep_speech_demo able to convert deepspeech-0.5.1 model.

But when attempt it on 0.6.1 version, get the below error :


python3 ./mo_tf.py --input_model /root/deepspeech-0.6.0-models/output_graph.pb --freeze_placeholder_with_value "input_lengths->[16]" --input input_node,previous_state_h/read,previous_state_c/read --input_shape [1,16,19,26],[1,2048],[1,2048] --output Softmax,lstm_fused_cell/GatherNd,lstm_fused_cell/GatherNd_1 --disable_nhwc_to_nchw
Model Optimizer arguments:
Common parameters:
        - Path to the Input Model:      /root/deepspeech-0.6.0-models/output_graph.pb
        - Path for generated IR:        /opt/intel/openvino_2020.3.194/deployment_tools/model_optimizer/.
        - IR output name:       output_graph
        - Log level:    ERROR
        - Batch:        Not specified, inherited from the model
        - Input layers:         input_node,previous_state_h/read,previous_state_c/read
        - Output layers:        Softmax,lstm_fused_cell/GatherNd,lstm_fused_cell/GatherNd_1
        - Input shapes:         [1,16,19,26],[1,2048],[1,2048]
        - Mean values:  Not specified
        - Scale values:         Not specified
        - Scale factor:         Not specified
        - Precision of IR:      FP32
        - Enable fusing:        True
        - Enable grouped convolutions fusing:   True
        - Move mean values to preprocess section:       False
        - Reverse input channels:       False
TensorFlow specific parameters:
        - Input model in text protobuf format:  False
        - Path to model dump for TensorBoard:   None
        - List of shared libraries with TensorFlow custom layers implementation:        None
        - Update the configuration file with input/output node names:   None
        - Use configuration file used to generate the model with Object Detection API:  None
        - Use the config file:  None
Model Optimizer version:
[ ERROR ]  Exception occurred during running replacer "REPLACEMENT_ID" (<class 'extensions.front.user_data_repack.UserDataRepack'>): No node with name previous_state_h/read.
 For more information please refer to Model Optimizer FAQ (https://docs.openvinotoolkit.org/latest/_docs_MO_DG_prepare_model_Model_Optimizer_FAQ.html), question #51.


Some updation of Chinese DeepSpeech

Hi, thanks for you wonderful work, and I have tested the demo code for Chinese DeepSpeech model.
There are some things to pay attention to:

  1. Follow the version of OpenVINO as shown below
    openvino 2023.0.0.dev20230217
    openvino-dev 2023.0.0.dev20230217
    openvino-telemetry 2022.1.0
  2. ./mo_tf.py is from https://github.com/openvinotoolkit/openvino/blob/master/tools/mo/openvino/tools/mo/mo_tf.py
  3. The L125 in deep_speech_demo.py is
    net = IENetwork(model=model_xml, weights=model_bin)
    Change it to
    net = ie.read_network(model=model_xml, weights=model_bin)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.