corticph / prefix-beam-search Goto Github PK

View Code? Open in Web Editor NEW

183.0 183.0 37.0 75 KB

Code for prefix beam search tutorial by @labodk

Home Page: https://medium.com/corti-ai/ctc-networks-and-language-models-prefix-beam-search-explained-c11d1ee23306

Python 100.00%

prefix-beam-search's People

Contributors

Stargazers

Watchers

prefix-beam-search's Issues

Error in code according to the paper

https://github.com/corticph/prefix-beam-search/blob/master/prefix_beam_search.py#L61 should be

Pnb[t][l] += ctc[t][c_ix] * Pb[t - 1][l]

How to integrate end symbol if the CTC table was trained without it

Hi, thanks for this awesome tutorial! I was wondering, how could i integrate the end symbol ('>') in the algorithm if my CTC tables do not contain it? Would it be possible with the Language Model by predicting EOS? Another thing is, how to train the model with CTC and include the end symbol? Would ju just append each sentence with the end mark? Thanks!

Change Beam Search output format

Thank you for the implementation.

Is there any way to find the path the beam search uses for each final hypothesis?

For example, if the hypothesis was

"a loud laugh followed at chunkys expense"

-------a-- l-l-l--oo--ud---- etc. ...

A_prev

A_prev is [' '] so it doesn't loop. Is this a bug?

Edit: I mean, nothing is added to my A_prev in the loop. And in the end A_prev is empty.

what is defintion of alphabet

in function greedy_decoder, alphabet = list(ascii_lowercase) + [' ', '>']. But in function prefix_beam_search, alphabet = list(ascii_lowercase) + [' ', '>', '%']. i feel confused.

Input formats

I have my corpus in plain text and language model in .arpa format generated from KenLM.
How can I input those to the algorithm?

corticph / prefix-beam-search Goto Github PK

prefix-beam-search's People

Contributors

Stargazers

Watchers

Forkers

prefix-beam-search's Issues

Error in code according to the paper

How to integrate end symbol if the CTC table was trained without it

Change Beam Search output format

A_prev

what is defintion of alphabet

Input formats

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent