Comments (6)
Hi,
I am not sure what is happening here,
can you tell me the language id you are targeting and share a couple of phone transcribed utterances so I can investigate?
Thanks!
from allosaurus.
The language id is 'tel', and these are two utterances from the text file under train:
000010007 a n ʊ ʂ a <blk> t̪ a ɖ r ɪ <blk> n a r s a j j a <blk> k o n n e eː l ɭ l a <blk> k r ɪ t̪ a <blk> a n a aː r o oː g j a t̪ o oː <blk> m r t̪ ɪ <blk> tʃ e d̪ a aː ɖ ʊ
000010013 p r a t̪ j e eː k a <blk> ɦ o oː d̪ a aː <blk> k o oː s a <blk> k e eː d̪ r a <blk> p a aI n a <blk> o t̪ t̪ ɪ ɖ ɪ <blk> tʃ e eː j a l e eː k a <blk> a b bh ɪ ʋ r d̪ d̪ d̪h ɪ <blk> p e eː r ʊ t̪ o oː <blk> m a n a <blk> p a aː l a k ʊ l ʊ <blk> ɪ t̪ a r a <blk> d̪ e eː ʃ a aː l a <blk> tʃ ʊ ʈ ʈ ʊ uː <blk> t̪ ɪ r ʊ g ʊ t̪ ʊ n n a aː r a n ɪ <blk> tʃ a d̪ r a b a aː b ʊ <blk> p r a b bh ʊ t̪ ʋ a aː n n ɪ <blk> e d̪ d̪ e eː ʋ a aː <blk> tʃ e eː ʃ a aː r ʊ
The corresponding audio files -
000010007.mp4
000010013.mp4
We also used our own phone directory -
phones_telugu.txt
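For anyone debugging a similar setup, here is a minimal sketch of how transcript lines like the two above could be cross-checked against a custom phone list. It assumes each transcript line is an utterance id followed by space-separated phones, and that separator tokens such as `<blk>` should be ignored; the helper names `parse_utterance` and `unknown_phones` are hypothetical and not part of allosaurus.

```python
def parse_utterance(line):
    """Split a transcript line into (utterance_id, list_of_tokens)."""
    utt_id, *tokens = line.split()
    return utt_id, tokens


def unknown_phones(tokens, inventory, separators=("<blk>",)):
    """Return the sorted tokens that are neither in the inventory nor separators.

    `separators` is an assumption: tokens that mark word boundaries or blanks
    in the transcript and are not real phones.
    """
    return sorted({t for t in tokens if t not in inventory and t not in separators})


if __name__ == "__main__":
    # Shortened example line in the same shape as the transcripts above.
    utt_id, tokens = parse_utterance("000010007 a n <blk> t̪ a")
    # A deliberately incomplete inventory, to show what gets reported.
    print(utt_id, unknown_phones(tokens, {"a", "n"}))
```

Any phone reported here appears in the transcripts but not in the inventory, which is worth ruling out before looking at the model itself.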
@xinjli Hey! Even I had the same question! Hope you can answer it soon:)
Hi anusha2904, thanks for sharing the details. I will take a look at your data.
sahanashettigar, do you have the same problem for Telugu?
Hey @xinjli! I'm working with Kannada, using the same IPA-notation phones that Anusha shared, and I got the same error. We both updated the phone inventory.
The audios used in train and validation are not machine-generated but recorded by different individuals. Thus, the amount of silence in the audio clips may vary.
We converted transcripts from a different notation to IPA using a label set that came with the transcripts.
It looks like one of the dependencies, panphon, recently upgraded its features to use dimension 24 instead of the previous 22, which causes the dimension mismatch.
I updated the version. Can you retry it? I think it should be fixed now.
Also, it looks like there was a trivial bug in the phone inventory setup. Please remove `<blk>` from your phone list, or list the phones again to remove the `<blk>`; it is better not to include it in the customized inventory.
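A minimal sketch of that inventory cleanup, assuming the custom phone file has one phone per line and that the blank token is spelled `<blk>`. The function names and file paths are hypothetical and not part of allosaurus.

```python
from pathlib import Path

BLANK = "<blk>"  # assumed spelling of the blank token


def clean_inventory(phones, blank=BLANK):
    """Return the phones with surrounding whitespace stripped, dropping
    empty entries and the blank token. Order is preserved."""
    cleaned = []
    for phone in phones:
        phone = phone.strip()
        if phone and phone != blank:
            cleaned.append(phone)
    return cleaned


def clean_inventory_file(src, dst, blank=BLANK):
    """Read a one-phone-per-line file, clean it, and write the result to dst."""
    lines = Path(src).read_text(encoding="utf-8").splitlines()
    cleaned = clean_inventory(lines, blank)
    Path(dst).write_text("\n".join(cleaned) + "\n", encoding="utf-8")
    return cleaned


if __name__ == "__main__":
    print(clean_inventory(["<blk>", "a", " aː ", "", "k"]))
```

The cleaned file can then be used to update the language's phone inventory, following the inventory customization steps in the allosaurus README.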
Thanks for submitting the issue!