Giter VIP home page Giter VIP logo

Comments (6)

xinjli avatar xinjli commented on August 12, 2024 1

Hi,

I am not sure what is happening here,
can you tell me the language id you are targeting and share a couple of phone transcribed utterances so I can investigate?

Thanks!

from allosaurus.

anushakabber avatar anushakabber commented on August 12, 2024 1

The language id = 'tel', and these are two utterances from the text file under train-

000010007 a n ʊ ʂ a blk> t̪ a ɖ r ɪ blk> n a r s a j j a blk> k o n n e eː l ɭ l a blk> k r ɪ t̪ a blk> a n a aː r o oː g j a t̪ o oː blk> m r t̪ ɪ blk> tʃ e d̪ a aː ɖ ʊ
000010013 p r a t̪ j e eː k a blk> ɦ o oː d̪ a aː blk> k o oː s a blk> k e eː d̪ r a blk> p a aI n a blk> o t̪ t̪ ɪ ɖ ɪ blk> tʃ e eː j a l e eː k a blk> a b bh ɪ ʋ r d̪ d̪ d̪h ɪ blk> p e eː r ʊ t̪ o oː blk> m a n a blk> p a aː l a k ʊ l ʊ blk> ɪ t̪ a r a blk> d̪ e eː ʃ a aː l a blk> tʃ ʊ ʈ ʈ ʊ uː blk> t̪ ɪ r ʊ g ʊ t̪ ʊ n n a aː r a n ɪ blk> tʃ a d̪ r a b a aː b ʊ blk> p r a b bh ʊ t̪ ʋ a aː n n ɪ blk> e d̪ d̪ e eː ʋ a aː blk> tʃ e eː ʃ a aː r ʊ

The corresponding audio files -

000010007.mp4
000010013.mp4

We also used our own phone directory -
phones_telugu.txt

from allosaurus.

sahanashettigar avatar sahanashettigar commented on August 12, 2024

@xinjli Hey! Even I had the same question! Hope you can answer it soon:)

from allosaurus.

xinjli avatar xinjli commented on August 12, 2024

Hi anusha2904, thanks for sharing the details, I will take a look at your data
sahanashettigar, do you have the same problem for Telugu?

from allosaurus.

sahanashettigar avatar sahanashettigar commented on August 12, 2024

Hey @xinjli! I'm working with Kannada with IPA notation phones same as what Anusha shared and I got the same error. We both updated the phone inventory.
The audios used in train and validation are not machine-generated but recorded by different individuals. Thus, the amount of silence in the audio clips may vary.
We converted transcripts from a different notation to IPA using a label set that came with the transcripts.

from allosaurus.

xinjli avatar xinjli commented on August 12, 2024

It looks that one of the dependency panphon is upgrading its feature recently to use dim 24 instead of previous dim 22, which causes the dim mismatch.

I updated the version and can you retry it? I think it should be fixed now.

Also, it looks there was a trivial bug in the phone inventory setup, please remove your from your phone list or you can list the phone again to remove the , it is better not to include it in the customized inventory.

Thanks for submitting the issue!

from allosaurus.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.