Comments (6)

synesthesiam commented on August 27, 2024

Is there enough of a pattern that we could automate some prompt corrections and re-train?

from larynx.

ddavout commented on August 27, 2024

I have to compare the prompts I use with the original ones. How many prompts do you think you need?

ddavout commented on August 27, 2024

In parl, there are 4 occurrences of "rerai"; the first 3 are affected:
text/part1/neut_parl_s01_0429.txt:
A défaut, je suggérerai à l’Assemblée de le rejeter.

text/part1/neut_parl_s02_0531.txt:
Cela représente, pour ceux qui l’ignoreraient, plus de deux fois le salaire moyen.

text/part1/neut_parl_s02_0589.txt:
Si le travail continue de cette manière, je me retirerai moi aussi.

text/part1/neut_parl_s03_0372.txt:
S’il nous rejoint, je retirerai mon amendement.

The only correct one is:
text/part1/neut_parl_s04_0597.txt:
Je les rencontrerai prochainement, probablement
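
A search like the one above can be scripted. The following is only a minimal sketch: the function names and the `root` argument are hypothetical, and it assumes the transcripts are UTF-8 `.txt` files laid out as in the paths quoted above (for example `text/part1/`). It lists every word containing a given fragment so that each hit can be checked against the recording by ear.

```python
import re
from pathlib import Path


def words_containing(text, fragment):
    """Return every word in `text` that contains `fragment` (case-insensitive)."""
    # [^\W\d_] matches one Unicode letter, so accented French letters count.
    letters = r"[^\W\d_]*"
    pattern = re.compile(letters + re.escape(fragment) + letters, re.IGNORECASE)
    return [match.group(0) for match in pattern.finditer(text)]


def scan_transcripts(root, fragment):
    """Yield (path, word, line) for each hit in the .txt files under `root`.

    `root` is a hypothetical directory argument, e.g. "text/part1".
    """
    for path in sorted(Path(root).rglob("*.txt")):
        for line in path.read_text(encoding="utf-8").splitlines():
            for word in words_containing(line, fragment):
                yield path, word, line.strip()


if __name__ == "__main__":
    for path, word, line in scan_transcripts("text/part1", "rerai"):
        print(f"{path}: {word}\n  {line}")
```

The script only finds candidates. Whether a given prompt is actually misread still has to be decided by listening to the matching audio.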

synesthesiam commented on August 27, 2024

I use Siwis as the "base" model for French, since it's the one where I had the most data available. So any corrections to the transcripts will improve it and all of the downstream models when I re-train.

Should I create a repo to share the corrected transcripts, or would you like to do that?

Also, thanks for your effort :)

ddavout commented on August 27, 2024

I have noticed quite a lot of "reading" problems. For my voice I just changed the prompts, and yes, it improved my voice, particularly where the defects are repeated, of course.
Another example: of the 9 occurrences of "erion" I found, 3 are wrong.

text/part1/neut_parl_s01_0633.txt: gagnerions vs gagnerons
Nous gagnerions beaucoup à examiner ce qui est pratiqué là-bas.

text/part1/neut_parl_s03_0462.txt: oserions vs oserons
Nous n’oserions pas, quant à nous, porter de telles accusations.

text/part1/neut_parl_s03_0622.txt:
Sans eux, nous ne serions pas là aujourd’hui, quoi que l’on pense, quoi que l’on dise.

text/part1/neut_parl_s04_0310.txt:
Nous souhaiterions savoir comment on peut faire.

text/part1/neut_parl_s04_0378.txt:
Je ne vois d’ailleurs pas comment nous le ferions…

text/part1/neut_parl_s06_0096.txt:
Certes, notre pays ne va pas aussi bien que nous le souhaiterions.

text/part1/neut_parl_s06_0666.txt: y is read as e (SAMPA)
Je crois que nous y gagnerions tous.

text/part2/neut_book_s06_0092.txt:
– Pourquoi serions-nous malades, puisqu’il n’y a pas de médecins dans l’île ? répondit très sérieusement Pencroff.

text/part3/emph_parl_s01_0633.txt: gagnerions vs gagnerons
Nous gagnerions BEAUCOUP à examiner ce qui est pratiqué là-bas.
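
The "gagnerions vs gagnerons" and "oserions vs oserons" cases suggest one pattern that could be checked semi-automatically. The sketch below is an assumption built on those two examples only, not a rule confirmed for the whole corpus: it pairs each word ending in "-erions" with its "-erons" counterpart, producing a listening checklist. The function name is hypothetical.

```python
import re

# Assumption: a word ending in "-erions" (conditional) may have been read
# as the "-erons" form (future), as in the two examples above.
ERIONS_WORD = re.compile(r"\b([^\W\d_]+?)erions\b", re.IGNORECASE)


def confusable_pairs(text):
    """Return (written word, possibly spoken variant) pairs found in `text`."""
    return [
        (match.group(0), match.group(1) + "erons")
        for match in ERIONS_WORD.finditer(text)
    ]


if __name__ == "__main__":
    prompt = "Nous gagnerions beaucoup à examiner ce qui est pratiqué là-bas."
    for written, variant in confusable_pairs(prompt):
        print(f"check audio: '{written}' may be read as '{variant}'")
```

It only proposes candidates. As the list above shows, most "-erions" prompts were read correctly, so each pair still needs a check against the recording before the transcript is changed.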

A repo is a good idea. Right now I am putting a lot of effort into chasing all these imperfections.
Unlike you, who are helped by larynx, I have to track subtler differences (with a Festival lexicon, one (word, POS) pair corresponds to one lexicon entry), and I need to take into account every liaison she makes, whether compulsory, optional, or completely wrong.
There are parts that are not read at all (at least ... one part between parentheses) and ...
The wave files are not good enough (in my opinion) for Festival, particularly, I would say, the badly truncated ones, cut with a script not suited to the French phoneset ... It's my "feeling", but one fact stands: the sound i + k is very weak. I look at the waves with Praat, and over time I am becoming more and more selective ... but that's another problem.

synesthesiam commented on August 27, 2024

If it would help you out, I have the prompt alignments too. I trained a French Kaldi model on these same IPA phonemes, and used the alignments in the training labels and to trim the WAV files.
