Giter VIP home page Giter VIP logo

Comments (5)

jameslyons avatar jameslyons commented on August 10, 2024

For higher sample rates you'll need more filterbanks, otherwise each filterbank will be covering a large frequency range. If you do use more filterbanks, the higher ones will contain very little information since there is little speech info above 8khz. Sampling around 16khz should do the best.

from python_speech_features.

passarel avatar passarel commented on August 10, 2024

Actually, this comes together with another doubt I have. I am quite concerned about the pre-emphasis function - it seems that it is a basic subtraction of the original signal by the same signal slightly delayed (multiplied by a constant near 1). This delay is considerably different depending on the frame rate (delay = (62.5 uS for fr = 16k, 24.4us for fr = 44.1).

Do you guys have any suggestion of paper or any other technical reference about this specific approach of pre-emphasis?

from python_speech_features.

jameslyons avatar jameslyons commented on August 10, 2024

Preemphasis is used to 'flatten' the spectrum a little bit. For speech signals there is usually more energy in the low frequencies compared to the high frequencies. The Preemphasis filter is a highpass filter that evens out the energy a bit. It was used a lot in the past when euclidean distances were commonly used in asr systems. With gmms or neural nets preemphasis doesn't really matter, results will be the same whether you use it or not. It was included in the code because every other mfcc library I have seen includes it. You can safely ignore it for most purposes. Alternatively you can run some tests on a dev set with different preemph coefficients and see which, if any, works better.

from python_speech_features.

passarel avatar passarel commented on August 10, 2024

Thanks, James... Your response was really helpful

from python_speech_features.

jameslyons avatar jameslyons commented on August 10, 2024

No problem, glad I could help

from python_speech_features.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.