Giter VIP home page Giter VIP logo

Comments (3)

patrick-kidger avatar patrick-kidger commented on May 31, 2024

Hi there. (In passing I'd note that I'm not a professor!)

So the GPUs we uses were some mid-level ones -- to quote from the paper, two GeForce RTX 2080 Ti, and two Quadro
GP100
. I doubt that the GPU choice is going to be responsible for poor performance though.

On the topic of model performance -- I am (obviously) quite surprised by the poor results you seem to be getting. IIRC with Speech Commands you should get to about 85% accuracy after just a few epochs, and then the rest of the training time slowly improves things a little further.

Are you definitely running the neural CDE models, and not one of the benchmark-for-comparison RNN models? The RNN models were all flaky on this dataset: sometimes they would produce excellent results, sometimes they would produce awful results, and it differed from training run to training run. That aside, it's also plausible that a change in software library somewhere has quietly broken something.

As a first place to start, I'd recommend trying the Speech Commands example from this repository. This was a follow-up paper that happened later, with a new codebase. (And substantially tidier code.) Getting some more data might help diagnose the issue.

from neuralcde.

dungxibo123 avatar dungxibo123 commented on May 31, 2024

Many thanks for your response.

I will try using the new code in your recommendation. In order to reply above questions. I tried to remove all other models. I just run NeuralCDE model. And, due to datasets/speech_commands.py does not contain any deprecated methods so I have not changed anything. But I think some small changes have broken something.

Again, thanks for your advices.

from neuralcde.

dungxibo123 avatar dungxibo123 commented on May 31, 2024

I'm sorry madam, In case someone falls into my fault, I will leave some comments for them.

Please make sure that torchaudio.load goes with normalize=False (default will be True), I have erased this term when process data.
If you want to train with fewer classes, please make sure, X and Y variables was generate as same size as batch_index.

Thanks.

from neuralcde.

Related Issues (8)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.