Comments (1)
Hi Anar, thank you for the interest.
The model return logits. You can apply a sigmoid function on these to get a value between 0,1. You can then set some threshold (like 0.5) to binarize the outputs.
I don't believe that the probability has to do with the length of the event in the transformer model, because there is no pooling applied. However, this may be true in CNNs models with a global average pooling for example.
I think you can run a second-by-second predictions, we had competitive results in HEAR challenge on short events. The wrapper is published as a pip package here. you can use it to get embeddings or logits for shorter audio clips. You can also change the window length example.
from passt.
Related Issues (20)
- Is it possible to use this project directly for a code example for instrument recognition? HOT 4
- mismatch version of pytorch-lighting and sarced HOT 15
- Installation issues HOT 1
- The loop in the diagram HOT 1
- RuntimeError: The size of tensor a (2055) must match the size of tensor b (99) at non-singleton dimension 3 HOT 3
- is `config.dyn_norm` enabled? HOT 1
- Is it possible to install the passt with python=3.6? HOT 2
- ImportError: cannot import name 'F1' from 'torchmetrics' (/app/anaconda3/lib/python3.7/site-packages/torchmetrics/__init__.py) HOT 1
- FSD50K - validating on eval data HOT 5
- Pretrained models config HOT 3
- OpenMic fine-tuned model? HOT 2
- Could not solve for environment specs HOT 4
- setup.py
- I have a problem. why convert wav to mp3? HOT 3
- difference of fine-tuning the pretrained models HOT 2
- Inference Issue HOT 2
- Getting started with a custom dataset HOT 8
- 音频事件检测
- test my own model HOT 1
- Inference on AudioSet HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from passt.