Comments (3)
More details on the model latency:
- On a Macbook Pro with M1 Pro, one pass of running the model's
forward
function typically takes between 30ms and 40ms. - That's too long to keep up with a stream of 256 frames at 16kHz, which only leaves a 16ms time window for processing.
- So the RTF (realtime factor) is between 1.8 (30/16) and 2.5 (40/16). It needs to be <1.0 in order for the model to qualify as realtime.
How did you calculate your RTFs?
from fullsubnet.
Good news: I updated my operating system to Sonoma 14.3.1 and that fixed it, without any further code changes. Now the processing time is consistently between 13ms and 15ms.
from fullsubnet.
For potential future reference, here's the torch.profiler output of a single inference run:
----------------------------- ------------ ------------ ------------ ------------ ------------ ------------
Name Self CPU % Self CPU CPU total % CPU total CPU time avg # of Calls
----------------------------- ------------ ------------ ------------ ------------ ------------ ------------
model_inference 5.41% 780.000us 100.00% 14.412ms 14.412ms 1
aten::lstm 6.87% 990.000us 86.02% 12.397ms 2.479ms 5
aten::linear 1.02% 147.000us 39.57% 5.703ms 126.733us 45
aten::addmm 29.68% 4.277ms 35.56% 5.125ms 122.024us 42
aten::sigmoid_ 15.89% 2.290ms 15.89% 2.290ms 21.204us 108
aten::tanh_ 8.48% 1.222ms 8.48% 1.222ms 33.944us 36
aten::tanh 7.88% 1.136ms 7.88% 1.136ms 31.556us 36
aten::copy_ 6.45% 930.000us 6.45% 930.000us 17.222us 54
aten::add_ 5.68% 818.000us 5.68% 818.000us 10.907us 75
aten::matmul 0.15% 22.000us 2.46% 354.000us 88.500us 4
----------------------------- ------------ ------------ ------------ ------------ ------------ ------------
Self CPU time total: 14.412ms
And here's a pretty timeline view:
from fullsubnet.
Related Issues (20)
- error
- Training and Validation cRM Mismatch HOT 2
- 仓库里模型比比赛时提交的结果好很多, 这个是有什么不同吗
- Unable to fine-tune pre-trained model (fullsubnet_best_model_58epochs.tar) HOT 1
- Normalization HOT 2
- [Question] How to do streaming reasoning (Real-Time), is there any documentation?
- [Question] The real-time speech enhance is poor, Need help! HOT 3
- Training dataset bug HOT 1
- New
- 😍😍
- 有关look-ahead的疑问
- แอปตัดนอยซ์
- Fast FullSubNet m=infinity HOT 1
- Low volume vocal segments are suppressed HOT 2
- How to use the cpu training model? HOT 2
- Improved FullSubNet Usage HOT 2
- Will there be fast fullsubnet pre-trained model released? HOT 1
- When will be the code of Mel-FullSubNet released? I'm very interested in the new work.
- DefaultCPUAllocator: can't allocate memory: you tried to allocate 178GB
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fullsubnet.