Comments (6)
Hey Fabian! hope you're well :)!
Thank you for you question.
- We did not use any MUSDB data for training or validation but datasets that we have in Deezer (which, may be part of the added value of Spleeter over other released models).
- The model is based on convolutional U-nets (one per instruments). I think some models in SISEC 2018 (JY from what I remember, but not sure, you could probably confirm that) were quite similar and got quite good performances without other data than MUSDB.
We also trained other kinds of model (such as LSTM based ones), but we finally kept this one because it makes possible very fast computation on GPUs (both for training and prediction) while having good separation results. - On MUSDB18 test, we get the following SDR values with the 4 stems model using multichannel Wiener filtering (which improves a bit the scores, but we believe is perceptively worst than basic ratio masks):
vocals SDR | bass SDR | drums SDR | other SDR | |
---|---|---|---|---|
Spleeter 4 stems | 6.86dB | 5.51dB | 6.71dB | 4.55dB |
Note, that we did not try do any optimization on these scores and did not use any MUSDB training data in the training process so these scores are an actual measure of the generalization power of the model (on western pop/rock song though).
There are some more detailed on the extended abstract of the demo we'll present in ISMIR next week.
from spleeter.
@romi1502 thanks for the info.
I think, especially for the research community, it would be cool to also present reproducible scores when just trained on MUSDB18. By doing it yourself, you might prevent people from using non-ideal parameters, hence, reporting scores that are too low. Oh and also we can save a bit of energy for the environment ;-)
from spleeter.
Could you please share how large was your dataset for the pretrained models?
from spleeter.
Hi @faroit Thanks for your Feedback.
Training on musdb is definitely something we can do but I'm not sure how much value it would bring to end users.
Our intent with Spleeter is not so much to compare ourselves with the latest separation models but rather to provide a fast and ready-to-use separation tool for researchers doing other tasks (e.g. transcription..). I'm afraid releasing multiple models trained on different datasets would complicate things for users.
from spleeter.
Training on musdb is definitely something we can do but I'm not sure how much value it would bring to end users.
I'm afraid releasing multiple models trained on different datasets would complicate things for users.
@mmoussallam I understand that but at one point people will use this to train on their own data and might publish results based on this repo.
I already trained and evaluated on MUSDB18 and it seems that there are some issues - See #81
It would be great if you could help to update the training configs for MUSDB18. Another options would be to maintain a fork of spleeter on sigsep to host a pretrained models on MUSDB18 for the source separation community, what do you think?
from spleeter.
@mmoussallam I am closing this issue since it is not related to the pretrained beans model. Feel free to reply here or in #81
from spleeter.
Related Issues (20)
- [Bug] The link to the model is incorrect in the GitHub Actions environment. HOT 2
- My batch script worked 2 days ago, doesnt work now. No changes done. HOT 3
- sorry. i cannot find "stft-backend" HOT 1
- 2sterms model just use in 44100?
- How to batch process audio files on spleeter?
- [Bug] Unable to install spleeter on Macbook running M1 chips because of old numpy version !!! HOT 1
- Doesn't Work with Sonoma HOT 2
- [Bug] Version 3.12 compatible
- Regarding the use of separate_to_file, a deprecation warning is prompted! HOT 1
- Cannot do `poetry add spleeter` due to `tensorflow-io-gcs-filesystem` error
- [Discussion] your question HOT 1
- [Feature] your feature name
- I created web UI. You can have it
- [Bug] Fail to install (with python 3.10.13) HOT 3
- Install fail on macOS Sonoma 14.2, on Apple M2 laptop HOT 4
- Why is the audio unchanged after separation? HOT 5
- Index Error
- [Discussion] cuda memory optimisation issues
- [Feature] Create a Docker image for Apple Silicon
- [Discussion] Is there any way to release memory after separating is done?? HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from spleeter.