Comments (5)
from pit-speech-separation.
Thanks for your reply.
In fact, I am quite confused about what the problem is.
The main part of PIT is not complicated. I also tried replacing the PIT part with your code and got the same issue.
The main difference I am sure of is the way I mix the data. I am using the
VoxCeleb dataset http://www.robots.ox.ac.uk/~vgg/data/voxceleb/. I create the input data with librosa's STFT and use only the log of the magnitude as the network input. To build the mixture, I directly add the two speakers' log magnitudes.
I do not have much experience with audio analysis. Do you think the way I created the training data may cause problems?
Thanks
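For readers following the mixing step described above, here is a minimal, dependency-free sketch of why it may matter. It is an illustration under assumptions, not a diagnosis of this particular setup: the two "speakers" are hypothetical synthetic sine tones, and a naive single-frame DFT stands in for librosa's STFT. Adding two log magnitudes gives the log of the product of the magnitudes, which differs from the log magnitude of a mixture formed by adding the waveforms.

```python
import cmath
import math

EPS = 1e-8  # small floor so log(0) is never evaluated (an assumed value)


def dft_magnitude(frame):
    """Magnitude spectrum of one real frame (naive DFT, bins 0..n/2)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def log_magnitude(frame):
    """Natural log of the magnitude spectrum, floored by EPS."""
    return [math.log(m + EPS) for m in dft_magnitude(frame)]


# Hypothetical stand-ins for two speakers: pure tones in DFT bins 4 and 10.
n = 64
speaker_a = [math.sin(2 * math.pi * 4 * t / n) for t in range(n)]
speaker_b = [0.5 * math.sin(2 * math.pi * 10 * t / n) for t in range(n)]

# Mixing as described in the comment: add the two log-magnitude spectra.
sum_of_logs = [a + b for a, b in zip(log_magnitude(speaker_a),
                                     log_magnitude(speaker_b))]

# The same quantity written as the log of a product of magnitudes.
log_of_product = [math.log((ma + EPS) * (mb + EPS))
                  for ma, mb in zip(dft_magnitude(speaker_a),
                                    dft_magnitude(speaker_b))]

# Alternative: add the waveforms first, then take the log magnitude.
mixture = [a + b for a, b in zip(speaker_a, speaker_b)]
log_of_mixture = log_magnitude(mixture)

# Bin 4 holds speaker A's tone. The waveform mixture keeps its energy there,
# while the sum of logs is pulled down by speaker B's near-zero magnitude.
print(sum_of_logs[4], log_of_mixture[4])
```

In this toy case the sum of logs suppresses every bin where either source is quiet, so the network input no longer resembles a recording of two people talking at once. Whether that explains the training issue here is an open question.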
from pit-speech-separation.
Thanks for your suggestion.
The reason I took the log is that the raw amplitude is distributed very unevenly. I did try using the raw amplitude and it did not work. Because I do not have the WSJ dataset, could you tell me what the distribution of your raw amplitude data looks like? Mine looks like this,
Sorry for disturbing you so much. I just want to figure out why my implementation does not work.
Thanks,
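Since the plot is not reproduced here, one way to describe such a distribution in text is with a few percentiles. The sketch below is only an illustration: the `magnitudes` list is hypothetical heavy-tailed random data standing in for real STFT magnitudes, and `summarize` is a helper name introduced for this example. It compares the spread of raw values with the spread after `log1p` compression.

```python
import math
import random
import statistics

random.seed(0)

# Hypothetical stand-in for raw STFT magnitudes: mostly near zero,
# with a long tail of large values.
magnitudes = [random.expovariate(1.0) ** 3 for _ in range(10_000)]


def summarize(values):
    """Return the median, the 99th percentile, and the maximum."""
    cuts = statistics.quantiles(values, n=100)  # 99 percentile cut points
    return {"p50": cuts[49], "p99": cuts[98], "max": max(values)}


raw = summarize(magnitudes)
logged = summarize([math.log1p(m) for m in magnitudes])

# Ratio of the 99th percentile to the median: a rough measure of skew.
raw_ratio = raw["p99"] / raw["p50"]
log_ratio = logged["p99"] / logged["p50"]

print("raw   :", raw, "p99/p50 =", round(raw_ratio, 1))
print("log1p :", logged, "p99/p50 =", round(log_ratio, 1))
```

Posting numbers like these for both datasets would make it easier to compare distributions without access to the other person's data.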