Comments (4)
@redwrasse, I agree that the original paper leaves out some details here and there. Take a look at (Gated) PixelCNN by WaveNet's main author (https://arxiv.org/pdf/1606.05328.pdf) and you will find that he "copies" the gated activation from there. Also, it seems they stacked the two convolutions (filter and gate) along the output dimension to save one conv1d.
For the latter, have a look here
https://github.com/cheind/autoregressive/blob/e1f9b72b0f9764f9b4d6b6f65f028cd50db6940e/autoregressive/wave.py#L63
from tensorflow-wavenet.
Answering this for myself after looking through the literature: yes, it looks like there are in fact two distinct dilated convolutions passed to the 'gated activation unit'; the diagrams in the original WaveNet paper appear misleading.
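For anyone following along, here is a minimal, dependency-free sketch of the gated activation unit, z = tanh(W_f * x) ⊙ σ(W_g * x), written both ways discussed in this thread: as two separate dilated convolutions, and as one convolution with the filter and gate kernels stacked along the output-channel axis and split afterwards. The function names, the single input channel, and the toy weights are assumptions made for illustration; this is not code from tensorflow-wavenet or from the linked repository.

```python
import math


def causal_dilated_conv(x, kernels, dilation):
    """Causal dilated 1-D convolution over a single input channel.

    x:       list of floats, length T
    kernels: one list of taps per output channel; the last tap multiplies x[t]
    Returns one list of length T per output channel (implicit left zero-padding).
    """
    outputs = []
    for taps in kernels:
        k_size = len(taps)
        channel = []
        for t in range(len(x)):
            acc = 0.0
            for k, w in enumerate(taps):
                idx = t - (k_size - 1 - k) * dilation  # only look at the past
                if idx >= 0:
                    acc += w * x[idx]
            channel.append(acc)
        outputs.append(channel)
    return outputs


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def gate(filter_out, gate_out):
    """Element-wise tanh(filter) * sigmoid(gate), channel by channel."""
    return [
        [math.tanh(f) * sigmoid(g) for f, g in zip(f_ch, g_ch)]
        for f_ch, g_ch in zip(filter_out, gate_out)
    ]


def gated_unit_two_convs(x, w_filter, w_gate, dilation):
    """Variant 1: two distinct dilated convolutions feed the gate."""
    return gate(
        causal_dilated_conv(x, w_filter, dilation),
        causal_dilated_conv(x, w_gate, dilation),
    )


def gated_unit_stacked(x, w_filter, w_gate, dilation):
    """Variant 2: one convolution with 2*C output channels, split in half."""
    stacked = w_filter + w_gate  # concatenate along the output-channel axis
    y = causal_dilated_conv(x, stacked, dilation)
    c = len(w_filter)
    return gate(y[:c], y[c:])


# Toy example (made-up numbers): 2 output channels, kernel size 2, dilation 2.
signal = [0.5, -1.0, 2.0, 0.0, 1.5, -0.5]
w_f = [[0.1, 0.2], [-0.3, 0.4]]
w_g = [[0.5, -0.6], [0.7, 0.8]]
two_convs = gated_unit_two_convs(signal, w_f, w_g, dilation=2)
stacked = gated_unit_stacked(signal, w_f, w_g, dilation=2)
```

Both variants compute the same values; the stacked version just issues a single convolution call, which is the saving mentioned above.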
Thanks @cheind, I'll take a look. A side project I'd like to get back into.
@redwrasse, same for me :) I just figured out that it works nicely on 2D images as well (without the special architecture of PixelCNN, just plain WaveNet with unrolled images). In addition, once you have the joint distribution the model estimates, you can start to query all kinds of things from the model (for example, given a WaveNet conditioned on the speaker ID, what is the probability that this speech was spoken by speaker X?).
In case you are interested, I have a quite elaborate presentation + code here
https://github.com/cheind/autoregressive/tree/image-support
The branch will soon be merged into main and closed, so here is a permalink
https://github.com/cheind/autoregressive/tree/23701bd503843a1de82c6a32ba5bd6e8ad6965a3
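To make the speaker query above concrete, here is a rough sketch of how it could be computed, assuming you already have the per-step log-probabilities log p(x_t | x_<t, s) that a speaker-conditioned model assigns to the same recording under each candidate speaker s. The chain rule gives log p(x | s) as their sum, and Bayes' rule turns that into p(s | x). The function names, the speaker labels, and all numbers are made up for illustration; none of this is taken from the linked repository.

```python
import math


def sequence_log_likelihood(step_log_probs):
    """Chain rule: log p(x | s) = sum over t of log p(x_t | x_<t, s)."""
    return sum(step_log_probs)


def speaker_posterior(log_likelihoods, priors):
    """Bayes' rule: p(s | x) = p(x | s) p(s) / sum_s' p(x | s') p(s').

    log_likelihoods: dict speaker -> log p(x | speaker)
    priors:          dict speaker -> p(speaker), assumed to sum to 1
    Works in log space (log-sum-exp) because sequence log-likelihoods are
    typically large negative numbers that underflow when exponentiated.
    """
    log_joint = {
        s: log_likelihoods[s] + math.log(priors[s]) for s in log_likelihoods
    }
    peak = max(log_joint.values())
    log_norm = peak + math.log(
        sum(math.exp(v - peak) for v in log_joint.values())
    )
    return {s: math.exp(v - log_norm) for s, v in log_joint.items()}


# Hypothetical per-step log-probabilities for one short utterance,
# evaluated once per candidate speaker conditioning.
step_log_probs = {
    "speaker_a": [-1.2, -0.8, -1.0, -0.9],
    "speaker_b": [-2.0, -1.9, -2.2, -1.7],
}
log_lik = {s: sequence_log_likelihood(v) for s, v in step_log_probs.items()}
uniform_prior = {s: 1.0 / len(log_lik) for s in log_lik}
posterior = speaker_posterior(log_lik, uniform_prior)
```

With a uniform prior, the posterior simply favours the speaker under which the recording is most likely; a non-uniform prior shifts it accordingly.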