Comments (4)
For average length sentences, the inference time of Glow-TTS on a V100 GPU is about 40ms, whereas the inference time of FastSpeech was reported as ~25ms. You can find out in the papers (FastSpeech: Table 2, Glow-TTS: Section 5.2. Sampling Speed).
Glow-TTS is slower than FastSpeech, but I don't think the difference is significant in end-to-end setting, because vocoders are usually way slower than parallel TTS models.
from glow-tts.
@jaywalnut310 Do you try to inference on CPU, and compare with GPU?
from glow-tts.
I tried to measure inference on CPU, but I don't remember accurately. So, I cannot tell in certain, but I thought it was about 500ms to generate average length sentences on CPU.
You can try it with the pretrained model and inference notebook.
Closing the issue.
from glow-tts.
How would one configure the inference notebook to be CPU compatible?
I modified the script by adding the following after the import statements:
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
Then I replaced every call to cuda()
with to(device)
. The script is able to nearly finish, but fails on the second-to-last line of code.
When it tries to execute:
audio = waveglow.infer(y_gen_tst, sigma=.666)
it raises
RuntimeError: Found no NVIDIA driver on your system.
from glow-tts.
Related Issues (20)
- Question regarding fine tunning HOT 15
- Question about duration loss HOT 1
- Runtime Error: Multi speaker HOT 1
- GPU required or CPU-compatible? HOT 1
- Different Languages us different amount of GPU memory
- multi speaker
- Output compared to Fastspeech2
- Models for finetuning
- Could not create monotonic_align HOT 3
- Glowtts melspectrogram to fine tune hifigan HOT 2
- RuntimeError: CUDA error: invalid device function
- ImportError: /glow-tts/monotonic_align/monotonic_align/core.cpython-38-x86_64-linux-gnu.so: failed to map segment from shared object HOT 1
- Error using mel generated from glow-tts for hifi-gan training HOT 1
- Can I apply MAS method to other model ? HOT 1
- Query : How is the Model training different from the Model training of wave glow
- Multi speaker training error HOT 11
- With out Training DDI
- An explanation for the source code of finding the alignment path in GlowTTS? HOT 2
- DDI training compared to not DDI training HOT 1
- [Question] How many iterations for the available pretrained model?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from glow-tts.