by hphtwm
Open source · 144 downloads · 0 likes
This model is a fine-tuned version of SpeechT5, specifically trained on the VoxPopuli dataset for the German language. It generates speech from text with natural and expressive quality, making it well-suited for applications requiring German text-to-speech synthesis. Its primary use cases include creating voiceovers, voice assistance for interactive applications, or accessibility tools. What sets it apart is its training on a diverse and representative corpus of spoken German, enabling it to produce more natural intonations and rhythms than generic models.
This model is a fine-tuned version of microsoft/speecht5_tts on the voxpopuli dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.5277 | 2.2621 | 1000 | 0.4846 |
| 0.5106 | 4.5241 | 2000 | 0.4723 |
| 0.5029 | 6.7862 | 3000 | 0.4654 |
| 0.5043 | 9.0497 | 4000 | 0.4642 |