AI ExplorerAI Explorer
ToolsCategoriesSitesLLMsCompareAI QuizAlternativesPremium

—

AI Tools

—

Sites & Blogs

—

LLMs & Models

—

Categories

AI Explorer

Find and compare the best artificial intelligence tools for your projects.

Made within France

Explore

  • All tools
  • Sites & Blogs
  • LLMs & Models
  • Compare
  • Chatbots
  • AI Images
  • Code & Dev

Company

  • Premium
  • About
  • Contact
  • Blog

Legal

  • Legal notice
  • Privacy
  • Terms

© 2026 AI Explorer. All rights reserved.

HomeLLMsBengali finetuned speecht5 tts

Bengali finetuned speecht5 tts

by DeepDiveDev

Open source · 150 downloads · 0 likes

0.0
(0 reviews)AudioAPI & Local
About

This model is a fine-tuned version of SpeechT5, specifically adapted for Bengali text-to-speech synthesis. It converts text into natural, fluent speech by leveraging the vocal generation capabilities of the base model. Its primary use cases include generating audio content for educational applications, voice assistants, or reading services for the visually impaired. What sets it apart is its specialization in Bengali, a language underrepresented in TTS models, providing a solution tailored to the linguistic needs of the region.

Documentation

Bengali_finetuned_speecht5_tts

This model is a fine-tuned version of microsoft/speecht5_tts on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.6190

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 4
  • eval_batch_size: 2
  • seed: 42
  • gradient_accumulation_steps: 8
  • total_train_batch_size: 32
  • optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 100
  • training_steps: 600
  • mixed_precision_training: Native AMP

Training results

Training LossEpochStepValidation Loss
6.14411.94221000.7127
5.58763.89882000.6550
5.24515.85543000.6514
5.15147.81204000.6227
4.97279.76875000.6220
4.979711.72536000.6190

Framework versions

  • Transformers 4.46.0.dev0
  • Pytorch 2.5.0+cu121
  • Datasets 3.0.2
  • Tokenizers 0.20.1
Capabilities & Tags
transformerstensorboardsafetensorsspeecht5text-to-audiogenerated_from_trainerendpoints_compatible
Links & Resources
Specifications
CategoryAudio
AccessAPI & Local
LicenseOpen Source
PricingOpen Source
Rating
0.0

Try Bengali finetuned speecht5 tts

Access the model directly