speecht5 arabic female voice

by Kaizendsds

Open source · 794 downloads · 0 likes

About

The "speecht5 arabic female voice" model is a fine-tuned version of Microsoft's SpeechT5 text-to-speech model (microsoft/speecht5_tts), trained to generate a female voice in Arabic. It converts Arabic text into speech and is aimed at applications such as voice assistants, audiobooks, automated response systems, and accessibility tools for visually impaired users. The model card does not document the training data or report any listening-test results, so claims about intonation, prosody, or quality relative to generic text-to-speech models are not backed by anything beyond the evaluation loss given in the documentation.
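
Since the card includes no usage instructions, the following is a minimal sketch of how a SpeechT5 fine-tune is typically run with the transformers library. The repository id `Kaizendsds/speecht5_arabic_female_voice` is an assumption inferred from the author and model names on this page, the `microsoft/speecht5_hifigan` vocoder is the usual companion to SpeechT5 and is assumed here, and the `synthesize` helper is hypothetical. SpeechT5 conditions on a 512-dimensional speaker embedding (x-vector); the card does not say which embedding matches the training voice, so it is left as a required argument. How Arabic text was prepared for the tokenizer is also undocumented.

```python
# Hedged inference sketch for a SpeechT5 fine-tune (not taken from the model card).

# ASSUMPTION: repository id guessed from the author and model names on this page.
ASSUMED_MODEL_ID = "Kaizendsds/speecht5_arabic_female_voice"
# ASSUMPTION: the standard SpeechT5 HiFi-GAN vocoder is suitable for this fine-tune.
VOCODER_ID = "microsoft/speecht5_hifigan"
SAMPLE_RATE = 16000   # SpeechT5 with its HiFi-GAN vocoder produces 16 kHz audio
XVECTOR_DIM = 512     # size of the speaker embedding SpeechT5 expects


def synthesize(text, speaker_embedding, model_id=ASSUMED_MODEL_ID):
    """Return a 1-D waveform tensor for `text` (hypothetical helper).

    `speaker_embedding` must be a float tensor of shape (1, 512). The card
    does not provide one, so the caller has to supply it.
    """
    # Validate the embedding shape before loading anything heavy.
    if tuple(speaker_embedding.shape) != (1, XVECTOR_DIM):
        raise ValueError(f"speaker_embedding must have shape (1, {XVECTOR_DIM})")

    # Heavy imports are local so this file can be imported without them.
    import torch
    from transformers import (
        SpeechT5ForTextToSpeech,
        SpeechT5HifiGan,
        SpeechT5Processor,
    )

    processor = SpeechT5Processor.from_pretrained(model_id)
    model = SpeechT5ForTextToSpeech.from_pretrained(model_id)
    vocoder = SpeechT5HifiGan.from_pretrained(VOCODER_ID)

    # Tokenize the text and generate a waveform through the vocoder.
    inputs = processor(text=text, return_tensors="pt")
    with torch.no_grad():
        return model.generate_speech(
            inputs["input_ids"], speaker_embedding, vocoder=vocoder
        )
```

The returned tensor is a 16 kHz mono waveform that can be written to disk with, for example, `soundfile.write("out.wav", speech.numpy(), samplerate=16000)`. If the repository lacks processor files, loading the processor from `microsoft/speecht5_tts` may work instead, though that is untested here.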

Documentation

speecht5_arabic_female_voice

This model is a fine-tuned version of microsoft/speecht5_tts. The auto-generated card lists the training dataset as "None", so the dataset is not identified. It achieves the following results on the evaluation set:

  • Loss: 0.3636

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 3e-05
  • train_batch_size: 4
  • eval_batch_size: 2
  • seed: 42
  • gradient_accumulation_steps: 8
  • total_train_batch_size: 32
  • optimizer: AdamW (OptimizerNames.ADAMW_TORCH) with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 400
  • training_steps: 4000
  • mixed_precision_training: Native AMP
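
The relationship between the listed values can be checked with a short sketch. It assumes a single training device (which is what 4 × 8 = 32 implies) and uses the usual definition of a linear schedule with warmup: the rate rises linearly from 0 to the peak over the warmup steps, then decays linearly to 0 at the last step. The function names are illustrative and not part of the card.

```python
# Values copied from the hyperparameter list above.
LEARNING_RATE = 3e-5
TRAIN_BATCH_SIZE = 4
GRADIENT_ACCUMULATION_STEPS = 8
WARMUP_STEPS = 400
TRAINING_STEPS = 4000


def effective_batch_size(per_device, accumulation, num_devices=1):
    """Examples consumed per optimizer step (num_devices=1 is an assumption)."""
    return per_device * accumulation * num_devices


def linear_lr(step, base_lr=LEARNING_RATE, warmup=WARMUP_STEPS, total=TRAINING_STEPS):
    """Learning rate at `step` under linear warmup followed by linear decay."""
    if step < warmup:
        # Warmup phase: ramp from 0 up to base_lr.
        return base_lr * step / warmup
    # Decay phase: ramp from base_lr down to 0 at the final step.
    return base_lr * max(0.0, (total - step) / (total - warmup))


if __name__ == "__main__":
    print("effective batch:", effective_batch_size(TRAIN_BATCH_SIZE, GRADIENT_ACCUMULATION_STEPS))
    for s in (0, 200, 400, 2200, 4000):
        print(s, linear_lr(s))
```

Under these assumptions the peak rate of 3e-05 is reached at step 400 and the rate falls back to half the peak at step 2200, midway through the decay phase.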

Training results

Training Loss   Epoch     Step   Validation Loss
0.5085          4.3360    200    0.4609
0.4697          8.6721    400    0.4265
0.4509          13.0081   600    0.4111
0.4289          17.3442   800    0.4057
0.4208          21.6802   1000   0.4049
0.4136          26.0163   1200   0.3990
0.4072          30.3523   1400   0.3980
0.4011          34.6883   1600   0.3920
0.3971          39.0244   1800   0.3905
0.3898          43.3604   2000   0.3886
0.3923          47.8130   2200   0.3813
0.3755          52.1491   2400   0.3691
0.3629          56.4851   2600   0.3655
0.3599          60.8211   2800   0.3660
0.3525          65.1572   3000   0.3632
0.3493          69.4932   3200   0.3603
0.3452          73.8293   3400   0.3626
0.3468          78.1653   3600   0.3597
0.3411          82.5014   3800   0.3621
0.3464          86.8374   4000   0.3636
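
As a quick way to read the log, the sketch below loads the rows above and picks out the step with the lowest validation loss, the final step, and a rough steps-per-epoch figure. The tuple layout and variable names are illustrative, and the steps-per-epoch value is only an estimate from the first logged row.

```python
# Rows copied from the training log: (training_loss, epoch, step, validation_loss).
RESULTS = [
    (0.5085, 4.3360, 200, 0.4609),
    (0.4697, 8.6721, 400, 0.4265),
    (0.4509, 13.0081, 600, 0.4111),
    (0.4289, 17.3442, 800, 0.4057),
    (0.4208, 21.6802, 1000, 0.4049),
    (0.4136, 26.0163, 1200, 0.3990),
    (0.4072, 30.3523, 1400, 0.3980),
    (0.4011, 34.6883, 1600, 0.3920),
    (0.3971, 39.0244, 1800, 0.3905),
    (0.3898, 43.3604, 2000, 0.3886),
    (0.3923, 47.8130, 2200, 0.3813),
    (0.3755, 52.1491, 2400, 0.3691),
    (0.3629, 56.4851, 2600, 0.3655),
    (0.3599, 60.8211, 2800, 0.3660),
    (0.3525, 65.1572, 3000, 0.3632),
    (0.3493, 69.4932, 3200, 0.3603),
    (0.3452, 73.8293, 3400, 0.3626),
    (0.3468, 78.1653, 3600, 0.3597),
    (0.3411, 82.5014, 3800, 0.3621),
    (0.3464, 86.8374, 4000, 0.3636),
]

# Row with the lowest validation loss, and the last logged row.
best = min(RESULTS, key=lambda row: row[3])
final = RESULTS[-1]

# Rough optimizer steps per epoch, estimated from the first logged row.
first = RESULTS[0]
steps_per_epoch = first[2] / first[1]

if __name__ == "__main__":
    print("best step:", best[2], "validation loss:", best[3])
    print("final step:", final[2], "validation loss:", final[3])
    print("approx. steps per epoch:", round(steps_per_epoch, 1))
```

On the logged values, the lowest validation loss is 0.3597 at step 3600, slightly below the 0.3636 reported for the final step. The gap is small, and the card does not say which checkpoint was published.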

Framework versions

  • Transformers 4.46.3
  • Pytorch 2.10.0+cu128
  • Datasets 2.19.0
  • Tokenizers 0.20.3

Capabilities & Tags

transformers, tensorboard, safetensors, speecht5, text-to-audio, generated_from_trainer, endpoints_compatible

Specifications

  • Category: Audio
  • Access: API & Local
  • License: Open Source
  • Pricing: Open Source