by tencent
Open source · 1k downloads · 332 likes
SongGeneration is an advanced music-generation model designed to produce high-quality, professional songs from text descriptions or audio instructions. With an optimized architecture and training on massive datasets like the Million Song Dataset, it excels at creating tracks with rich melodies, coherent structures, and polished arrangements. What sets it apart is its exceptional accuracy in generating lyrics, boasting an impressively low phonetic error rate that addresses the lyrical hallucination issues common in competing models. It also offers high controllability, allowing users to refine results through multimodal prompts to adjust style, mood, or instrumentation. Evaluated by industry experts, SongGeneration rivals the best closed commercial systems while remaining open-source.

Demo | Paper | Code | Space Demo
This repository is the official weight repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment. In this repository, we provide the SongGeneration model, inference scripts, and the checkpoint that has been trained on the Million Song Dataset.
| Model | Max Length | Language | GPU Memory | RTF(H20) | Download Link |
|---|---|---|---|---|---|
| SongGeneration-base | 2m30s | zh | 10G/16G | 0.67 | Huggingface |
| SongGeneration-base-new | 2m30s | zh, en | 10G/16G | 0.67 | Huggingface |
| SongGeneration-base-full | 4m30s | zh, en | 12G/18G | 0.69 | Huggingface |
| SongGeneration-large | 4m30s | zh, en | 22G/28G | 0.82 | Huggingface |
| SongGeneration-v2-large | 4m30s | zh, en, es, ja, etc. | 22G/28G | 0.82 | Huggingface |
| SongGeneration-v2-medium | 4m30s | zh, en, es, ja, etc. | 12G/18G | 0.69 | Coming soon |
| SongGeneration-v2-fast | 4m30s | zh, en, es, ja, etc. | - | - | Coming soon |
🚀 We introduce LeVo 2 (SongGeneration 2), an open-source music foundation model designed to shatter the ceiling of open-source AI music by achieving true commercial-grade generation.
Through a large-scale, rigorous expert evaluation (20 industry professionals, 6 core dimensions, 100 songs per model), LeVo 2 has proven its superiority:
📊 For detailed experimental setups and comprehensive metrics, please refer to the Evaluation Performance section below or our upcoming technical report.
The code and weights in this repository is released in the LICENSE file.