AI/EXPLORER
ToolsCategoriesSitesLLMsCompareAI QuizAlternativesPremium
—AI Tools
—Sites & Blogs
—LLMs & Models
—Categories
AI Explorer

Find and compare the best artificial intelligence tools for your projects.

Made within France

Explore

  • ›All tools
  • ›Sites & Blogs
  • ›LLMs & Models
  • ›Compare
  • ›Chatbots
  • ›AI Images
  • ›Code & Dev

Company

  • ›Premium
  • ›About
  • ›Contact
  • ›Blog

Legal

  • ›Legal notice
  • ›Privacy
  • ›Terms

© 2026 AI Explorer·All rights reserved.

HomeLLMsacestep v15 xl sft

acestep v15 xl sft

by ACE-Step

Open source · 3k downloads · 64 likes

2.3
(64 reviews)AudioAPI & Local
About

ACE-Step 1.5 XL SFT is an AI model specialized in high-quality music generation, featuring 4 billion parameters. It enables the creation of musical pieces from text descriptions while offering precise control over adherence to prompts through classifier-free guidance (CFG). Designed for commercial use, it relies on legally compliant training data, including licensed music, royalty-free works, and audio synthesized from MIDI scores. This model stands out for its ability to produce professional results, suitable for content creators, artists, and music application developers alike. Its optimized architecture and advanced features make it a versatile solution for automated or assisted composition.

Documentation

ACE-Step 1.5 XL — SFT (4B DiT)

Project | Hugging Face | ModelScope | Space Demo | Discord | Tech Report

Model Details

This is the XL (4B) SFT variant of ACE-Step 1.5 — a supervised fine-tuned model with ~4B parameters. SFT provides higher audio quality with CFG (Classifier-Free Guidance) support for fine-grained prompt adherence control.

XL Architecture

ParameterValue
DiT Decoder hidden_size2560
DiT Decoder layers32
DiT Decoder attention heads32
Encoder hidden_size2048
Encoder layers8
Total params~4B
Weights size (bf16)~18.8 GB
Inference steps50 (with CFG)

GPU Requirements

VRAMSupport
≥12 GBWith CPU offload + INT8 quantization
≥16 GBWith CPU offload
≥20 GBWithout offload
≥24 GBFull quality (XL + 4B LM)

All LM models (0.6B / 1.7B / 4B) are fully compatible with XL.

Key Features

  • 💰 Commercial-Ready: Trained on legally compliant datasets. Generated music can be used for commercial purposes.
  • 📚 Safe Training Data: Licensed music, royalty-free/public domain, and synthetic (MIDI-to-Audio) data.
  • 🎯 CFG Support: Fine-tune prompt adherence with guidance scale control.
  • 🔮 Highest Quality: SFT + 4B parameters = the highest quality variant.

Quick Start

Bash
# Install ACE-Step
git clone https://github.com/ace-step/ACE-Step-1.5.git
cd ACE-Step-1.5
pip install -e .

# Download this model
huggingface-cli download ACE-Step/acestep-v15-xl-sft --local-dir ./checkpoints/acestep-v15-xl-sft

# Run with Gradio UI
python acestep --config-path acestep-v15-xl-sft

Model Zoo

XL (4B) DiT Models

DiT ModelCFGStepsQualityDiversityTasksHugging FaceModelScope
acestep-v15-xl-base✅50HighHighAll (extract, lego, complete)LinkLink
acestep-v15-xl-sft✅50Very HighMediumStandardThis repoLink
acestep-v15-xl-turbo❌8Very HighMediumStandardLinkLink

LM Models (all compatible with XL)

LM ModelParamsAudio UnderstandingCompositionHugging FaceModelScope
acestep-5Hz-lm-0.6B0.6BMediumMediumLinkLink
acestep-5Hz-lm-1.7B1.7BMediumMediumIncluded in mainIncluded in main
acestep-5Hz-lm-4B4BStrongStrongLinkLink

Acknowledgements

This project is co-led by ACE Studio and StepFun.

Citation

BibTeX
@misc{gong2026acestep,
    title={ACE-Step 1.5: Pushing the Boundaries of Open-Source Music Generation},
    author={Junmin Gong, Yulin Song, Wenxiao Zhao, Sen Wang, Shengyuan Xu, Jing Guo},
    howpublished={\url{https://github.com/ace-step/ACE-Step-1.5}},
    year={2026},
    note={GitHub repository}
}
Capabilities & Tags
transformerssafetensorsacestepfeature-extractionaudiomusictext2musiccustom_codetext-to-audio
Links & Resources
Specifications
CategoryAudio
AccessAPI & Local
LicenseOpen Source
PricingOpen Source
Rating
2.3

Try acestep v15 xl sft

Access the model directly