musicgen small onnx

by chinedudave06

Open source · 291 downloads · 0 likes

Rating: 0.0 (0 reviews) · Audio · API & Local

About

MusicGen Small ONNX is an ONNX export of facebook/musicgen-small, a model that generates music from text descriptions such as "a soft jazz melody" or "an energetic electronic rhythm." Its decoder is split into a KV-cache pair so that autoregressive generation runs efficiently on-device, particularly on mobile hardware. The export totals about 3.6 GB in FP32 and produces mono audio at 32 kHz, up to roughly 30 seconds per generation. It is aimed at content creators, music app developers, and artists who want to explore sound ideas quickly.

Documentation

MusicGen Small — ONNX (KV-Cache)

ONNX export of facebook/musicgen-small with KV-cache decoder for efficient on-device autoregressive generation.

Model Details

| Property | Value |
|---|---|
| Base Model | facebook/musicgen-small |
| Precision | FP32 |
| Audio | Mono (1 channel) |
| Codebooks | 4 |
| Hidden Size | 1024 |
| Sample Rate | 32 kHz |
| Max Length | 1500 steps (~30 s) |
| Total Size | ~3.6 GB |
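
The Max Length and Sample Rate rows imply a step rate and a number of audio samples per step. The sketch below derives both from the table values and converts a target duration into a step count. The helper name `steps_for_duration` is hypothetical, and the calculation ignores any extra steps a codebook delay pattern may consume.

```python
import math

# Values taken from the Model Details table.
SAMPLE_RATE_HZ = 32_000
MAX_STEPS = 1500
APPROX_MAX_SECONDS = 30

# Derived quantities (assumption: the "~30 s" figure is exact).
steps_per_second = MAX_STEPS / APPROX_MAX_SECONDS      # 50 steps per second
samples_per_step = SAMPLE_RATE_HZ / steps_per_second   # 640 samples per step


def steps_for_duration(seconds: float) -> int:
    """Hypothetical helper: decoder steps needed for `seconds` of audio,
    capped at the model's maximum length."""
    return min(MAX_STEPS, math.ceil(seconds * steps_per_second))


if __name__ == "__main__":
    print(steps_for_duration(10))  # 10 s of audio
    print(steps_for_duration(60))  # capped at MAX_STEPS
```

Under these assumptions, a 10-second clip needs 500 decoder steps, and any request longer than 30 seconds is capped at 1500.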

Files

| File | Description | Size |
|---|---|---|
| decoder_model.onnx | Step-0 decoder (no KV-cache) | 1.6 GB |
| decoder_with_past_model.onnx | Steps 1+ decoder (with KV-cache) | 1.4 GB |
| text_encoder.onnx | T5 text encoder | 419 MB |
| encodec_decode.onnx | EnCodec audio decoder | 113 MB |
| tokenizer.json | T5 tokenizer vocabulary | 2.4 MB |
| config.json | Model architecture config | <1 KB |
| generation_config.json | Generation parameters | <1 KB |
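
Because the export spans several large files, it can help to confirm that a local copy is complete before loading anything. The following is a minimal sketch using only the file names listed above; the function name `missing_files` and the directory layout (all files in one folder) are assumptions.

```python
from pathlib import Path

# File names as listed in the Files table.
EXPECTED_FILES = [
    "decoder_model.onnx",
    "decoder_with_past_model.onnx",
    "text_encoder.onnx",
    "encodec_decode.onnx",
    "tokenizer.json",
    "config.json",
    "generation_config.json",
]


def missing_files(model_dir: str) -> list[str]:
    """Hypothetical helper: return the expected files absent from model_dir."""
    root = Path(model_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]
```

An empty result means every file from the table is present in the given directory.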

Usage

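These models are designed for the DJNed Android app, which runs them with ONNX Runtime. The KV-cache decoder pair lets every step after the first feed only the newest token and reuse cached attention keys and values, so the number of token positions pushed through the decoder per step is O(1) instead of the O(n) needed when the whole prefix is re-run.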
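
To make the O(1) versus O(n) comparison concrete, the sketch below counts how many token positions pass through the decoder over a full generation, with and without a cache. It is a simplified cost model, not a benchmark: it counts positions only and does not model attention, which still reads the whole cache at each step.

```python
def positions_without_cache(num_steps: int) -> int:
    """Without a cache, step t re-runs the decoder over all t + 1 tokens."""
    return sum(t + 1 for t in range(num_steps))


def positions_with_cache(num_steps: int) -> int:
    """With a cache, every step feeds exactly one new token."""
    return num_steps


if __name__ == "__main__":
    n = 1500  # maximum length from the Model Details table
    print(positions_without_cache(n), positions_with_cache(n))
```

For the 1500-step maximum, this gives 1,125,750 positions without a cache against 1,500 with one.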

Pipeline

  1. Text encoding: text_encoder.onnx encodes the text prompt
  2. Step 0: decoder_model.onnx generates the first token + initial KV-cache
  3. Steps 1+: decoder_with_past_model.onnx generates subsequent tokens using KV-cache
  4. Audio decode: encodec_decode.onnx converts codebook tokens to audio waveform
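
The four stages above can be sketched as a single control loop. This is an illustration of the data flow only: the four callables are hypothetical stand-ins for the ONNX sessions, token sampling is assumed to happen inside the decoder callables, and the cache object is assumed to carry everything the later steps need.

```python
from typing import Any, Callable, List, Tuple

Cache = Any  # opaque KV-cache object (assumption)


def generate(
    prompt: str,
    encode_text: Callable[[str], Any],                     # text_encoder.onnx
    decode_first: Callable[[Any], Tuple[Any, Cache]],      # decoder_model.onnx
    decode_next: Callable[[Any, Cache], Tuple[Any, Cache]],  # decoder_with_past_model.onnx
    decode_audio: Callable[[List[Any]], Any],              # encodec_decode.onnx
    num_steps: int,
) -> Any:
    """Run the four-stage pipeline for num_steps decoder steps (>= 1)."""
    encoder_states = encode_text(prompt)              # 1. text encoding
    token, cache = decode_first(encoder_states)       # 2. step 0, builds the cache
    tokens = [token]
    for _ in range(num_steps - 1):                    # 3. steps 1+, reuse the cache
        token, cache = decode_next(token, cache)
        tokens.append(token)
    return decode_audio(tokens)                       # 4. tokens to waveform
```

In a real integration each callable would wrap an ONNX Runtime session; the input and output tensor names depend on the export and are not specified here.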

License

This model is derived from Meta's MusicGen, whose weights are released under the CC-BY-NC-4.0 license, which restricts use to non-commercial purposes.

Capabilities & Tags

onnxruntime, onnx, musicgen, music-generation, kv-cache, text-to-audio, on-device, android, en