Sign In

ACE-Step Audio Gen

14.9k

Updated: Apr 30, 2026

base modelaudio gen

Download

1 variant available

bf16 SafeTensor

qwen_4b_ace15.safetensors

BF16, good balance • 7.8 GB

Verified:

Type

Checkpoint Trained

Stats

40

Reviews

Published

Apr 30, 2026

Base Model

Other

Hash

AutoV2
FFE5FFB855
default creator card background decoration
Followers - 16826

16.8K

Downloads - 1308391

1.3M

Generations - 7867447

7.9M

License:

e53562aa-0ed7-44c7-a2a0-53bdade36105.png

Originally posted: https://huggingface.co/ACE-Step/ACE-Step-v1-3.5B

https://github.com/ace-step/ACE-Step

Model Description

ACE-Step is a novel open-source foundation model for music generation that overcomes key limitations of existing approaches through a holistic architectural design. It integrates diffusion-based generation with Sana's Deep Compression AutoEncoder (DCAE) and a lightweight linear transformer, achieving state-of-the-art performance in generation speed, musical coherence, and controllability.

Key Features:

  • 15× faster than LLM-based baselines (20s for 4-minute music on A100)

  • Superior musical coherence across melody, harmony, and rhythm

  • full-song generation, duration control and accepts natural language descriptions

Uses

Direct Use

ACE-Step can be used for:

  • Generating original music from text descriptions

  • Music remixing and style transfer

  • edit song lyrics

Downstream Use

The model serves as a foundation for:

  • Voice cloning applications

  • Specialized music generation (rap, jazz, etc.)

  • Music production tools

  • Creative AI assistants