
ACE-Step Audio Gen

Name: ACE-Step Audio Gen
Rating: 5 (18 reviews)
Author: CivitaiOfficial

Updated: Apr 30, 2026

base model

audio gen

1 variant available

bf16 SafeTensor

BF16, good balance • 9.29 GB

Verified: 2 months ago
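
The variant above is listed as a bf16 SafeTensor file. As a rough sketch of how one might confirm the stored dtypes without loading 9.29 GB of weights, the snippet below reads only the JSON header of a safetensors file (an 8-byte little-endian header length followed by the header itself). The function names and the file path in the usage note are hypothetical and are not part of ACE-Step or Civitai.

```python
import json
import struct


def read_safetensors_header(path):
    """Return the JSON header of a safetensors file as a dict."""
    with open(path, "rb") as f:
        # The first 8 bytes hold the header length as an unsigned little-endian integer.
        (header_len,) = struct.unpack("<Q", f.read(8))
        return json.loads(f.read(header_len).decode("utf-8"))


def dtype_counts(header):
    """Count tensors per dtype string, skipping the optional metadata entry."""
    counts = {}
    for name, info in header.items():
        if name == "__metadata__":
            continue
        counts[info["dtype"]] = counts.get(info["dtype"], 0) + 1
    return counts
```

Calling `dtype_counts(read_safetensors_header("model.safetensors"))` on the downloaded file (path assumed) should report mostly `BF16` entries if the listing is accurate.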

Details

Type: Checkpoint Trained
Stats: 194, 13.3K, 142.5K
Reviews: Positive (16)
Published: Apr 30, 2026
Base Model: ACE Audio
Hash (AutoV2): 86A1AFB0A1
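
The AutoV2 hash listed above can be used to check a downloaded file. The sketch below assumes that an AutoV2 hash is the first 10 hexadecimal characters of the file's SHA-256 digest, in upper case; treat that as an assumption to confirm against Civitai's own documentation. The function names are illustrative.

```python
import hashlib


def autov2_hash(path, chunk_size=1 << 20):
    """Return the first 10 hex characters (upper case) of the file's SHA-256."""
    sha = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so a multi-gigabyte checkpoint never sits fully in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            sha.update(chunk)
    return sha.hexdigest()[:10].upper()


def matches_listed_hash(path, listed="86A1AFB0A1"):
    """Compare a local file against the hash shown on the model page."""
    return autov2_hash(path) == listed.upper()
```

A mismatch usually points to an incomplete or corrupted download rather than a different model version, so re-downloading is a sensible first step.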

License:

Originally posted: https://huggingface.co/ACE-Step/ACE-Step-v1-3.5B

https://github.com/ace-step/ACE-Step

Model Description

ACE-Step is a novel open-source foundation model for music generation that overcomes key limitations of existing approaches through a holistic architectural design. It integrates diffusion-based generation with Sana's Deep Compression AutoEncoder (DCAE) and a lightweight linear transformer, achieving state-of-the-art performance in generation speed, musical coherence, and controllability.

Key Features:

15× faster than LLM-based baselines (20s for 4-minute music on A100)
Superior musical coherence across melody, harmony, and rhythm
Full-song generation with duration control, guided by natural language descriptions
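
To put the speed figure in perspective, here is a small worked calculation. It assumes the "15× faster" claim refers to the same workload as the parenthetical (4 minutes of music generated in 20 s on an A100); the source does not state this explicitly, so the implied baseline time is only an estimate.

```python
def real_time_factor(audio_seconds, generation_seconds):
    """Seconds of audio produced per second of compute."""
    return audio_seconds / generation_seconds


def implied_baseline_seconds(generation_seconds, speedup):
    """Generation time a baseline would need if it is `speedup` times slower."""
    return generation_seconds * speedup


audio_seconds = 4 * 60   # 4-minute track, from the feature list
generation_seconds = 20  # quoted A100 generation time
speedup = 15             # quoted advantage over LLM-based baselines

rtf = real_time_factor(audio_seconds, generation_seconds)
baseline = implied_baseline_seconds(generation_seconds, speedup)
print(f"ACE-Step: {rtf:.0f}x real time; implied baseline: {baseline} s")
```

Under these assumptions the model runs at about 12 times real time, while an LLM-based baseline would need roughly 300 s, which is slower than the 240 s track itself.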

Uses

Direct Use

ACE-Step can be used for:

Generating original music from text descriptions
Music remixing and style transfer
Editing song lyrics

Downstream Use

The model serves as a foundation for:

Voice cloning applications
Specialized music generation (rap, jazz, etc.)
Music production tools
Creative AI assistants