home models images videos 3D Models articles comics bounties challenges updates shop

ACE-Step Audio Gen

Name: ACE-Step Audio Gen
Rating: 5 (18 reviews)
Author: CivitaiOfficial

538

14.9k

Updated: Apr 30, 2026

base model

audio gen

Download

1 variant available

bf16 SafeTensor

qwen_4b_ace15.safetensors

BF16, good balance • 7.8 GB

Verified: 3 months ago

Download (7.8 GB)

Details

Type

Checkpoint Trained

Stats

Reviews

Positive

(2)

Published

Apr 30, 2026

Base Model

Other

Hash

AutoV2

FFE5FFB855

Tensors

default creator card background decoration

16.8K

1.3M

7.9M

CivitaiOfficial

Joined Nov 12, 2022

License:

Originally posted: https://huggingface.co/ACE-Step/ACE-Step-v1-3.5B

https://github.com/ace-step/ACE-Step

Model Description

ACE-Step is a novel open-source foundation model for music generation that overcomes key limitations of existing approaches through a holistic architectural design. It integrates diffusion-based generation with Sana's Deep Compression AutoEncoder (DCAE) and a lightweight linear transformer, achieving state-of-the-art performance in generation speed, musical coherence, and controllability.

Key Features:

15× faster than LLM-based baselines (20s for 4-minute music on A100)
Superior musical coherence across melody, harmony, and rhythm
full-song generation, duration control and accepts natural language descriptions

Uses

Direct Use

ACE-Step can be used for:

Generating original music from text descriptions
Music remixing and style transfer
edit song lyrics

Downstream Use

The model serves as a foundation for:

Voice cloning applications
Specialized music generation (rap, jazz, etc.)
Music production tools
Creative AI assistants