ACE-Step Audio Gen

Updated: Apr 30, 2026

base model, audio gen

1 variant available

bf16 SafeTensor

BF16, good balance • 9.29 GB
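
To confirm that a downloaded file really stores its weights as BF16, the SafeTensors header can be read without loading any tensors. The sketch below relies on the SafeTensors file layout (an 8-byte little-endian header length followed by a JSON header); the file name in the usage note is a hypothetical placeholder, not a name given on this page.

```python
import json
import struct
from collections import Counter


def read_safetensors_header(path):
    """Parse only the JSON header of a .safetensors file (no tensor data is read)."""
    with open(path, "rb") as f:
        # First 8 bytes: unsigned little-endian 64-bit length of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # "__metadata__" is an optional free-form entry, not a tensor.
    header.pop("__metadata__", None)
    return header


def dtype_counts(path):
    """Count tensors per dtype string, e.g. Counter({'BF16': 1234})."""
    header = read_safetensors_header(path)
    return Counter(entry["dtype"] for entry in header.values())
```

Calling `dtype_counts("ace_step_bf16.safetensors")` (hypothetical file name) on a pure BF16 checkpoint should report `BF16` as the only key.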

Type: Checkpoint Trained

Published: Apr 30, 2026

Base Model: ACE Audio

Hash (AutoV2): 86A1AFB0A1

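The listed hash can be used to check a download. The sketch below assumes the common convention that an AutoV2 hash is the first 10 hexadecimal characters of the file's SHA-256 digest, in uppercase; this page does not define the scheme, so treat that as an assumption. The function names are illustrative.

```python
import hashlib

EXPECTED_AUTOV2 = "86A1AFB0A1"  # value listed on this page


def autov2_hash(path, chunk_size=1 << 20):
    """Return the first 10 hex characters (uppercase) of the file's SHA-256.

    Assumption: AutoV2 is a truncated SHA-256; the page does not state this.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so a multi-gigabyte checkpoint never sits in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()[:10].upper()


def matches_listed_hash(path, expected=EXPECTED_AUTOV2):
    """True if the file's truncated digest equals the listed hash."""
    return autov2_hash(path) == expected.upper()
```

A mismatch usually means an incomplete or altered download, or that the hash was computed with a different scheme than the one assumed here.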
License:

Originally posted: https://huggingface.co/ACE-Step/ACE-Step-v1-3.5B

https://github.com/ace-step/ACE-Step

Model Description

ACE-Step is an open-source foundation model for music generation, designed to overcome key limitations of existing approaches. It combines diffusion-based generation with Sana's Deep Compression AutoEncoder (DCAE) and a lightweight linear transformer, and its authors report state-of-the-art performance in generation speed, musical coherence, and controllability.

Key Features:

  • 15× faster than LLM-based baselines (20 s to generate 4 minutes of music on an A100)

  • Superior musical coherence across melody, harmony, and rhythm

  • Full-song generation with duration control, driven by natural-language descriptions
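
The speed figures in the list above can be turned into two easier-to-compare numbers. This is only arithmetic on the quoted values, and it assumes that "15× faster" means a baseline needs 15 times as long to produce the same 4-minute clip on the same hardware.

```python
AUDIO_SECONDS = 4 * 60        # 4 minutes of generated music
GENERATION_SECONDS = 20       # quoted generation time on an A100
SPEEDUP_OVER_BASELINE = 15    # quoted speedup over LLM-based baselines


def real_time_factor(audio_seconds, generation_seconds):
    """Seconds of audio produced per second of compute."""
    return audio_seconds / generation_seconds


def implied_baseline_seconds(generation_seconds, speedup):
    """Baseline time for the same clip, assuming speedup = baseline / ours."""
    return generation_seconds * speedup


rtf = real_time_factor(AUDIO_SECONDS, GENERATION_SECONDS)
baseline = implied_baseline_seconds(GENERATION_SECONDS, SPEEDUP_OVER_BASELINE)
print(f"real-time factor: {rtf:.0f}x")          # real-time factor: 12x
print(f"implied baseline time: {baseline} s")   # implied baseline time: 300 s
```

Under that reading, the model generates audio about 12 times faster than playback, while the implied baseline takes about 5 minutes for a 4-minute clip, slightly slower than real time.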

Uses

Direct Use

ACE-Step can be used for:

  • Generating original music from text descriptions

  • Music remixing and style transfer

  • Editing song lyrics

Downstream Use

The model serves as a foundation for:

  • Voice cloning applications

  • Specialized music generation (rap, jazz, etc.)

  • Music production tools

  • Creative AI assistants