DiffRhythm AI & API: Instant Full-Length Song Generation!

Create complete, professional-quality songs in seconds! DiffRhythm is the world's first open source latent diffusion model that generates full length vocal and instrumental tracks from simple text prompts. Describe your vision and let DiffRhythm handle the rhythm, melody, and lyrics.

AI Playground Try our API!Join Discord

DiffRhythm Playground

Generate audio based on lyrics and style prompts with Qubico/DiffRhythm

Configuration

Model*

Lyrics*

The lyrics of the audio following a specific time structure. Format: [mm:ss.xx] lyric text

Style Prompt*

Describe the style of audio that you want to generate (e.g., pop, rock, jazz, electronic)

Reference Audio Style (Optional)

📁

Upload Files

Click or drag a file (JPEG, JPG, PNG)

Upload a reference audio file to define the style. Alternative to style prompt.

Result

Idle

This shows preset sample previews. Sign in and click 'Generate audio' to create your own.

Logs

No logs yet

DiffRhythm Music Generation Demo

Switch between examples to see different input/output combinations.

Audio Reference

Reference audio

Input Lyrics


  [00:00.52]Abracadabra abracadabra
  [00:03.97]Ha
  [00:04.66]Abracadabra abracadabra
  [00:12.02]Yeah
  [00:15.80]Pay the toll to the angels
  [00:19.08]Drawin' circles in the clouds
  [00:23.31]Keep your mind on the distance
  [00:26.67]When the devil turns around
  [00:30.95]Hold me in your heart tonight
  [00:34.11]In the magic of the dark moonlight
  [00:38.44]Save me from this empty fight
  [00:43.83]In the game of life
  [00:45.84]Like a poem said by a lady in red
  [00:49.45]You hear the last few words of your life
  [00:53.15]With a haunting dance now you're both in a trance
  [00:56.90]It's time to cast your spell on the night
  [01:01.40]Abracadabra ama-ooh-na-na
  [01:04.88]Abracadabra porta-ooh-ga-ga
  [01:08.92]Abracadabra abra-ooh-na-na
  [01:12.30]In her tongue she's sayin'
  [01:14.76]Death or love tonight
  [01:18.61]Abracadabra abracadabra
  [01:22.18]Abracadabra abracadabra
  [01:26.08]Feel the beat under your feet
  [01:27.82]The floor's on fire
  [01:29.90]Abracadabra abracadabra

DiffRhythm Generated Music

Generated music output

Features

End-to-End Full-Length Music: Generate complete songs up to 4 minutes 45 seconds in a single step - no stitching short clips or multi-stage workflows.
Style + Scene-Driven Creation!: Describe moods, genres, or imagery (e.g. 'Jazzy Nightclub Vibe' or 'Indie Folk Ballad with Acoustic Harmonica') to shape unique compositions.
Asynchronous API Calls: Our asynchronous API call structure allows developers to submit tasks and have their program continues on, until a "callback" function is executed.
Instrumental Mode: Craft soundscapes from wild prompts like "Arctic Theremin Storms" - perfect for film scores, game soundtracks, or experimental music.
Pure Vocal Generation: Focus on lyrical storytelling with standalone vocal tracks, ideal for refining lyrics or acapella projects.
Multilingual Music: Seamlessly generate songs in English or Chinese, with natural-sounding vocal phrasing in both languages.
High Concurrency: Experience stable performance even under the most demanding load - our service can automatically scale as per varying peak load, processing high number of jobs concurrently while keeping latencies to the minimum!
10-Second Inference Speed: Leverage non-autoregressive architecture to create songs 100x faster than language model-based alternatives.
Open-Source Freedom: Apache 2.0 license allows commercial use, customization, and integration into your creative tools or apps.
Dynamic Length Control: Adjust song duration on the fly, from 30-second jingles to extended 10-minute compositions (coming soon!).
Song Extension & Remixing: Expand existing tracks or blend styles by extending AI-generated songs with new sections (coming soon!).

Our Pricing Plans

"Pay-as-you-go" Option

For the most updated unit pricing, please see our pricing page for more details!

txt2audio base | 1.35 minutes

$0.02/call

txt2audio full | 4.45 minutes

$0.02/call

Check our Pricing page for more information

Detailed DiffRhythm Pricing To our API Playground!

Frequently asked questions