Create complete, professional-quality songs in seconds! DiffRhythm is the world’s first open source latent diffusion model that generates full length vocal and instrumental tracks from simple text prompts. Describe your vision and let DiffRhythm handle the rhythm, melody, and lyrics.
Switch between examples to see different input/output combinations.
Reference audio
[00:00.52]Abracadabra abracadabra [00:03.97]Ha [00:04.66]Abracadabra abracadabra [00:12.02]Yeah [00:15.80]Pay the toll to the angels [00:19.08]Drawin' circles in the clouds [00:23.31]Keep your mind on the distance [00:26.67]When the devil turns around [00:30.95]Hold me in your heart tonight [00:34.11]In the magic of the dark moonlight [00:38.44]Save me from this empty fight [00:43.83]In the game of life [00:45.84]Like a poem said by a lady in red [00:49.45]You hear the last few words of your life [00:53.15]With a haunting dance now you're both in a trance [00:56.90]It's time to cast your spell on the night [01:01.40]Abracadabra ama-ooh-na-na [01:04.88]Abracadabra porta-ooh-ga-ga [01:08.92]Abracadabra abra-ooh-na-na [01:12.30]In her tongue she's sayin' [01:14.76]Death or love tonight [01:18.61]Abracadabra abracadabra [01:22.18]Abracadabra abracadabra [01:26.08]Feel the beat under your feet [01:27.82]The floor's on fire [01:29.90]Abracadabra abracadabra
Generated music output
For the most updated unit pricing, please
see our pricing page more details!
$0.02/call
$0.02/call
DiffRhythm is the first open-source latent diffusion model developed by ASLP Lab, designed specifically for end-to-end full-length song generation. It creates complete tracks (up to 285s) with both vocals and instrumental accompaniment in seconds, using only text prompts like lyrics and style descriptions. Unlike traditional multi-stage systems, it combines simplicity, speed, and scalability - making AI powered music creation accessible to everyone.
DiffRhythm API is created by PiAPI and it lets developers integrate our song generation technology directly into apps, tools, or workflows. It provides programmatic access to the model’s capabilities, enabling features like bulk generation, real-time customization, and seamless scaling for commercial use. Whether you’re building a music production app, a game soundtrack engine, or a creative AI platform, the API handles compute-heavy tasks while you focus on user experience.
The DiffRhythm API supports virtually any genre or style described in your prompts! Whether you need pop, jazz, electronic, folk, cinematic scores, or experimental soundscapes, it adapts to your creative vision.
For our "Pay-as-you-go" option, it costs $0.02 for each generation call for both txt2audio base (1.35 mins) and txt2audio full (4.45mins).
Yes, different subscription plans (Free, Creator, or Pro Plan) will have grant the "Pay-as-you-go" users different number of concurrent jobs, please refer to our pricing for details.
Yes! The Apache 2.0 license permits commercial use, but you must verify originality and disclose AI involvement!
Just lyrics (optional) and a style prompt. No instrumental references or melody templates needed!
Our workspace has integrated Stripe in our payment system, which will allow payments to be made from most major credit card providers.
No, we do not offer refunds. But when you first sign up for an account on PiAPI‘s Workspace, you are given free credits to try our DiffRhythm (specifically the "Pay-as-you-go" service) before making payments!
Please email us at contact@piapi.ai - we'd love to listen to your feedback and explore potential collaborations!