Stability AI, the world’s leading open generative AI company, today announced the launch of Stable Audio, the company’s first AI product for music and sound generation.
Stable Audio is a first-of-its-kind product that uses the latest generative AI techniques to deliver faster, higher-quality music and sound effects via an easy-to-use web interface. Stability AI offers a basic free version of Stable Audio, which can be used to generate and download tracks of up to 20 seconds, and a ‘Pro’ subscription, which delivers 90-second tracks that are downloadable for commercial projects.
“As the only independent, open and multimodal generative AI company, we are thrilled to use our expertise to develop a product in support of music creators,” said Emad Mostaque, CEO of Stability AI. “Our hope is that Stable Audio will empower music enthusiasts and creative professionals to generate new content with the help of AI, and we look forward to the endless innovations it will inspire.”
Stable Audio is ideal for musicians seeking to create samples to use in their music, but the opportunities for creators are limitless. Audio tracks are generated in response to descriptive text prompts supplied by the user, along with a desired length of audio. For instance, a user can enter “Post-Rock, Guitars, Drum Kit, Bass, Strings, Euphoric, Up-Lifting, Moody, Flowing, Raw, Epic, Sentimental, 125 BPM” with a request for a 95-second track, and Stable Audio will generate a track matching that description and length.
The underlying model was trained using music and metadata from AudioSparx, a leading music library, in a partnership between the companies that will generate both economic and creative value for all parties.
Stable Audio is the first music generation product enabling the creation of high-quality, 44.1 kHz music for commercial use via latent diffusion. The latent diffusion model is conditioned on text metadata as well as on audio file duration and start time, which allows control over both the content and the length of the generated audio. You can read more about the research behind the model here. For further information or to provide feedback on the release, we welcome you to contact us at research@stability.ai.
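To make the conditioning idea concrete, here is a minimal sketch of how a text prompt might be bundled with a start time and a total duration before being passed to a model. It is an illustration only, not Stability AI's implementation or API: the names `Conditioning`, `make_conditioning`, `seconds_start`, `seconds_total`, and `num_samples` are hypothetical, and the only value taken from this announcement is the 44.1 kHz sample rate.

```python
from dataclasses import dataclass

SAMPLE_RATE_HZ = 44_100  # 44.1 kHz, as stated in the announcement


@dataclass(frozen=True)
class Conditioning:
    """Hypothetical container for the three conditioning signals."""
    prompt: str           # descriptive text metadata
    seconds_start: float  # where the requested audio starts within the full track
    seconds_total: float  # total duration of the full track


def make_conditioning(prompt: str, seconds_total: float,
                      seconds_start: float = 0.0) -> Conditioning:
    """Validate the timing values and bundle them with the prompt (assumed rules)."""
    if seconds_total <= 0:
        raise ValueError("seconds_total must be positive")
    if not 0 <= seconds_start < seconds_total:
        raise ValueError("seconds_start must lie within the track")
    return Conditioning(prompt.strip(), float(seconds_start), float(seconds_total))


def num_samples(seconds: float) -> int:
    """Number of audio samples per channel for a given duration at 44.1 kHz."""
    return round(seconds * SAMPLE_RATE_HZ)
```

For the 95-second example prompt above, `make_conditioning("Post-Rock, Guitars, Drum Kit, 125 BPM", 95)` would carry a total duration of 95.0 seconds, and `num_samples(95)` works out to 4,189,500 samples per channel.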
You can try Stable Audio at www.stableaudio.com.