Stability AI shows tool for creating audio trained on royalty-free sound
Stability AI, best known for its image generator Stable Diffusion, has announced Stable Audio Open. The tool can generate up to 47 seconds of audio from a text prompt and is said to have only been trained on royalty-free material.
Stability AI: Stable Audio Open
The model cannot generate music with vocals, so says Stability AI. The target group includes makers who want background sounds and want to rely on royalty-free material. The input is a text prompt, after which up to 47 seconds of sound is output. The sources are Freesound and Free Music Archive, as can be seen from the page on HuggingFace. Stable Audio Open is available for free, unlike Stable Audio which is only available to paying customers and can generate up to three minutes of audio at a time. Additionally, Open only works with text prompts, while Stable Audio allows customers to upload samples as well.