GitHub - suno-ai/bark: 🔊 Text-Prompted Generative Audio Model

🔊 Introducing Bark by Suno AI - an open-source, multilingual text-to-audio model on GitHub. 🎶 Generate speech, music, and more with transformer-based technology. 🌐 Supports various languages and provides 100+ speaker presets. Check it out for your audio needs! 🎧🤖 #AI #TextToSpeech #OpenSource

Bark is an open-source text-to-audio model created by Suno that can generate multilingual speech, music, background noise, and nonverbal communications like laughing and sighing.
Bark is transformer-based and provides pretrained model checkpoints for inference and commercial use.
Bark was developed for research purposes and may deviate unexpectedly from prompts; it's a fully generative text-to-audio model.
Updates to Bark include licensing changes, speed improvements, long-form generation documentation, and a voice prompt library.
Bark supports various languages, detects language automatically, and includes music generation capability by adding music notes in prompts.
It offers 100+ speaker presets, longer audio generation options, and hardware support for both CPU and GPU.
Bark is built on a GPT-style architecture and directly converts input text prompts to audio without using phonemes.