GitHub - suno-ai/bark: 🔊 Text-Prompted Generative Audio Model
Bark by Suno AI is an open-source, multilingual text-to-audio model hosted on GitHub. It uses a transformer-based architecture to generate speech, music, and other audio, supports several languages, and ships with 100+ speaker presets.
- Bark is an open-source text-to-audio model created by Suno that can generate multilingual speech, music, background noise, and nonverbal communications like laughing and sighing.
- Bark is transformer-based, and Suno provides pretrained model checkpoints that are ready for inference and available for commercial use.
- Bark was developed for research purposes. Because it is a fully generative text-to-audio model rather than a conventional text-to-speech system, its output may deviate unexpectedly from the prompt.
- Updates to Bark include licensing changes, speed improvements, long-form generation documentation, and a voice prompt library.
- Bark supports multiple languages and detects the language of the input text automatically. It can also generate music: wrapping lyrics in music notes (♪) in the prompt nudges it toward singing.
- It offers 100+ speaker presets and options for generating longer audio, and it runs on both CPU and GPU.
- Bark is built on a GPT-style architecture and directly converts input text prompts to audio without using phonemes.
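
The prompt conventions summarized above (music notes around lyrics, named speaker presets) can be sketched in Python as follows. This is a minimal illustration, not code from the Bark repository: `as_lyrics`, `speaker_preset`, and `synthesize` are hypothetical helper names, and the `v2/<lang>_speaker_<n>` preset format is an assumption about how the preset library is named. Only `preload_models`, `generate_audio`, and `SAMPLE_RATE` are taken from the `bark` package, and they are imported lazily so the prompt helpers work without Bark installed.

```python
from typing import Optional


def as_lyrics(text: str) -> str:
    """Wrap text in music notes to nudge Bark toward singing (hypothetical helper)."""
    return f"♪ {text.strip()} ♪"


def speaker_preset(lang: str, index: int) -> str:
    """Build a preset name such as 'v2/en_speaker_6'.

    Assumption: presets follow a 'v2/<lang>_speaker_<n>' naming scheme.
    """
    return f"v2/{lang}_speaker_{index}"


def synthesize(prompt: str,
               preset: Optional[str] = None,
               out_path: str = "bark_generation.wav") -> str:
    """Generate audio for `prompt` and write it to a WAV file (hypothetical helper)."""
    # Imported here so the helpers above can be used without Bark or SciPy installed.
    from bark import SAMPLE_RATE, generate_audio, preload_models
    from scipy.io.wavfile import write as write_wav

    preload_models()  # loads the pretrained checkpoints (downloads them on first use)
    audio = generate_audio(prompt, history_prompt=preset)  # NumPy array of samples
    write_wav(out_path, SAMPLE_RATE, audio)
    return out_path
```

With Bark installed, a call such as `synthesize(as_lyrics("In the jungle, the mighty jungle"), speaker_preset("en", 6))` would write a WAV file to the default path. Since the model is fully generative, the result may not match the prompt exactly, and repeated runs can differ.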