GitHub - suno-ai/bark: 🔊 Text-Prompted Generative Audio Model

Bark by Suno AI is an open-source, multilingual text-to-audio model hosted on GitHub. It uses a transformer-based architecture to generate speech, music, and other audio, supports several languages, and ships with 100+ speaker presets.

  • Bark is an open-source text-to-audio model created by Suno that can generate multilingual speech, music, background noise, and nonverbal communications like laughing and sighing.
  • Bark is transformer-based and provides pretrained model checkpoints for inference and commercial use.
  • Bark is a fully generative text-to-audio model developed for research purposes, so its output may deviate unexpectedly from the prompt.
  • Updates to Bark include licensing changes, speed improvements, long-form generation documentation, and a voice prompt library.
  • Bark supports several languages, determines the language automatically from the input text, and can generate music when the lyrics in a prompt are wrapped in music notes (♪).
  • It offers 100+ speaker presets, longer audio generation options, and hardware support for both CPU and GPU.
  • Bark is built on a GPT-style architecture and directly converts input text prompts to audio without using phonemes.
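
The points above about music prompts, speaker presets, and longer audio can be sketched in Python. This is a minimal illustration, not the project's own long-form recipe. The helpers `make_song_prompt`, `split_sentences`, and `synthesize` are hypothetical names introduced here, and the regex-based sentence splitter is a simplifying assumption. The only Bark calls used are `preload_models`, `generate_audio` with its `history_prompt` argument, and the `SAMPLE_RATE` constant; `v2/en_speaker_6` is assumed to be one of the bundled speaker presets.

```python
import re

MUSIC_NOTE = "♪"


def make_song_prompt(lyrics):
    """Wrap lyrics in music notes, the prompt cue for singing."""
    return f"{MUSIC_NOTE} {lyrics.strip()} {MUSIC_NOTE}"


def split_sentences(text):
    """Naive sentence splitter (an assumption made for this sketch).

    Splits on whitespace that follows '.', '!' or '?'. A real pipeline
    would likely use a proper sentence tokenizer instead.
    """
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def synthesize(text, preset="v2/en_speaker_6"):
    """Generate audio sentence by sentence with one speaker preset.

    Bark and NumPy are imported lazily so the helpers above can be
    used without the model installed. Returns (sample_rate, waveform).
    """
    import numpy as np
    from bark import SAMPLE_RATE, generate_audio, preload_models

    preload_models()  # downloads and caches the checkpoints on first use
    pieces = [
        generate_audio(sentence, history_prompt=preset)
        for sentence in split_sentences(text)
    ]
    # Reusing the same preset for every chunk keeps the voice consistent.
    return SAMPLE_RATE, np.concatenate(pieces)
```

Calling `synthesize(make_song_prompt("la la la"))` would request a sung rendition. Because the model is fully generative, the result may still differ from what the prompt asks for.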