GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
🚀🗣️ Dive into the world of robust speech recognition with OpenAI Whisper! 🎙️🤖 This multitasking model not only recognizes speech but also translates & identifies languages. 🌐🔊 Train your own model today! 🔥 #AI #SpeechRecognition #OpenAIWhisper
- **Repository Name**: OpenAI Whisper
- **Description**: General-purpose speech recognition model for multilingual tasks
- **Model Features**: Multitasking abilities include speech recognition, translation, and language identification
- **Training Approach**: Transformer sequence-to-sequence model trained on various speech processing tasks
- **Dependencies**: Python 3.8-3.11, PyTorch 1.10.1, and OpenAI's tiktoken for tokenizer
- **Installation**: `pip install -U openai-whisper`
- **Model Sizes**: Tiny, Base, Small, Medium, and Large with English-only and Multilingual variants
- **Command-line Usage**: Transcribing audio files, specifying models and languages
- **Python Usage**: Transcribing within Python, detecting language, and decoding audio
- **Additional Models**: Available memory requirements and inference speeds relative to the large model
- **Performance**: Varies based on language, with performance breakdowns provided
- **License**: Released under the MIT License
- **Resources**: Links to README, license, and additional model information
- **Contributors**: 68 contributors to the project