Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text without API costs. Use when: (1) User...
- Initial release of the whisper-stt skill for free, local speech-to-text transcription using OpenAI Whisper. - Supports a range of audio/video input formats (mp3, wav, m4a, ogg, etc.) without API costs. - Multiple output formats available: json, txt, srt, and vtt (for subtitles). - Configurable model sizes for performance vs. accuracy tradeoffs. - Option to specify target language and leverage GPU acceleration if available. - Comprehensive usage instructions and troubleshooting included.