Communityv1.0.0

Audio Video To Text

音视频转文字技能,使用 Whisper 进行语音识别。支持多种音视频格式,可输出纯文本、SRT/VTT 字幕或 JSON 格式。适用于会议记录、视频字幕生成、采访整理、播客转录等场景。

1.5kdownloads1stars8active installsivan830826
View on ClawHubBack to Skills

Skill Details

Slug
audio-video-to-text
Latest Version
1.0.0
Author
ivan830826
Published
Mar 7, 2026
Updated
May 11, 2026
Total Versions
1

How to Install

  1. 1 on OpenClawdBots (takes under 60 seconds).
  2. 2Open your bot dashboard and go to the Skills tab.
  3. 3Switch to the ClawHub tab and search for Audio Video To Text.
  4. 4Click Install and the skill is deployed to your bot automatically.

Changelog — v1.0.0

Initial release of the audio-video-to-text skill. - Converts audio/video files to text using OpenAI Whisper. - Supports multiple formats: txt, SRT, VTT, and JSON. - Handles various audio/video types: MP3, WAV, MP4, AVI, and more. - Allows model selection for speed/accuracy trade-offs. - Suitable for meeting notes, subtitles, interviews, and podcasts.