Turn long text or a file into an audiobook using async TTS, voice selection, and file tools.
Goal
Ask for
Workflow
Response style
Feishu document read/write operations. Activate when user mentions Feishu docs, cloud docs, or docx links.
Extract frames or short clips from videos using ffmpeg.
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
QQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).