Media processing via GODMODE MCP — PDF text extraction, OCR from images, Whisper audio transcription, YouTube download, and image resize/convert. Tools — pdf_to_text, image_to_text, audio_transcribe, youtube_download, image_resize, image_convert.
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Extract frames or short clips from videos using ffmpeg.
Search GIF providers with CLI/TUI, download results, and extract stills/sheets.
QQBot 富媒体收发能力。使用 <qqmedia> 标签,系统根据文件扩展名自动识别类型(图片/语音/视频/文件)。
Capture frames or clips from RTSP/ONVIF cameras.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).