Add or modify speech-to-text providers in assistant-api with transport-aware ingestion (WS/SDK/HTTP), transcript packet correctness, and UI/provider wiring.
Integrate STT providers that emit reliable interim/final transcripts, interruption signaling, and stt_latency_ms metrics.
If user does not answer:
In scope:
api/assistant-api/internal/transformer/<provider>/stt.go (+ provider option/callback helpers)api/assistant-api/internal/transformer/transformer.goapi/assistant-api/internal/type/stt_transformer.go, packet.goOut of scope:
deepgram, assembly-ai, sarvamazure, google, awsInput:
UserAudioPacketRequired outputs:
SpeechToTextPacket (interim/final)InterruptionPacket{Source:"word"} when provider exposes speech-onset word signalConversationEventPacket{Name:"stt", ...}MessageMetricPacket{Name:"stt_latency_ms"} per utterancego test ./api/assistant-api/internal/transformer/... -run TestSTTgo test ./api/assistant-api/internal/transformer/<provider>/...cd ui && yarn test providers./.claude/skills/stt-integration/scripts/validate.sh --check-diff --provider <provider>