SenseAudio

SenseAudio batch speech-to-text for inbound voice notes

SenseAudio can transcribe inbound audio and voice-note attachments through OpenClaw's shared tools.media.audio pipeline. OpenClaw posts multipart audio to the OpenAI-compatible transcription endpoint and injects the returned text as {{Transcript}} plus an [Audio] block.

PropertyValue
Provider idsenseaudio
Pluginbundled, enabledByDefault: true
ContractmediaUnderstandingProviders (audio)
Auth env varSENSEAUDIO_API_KEY
Default modelsenseaudio-asr-pro-1.5-260319
Default URLhttps://api.senseaudio.cn/v1
Websitesenseaudio.cn
Docssenseaudio.cn/docs

Getting started

```bash export SENSEAUDIO_API_KEY="..." ``` ```json5 { tools: { media: { audio: { enabled: true, models: [{ provider: "senseaudio", model: "senseaudio-asr-pro-1.5-260319" }], }, }, }, } ``` Send an audio message through any connected channel. OpenClaw uploads the audio to SenseAudio and uses the transcript in the reply pipeline.

Options

OptionPathDescription
modeltools.media.audio.models[].modelSenseAudio ASR model id
languagetools.media.audio.models[].languageOptional language hint
prompttools.media.audio.promptOptional transcription prompt
baseUrltools.media.audio.baseUrl or modelOverride the OpenAI-compatible base
headerstools.media.audio.request.headersExtra request headers
SenseAudio is batch STT only in OpenClaw. Voice Call realtime transcription continues to use providers with streaming STT support.
  • [Media understanding (audio)](/docs/openclaw-docs/nodes/audio
  • [Model providers](/docs/openclaw-docs/concepts/model-providers