Overview
Transcribe audio files to text using models like Whisper.
The audio file to transcribe. Supported formats: mp3, mp4, mpeg, mpga, m4a,
wav, webm
The model to use (e.g. whisper-1)
ISO-639-1 language code (e.g. en, zh, ja)
Optional text to guide the model’s style or continue a previous segment
(Prompt)
Response Format: json, text, srt, verbose_json, or vtt
Sampling Temperature (0-1)
Timestamp granularity: word, segment
curl -X POST "https://api.applerouter.ai/v1/audio/transcriptions" \
-H "Authorization: Bearer sk-your-api-key" \
-F "file=@audio.mp3" \
-F "model=whisper-1" \
-F "response_format=json"
{
"text": "Hello, this is a test transcription of the audio file."
}
使用 Bearer Token 认证。格式: Authorization: Bearer sk-xxxxxx
Available options:
json,
text,
srt,
verbose_json,
vtt
Available options:
word,
segment