Voice
Transcribe audio recordings into text with speech-to-text. Pair the transcript with Lang2FHIR to turn spoken clinical notes into structured FHIR resources.
1 endpoint
POST /transcribe
Transcribes an uploaded audio recording and returns the transcript. Send the raw audio bytes as the request body; the audio format is detected automatically (WAV, FLAC, MP3, OGG/WebM Opus).
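As a rough illustration of sending raw audio bytes as the request body, the sketch below builds and sends the request with Python's standard library. The base URL, bearer-token header, Content-Type value, and the assumption that the response is JSON with a transcript field are all hypothetical placeholders, not confirmed by this reference; substitute the values for your deployment.

```python
import json
import urllib.request

# Hypothetical placeholders: replace with your real host and credentials.
BASE_URL = "https://api.example.com"
API_TOKEN = "YOUR_TOKEN"


def build_transcribe_request(
    audio_bytes: bytes,
    base_url: str = BASE_URL,
    token: str = API_TOKEN,
) -> urllib.request.Request:
    """Build (but do not send) a POST /transcribe request.

    The raw audio bytes are the entire request body; no multipart
    wrapping is applied.
    """
    return urllib.request.Request(
        url=f"{base_url}/transcribe",
        data=audio_bytes,
        method="POST",
        headers={
            # Assumed auth scheme; the reference does not specify one.
            "Authorization": f"Bearer {token}",
            # Assumed generic binary type, since the format is auto-detected.
            "Content-Type": "application/octet-stream",
        },
    )


def transcribe(audio_bytes: bytes) -> str:
    """Send the audio and return the transcript text."""
    request = build_transcribe_request(audio_bytes)
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    # Assumed response shape: a JSON object with a "transcript" field.
    return payload["transcript"]
```

A typical call would read a file in binary mode and pass its contents, for example transcribe(open("note.wav", "rb").read()).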
Supports up to ~5 minutes of audio per request. This limit is on audio
duration regardless of file size or format, so a compressed recording
within the size limit can still be rejected for being too long. Pair the
transcript with a downstream text step (e.g. POST /lang2fhir/create)
to turn it into a FHIR resource.
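Because the limit applies to audio duration rather than file size, it can help to check the length of a recording before uploading it. The sketch below does this for WAV input only, using Python's standard wave module. The 300-second cutoff is an assumed reading of "~5 minutes", and the function names are illustrative; compressed formats such as MP3 or Opus would need a different decoder.

```python
import io
import wave

# Assumed cutoff: the reference says "~5 minutes", so the exact
# server-side limit may differ slightly.
MAX_SECONDS = 300.0


def wav_duration_seconds(wav_bytes: bytes) -> float:
    """Return the playback duration of a WAV recording in seconds."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        # Duration = number of sample frames / frames per second.
        return wav.getnframes() / wav.getframerate()


def within_duration_limit(
    wav_bytes: bytes, max_seconds: float = MAX_SECONDS
) -> bool:
    """Report whether a WAV recording fits within the duration limit."""
    return wav_duration_seconds(wav_bytes) <= max_seconds
```

Recordings that exceed the limit could be split into shorter segments on the client and transcribed one request at a time.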
Input
Result
transcript: the transcribed text of the recording.