Create Batch Transcription

Create multiple transcriptions in a single batch request.

**Choose a processing mode:**

- **Express mode**: Transcriptions start immediately. Max 10 files. Higher cost. Pass `audio_sources` directly in the request. Pricing for v1 is $1.20/hour.
- **Standard mode**: Transcriptions are processed within 24 hours. Max 500 files. Lower cost. First upload a manifest via [Upload Batch Manifest File](/api-reference/batch-transcribe/upload-batch-file), then pass the `batch_file_id` here. Pricing for v1 batch standard is $0.90/hour. If you need higher limits, contact support@sofer.ai.

All files in the batch share the same transcription settings (model, language, etc.) defined in `info`.

Speaker settings can be provided at the batch level or per item. Per-item `num_speakers` or `auto_detect_speakers` settings take precedence over the batch-level speaker settings in `info`. If an item omits both speaker fields, it inherits the batch-level setting. If neither level provides a speaker setting, the transcription defaults to one speaker. Do not provide both `num_speakers` and `auto_detect_speakers` in the same object.

If you include a `client_item_id` on each item, it must be unique within the batch. You can later resolve a `client_item_id` back to the canonical transcription ID with [Get Batch Transcription By Client Item ID](/api-reference/batch-transcribe/get-batch-transcription-by-client-item-id).

Authentication

Authorization Bearer

Bearer authentication of the form Bearer <token>, where token is your auth token.

Request

This endpoint expects an object.
info object Required

Transcription settings applied to all files in the batch (model, language, etc.).

Batch-level speaker settings are defaults. Per-item num_speakers or auto_detect_speakers settings in audio_sources or a batch manifest take precedence for that item.
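The precedence rule for speaker settings can be sketched client-side as follows. This is a minimal illustration, not the service's implementation: the helper names are hypothetical, representing the one-speaker default as `{"num_speakers": 1}` is an assumption, and so is treating any non-null value as "provided".

```python
from __future__ import annotations

from typing import Any, Mapping

SPEAKER_FIELDS = ("num_speakers", "auto_detect_speakers")


def speaker_setting(obj: Mapping[str, Any]) -> dict[str, Any]:
    """Return the speaker fields set on one object (hypothetical helper)."""
    # Assumption: a field counts as "provided" when it is present and not None.
    present = {k: obj[k] for k in SPEAKER_FIELDS if obj.get(k) is not None}
    if len(present) > 1:
        # The reference says not to send both fields in the same object.
        raise ValueError("provide num_speakers or auto_detect_speakers, not both")
    return present


def resolve_speakers(info: Mapping[str, Any], item: Mapping[str, Any]) -> dict[str, Any]:
    """Per-item setting first, then the batch-level setting, then one speaker."""
    # Assumption: the documented one-speaker default is modelled as num_speakers=1.
    return speaker_setting(item) or speaker_setting(info) or {"num_speakers": 1}
```

With a batch-level `num_speakers` of 2, an item that sets `auto_detect_speakers` resolves to auto-detection, while an item with no speaker fields resolves to two speakers.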

processing_mode enum Optional

Choose how the batch is processed:

  • standard (default): Lower cost, processed within 24 hours. Max 500 files. Use with batch_file_id. Pricing for v1 batch standard is $0.90/hour. If you need higher limits, contact support@sofer.ai.
  • express: Higher cost, starts immediately. Max 10 files. Use with audio_sources. Pricing for v1 is $1.20/hour.
batch_file_id UUID Optional

For standard mode only. ID of a previously uploaded batch manifest.

Get this by calling Upload Batch Manifest File first.
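A standard-mode request body might be assembled like this. The function name is hypothetical, and the contents of `info` are left to the caller because its exact field names are not listed on this page. Checking the ID locally with `uuid.UUID` is a convenience, not a documented requirement.

```python
from __future__ import annotations

import uuid
from typing import Any


def build_standard_request(info: dict[str, Any], batch_file_id: str) -> dict[str, Any]:
    """Build a standard-mode body that references an uploaded manifest (sketch)."""
    # uuid.UUID raises ValueError on malformed input, so a bad ID fails
    # before any request is sent.
    canonical_id = str(uuid.UUID(batch_file_id))
    return {
        "info": info,
        "processing_mode": "standard",
        "batch_file_id": canonical_id,
    }
```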

audio_sources list of objects Optional

For express mode only. List of audio URLs to transcribe (max 10).

Each item needs an audio_url and can optionally include a title, client_item_id, num_speakers, or auto_detect_speakers.

Per-item num_speakers or auto_detect_speakers settings take precedence over the batch-level speaker settings in info.

If you provide client_item_id, it must be unique within the batch and can be used later to look up the resulting transcription.
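The express-mode constraints above (at most 10 items, a required `audio_url`, one speaker field per object, unique `client_item_id` values) can be checked before sending. The sketch below assumes a hypothetical `build_express_request` helper. The server performs its own validation, and its error messages are not documented here.

```python
from __future__ import annotations

from typing import Any, Sequence

MAX_EXPRESS_FILES = 10  # express-mode limit stated in this reference


def build_express_request(
    info: dict[str, Any],
    audio_sources: Sequence[dict[str, Any]],
    batch_title: str | None = None,
) -> dict[str, Any]:
    """Validate items locally and build an express-mode body (sketch)."""
    if len(audio_sources) > MAX_EXPRESS_FILES:
        raise ValueError(f"express mode accepts at most {MAX_EXPRESS_FILES} files")
    seen_ids: set[str] = set()
    for index, item in enumerate(audio_sources):
        if not item.get("audio_url"):
            raise ValueError(f"item {index} is missing audio_url")
        if item.get("num_speakers") is not None and item.get("auto_detect_speakers") is not None:
            raise ValueError(f"item {index} sets both speaker fields")
        client_id = item.get("client_item_id")
        if client_id is not None:
            if client_id in seen_ids:
                raise ValueError(f"duplicate client_item_id: {client_id!r}")
            seen_ids.add(client_id)
    body: dict[str, Any] = {
        "info": info,
        "processing_mode": "express",
        "audio_sources": list(audio_sources),
    }
    if batch_title is not None:
        body["batch_title"] = batch_title
    return body
```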

batch_title string Optional

Default title prefix for transcriptions. Individual items can override this. Items without titles become “{batch_title} - Item 1”, “{batch_title} - Item 2”, etc.
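To preview the titles a batch would receive, the defaulting rule can be approximated locally. This sketch assumes the item number is the item's 1-based position in the batch; the reference does not say whether titled items are skipped when numbering, so treat the output as an estimate. `effective_titles` is a hypothetical helper.

```python
from __future__ import annotations

from typing import Any, Sequence


def effective_titles(batch_title: str, items: Sequence[dict[str, Any]]) -> list[str]:
    """Estimate each item's final title (sketch, numbering is assumed)."""
    # Assumption: "Item N" uses the item's 1-based position in the batch.
    return [
        item.get("title") or f"{batch_title} - Item {position}"
        for position, item in enumerate(items, start=1)
    ]
```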

Response

Returns the batch ID and individual transcription IDs. Use Get Batch Status to track progress.

batch_id UUID

Use this ID to check batch progress via Get Batch Status.

transcription_ids list of UUIDs

IDs for each transcription in the batch. Use these to retrieve individual results via Get Transcription.

total_count integer

Number of transcriptions created.

status enum

Initial batch status (typically RECEIVED)
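A decoded response can be turned into a typed object so later calls to Get Batch Status or Get Transcription work with validated IDs. The class and function names below are hypothetical, and only the four fields listed above are read.

```python
from __future__ import annotations

import uuid
from dataclasses import dataclass
from typing import Any, Mapping


@dataclass(frozen=True)
class BatchCreated:
    batch_id: uuid.UUID
    transcription_ids: list[uuid.UUID]
    total_count: int
    status: str


def parse_batch_response(payload: Mapping[str, Any]) -> BatchCreated:
    """Convert the decoded JSON body into a typed record (sketch)."""
    # KeyError or ValueError here means the body did not match the
    # fields documented in this section.
    return BatchCreated(
        batch_id=uuid.UUID(payload["batch_id"]),
        transcription_ids=[uuid.UUID(t) for t in payload["transcription_ids"]],
        total_count=int(payload["total_count"]),
        status=str(payload["status"]),
    )
```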

Errors

401
Authentication Error
429
Rate Limit Error
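One way a client might react to these two statuses is sketched below. Retrying after a 429 with exponential backoff is a common convention rather than documented behavior of this API, and the function names, base delay, and cap are illustrative assumptions.

```python
from __future__ import annotations


def describe_error(status_code: int) -> tuple[str, bool]:
    """Return (error name, whether retrying later may help) for a status code."""
    # Assumption: only 429 is worth retrying. A 401 needs a corrected token.
    known = {
        401: ("Authentication Error", False),
        429: ("Rate Limit Error", True),
    }
    return known.get(status_code, (f"Unexpected status {status_code}", False))


def backoff_delays(attempts: int, base: float = 1.0, cap: float = 60.0) -> list[float]:
    """Seconds to wait before each retry, doubling each time up to a cap."""
    return [min(cap, base * 2**n) for n in range(attempts)]
```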