Create Batch Transcription | SoferAi

Create multiple transcriptions in a single batch request.

Choose a processing mode:

Express mode: Transcriptions start immediately. Max 10 files. Higher cost. Pass audio_sources directly in the request. Pricing for v1 is $1.20/hour.
Standard mode: Transcriptions processed within 24 hours. Max 500 files. Lower cost. First upload a manifest via Upload Batch Manifest File, then pass the batch_file_id here. Pricing for v1 batch standard is $0.90/hour. If you need higher limits, contact support@sofer.ai.

All files in the batch share the same transcription settings (model, language, etc.) defined in info.

Speaker settings can be provided at the batch level or per item. Per-item num_speakers or auto_detect_speakers settings take precedence over the batch-level speaker settings in info. If an item omits both speaker fields, it inherits the batch-level setting. If neither level provides a speaker setting, the transcription defaults to one speaker. Do not provide both num_speakers and auto_detect_speakers in the same object.

If you include a client_item_id on each item, it must be unique within the batch. You can later resolve a client_item_id back to the canonical transcription ID with Get Batch Transcription By Client Item ID.

Create multiple transcriptions in a single batch request. **Choose a processing mode:** - **Express mode**: Transcriptions start immediately. Max 10 files. Higher cost. Pass `audio_sources` directly in the request. Pricing for v1 is $1.20/hour. - **Standard mode**: Transcriptions processed within 24 hours. Max 500 files. Lower cost. First upload a manifest via [Upload Batch Manifest File](/api-reference/batch-transcribe/upload-batch-file), then pass the `batch_file_id` here. Pricing for v1 batch standard is $0.90/hour. If you need higher limits, contact support@sofer.ai. All files in the batch share the same transcription settings (model, language, etc.) defined in `info`. Speaker settings can be provided at the batch level or per item. Per-item `num_speakers` or `auto_detect_speakers` settings take precedence over the batch-level speaker settings in `info`. If an item omits both speaker fields, it inherits the batch-level setting. If neither level provides a speaker setting, the transcription defaults to one speaker. Do not provide both `num_speakers` and `auto_detect_speakers` in the same object. If you include a `client_item_id` on each item, it must be unique within the batch. You can later resolve a `client_item_id` back to the canonical transcription ID with [Get Batch Transcription By Client Item ID](/api-reference/batch-transcribe/get-batch-transcription-by-client-item-id).

Authentication

AuthorizationBearer

Bearer authentication of the form Bearer <token>, where token is your auth token.

Request

This endpoint expects an object.

infoobjectRequired

Transcription settings applied to all files in the batch (model, language, etc.).

Batch-level speaker settings are defaults. Per-item num_speakers or auto_detect_speakers settings in audio_sources or a batch manifest take precedence for that item.

processing_modeenumOptional

Choose how the batch is processed:

standard (default): Lower cost, processed within 24 hours. Max 500 files. Use with batch_file_id. Pricing for v1 batch standard is $0.90/hour. If you need higher limits, contact support@sofer.ai.
express: Higher cost, starts immediately. Max 10 files. Use with audio_sources. Pricing for v1 is $1.20/hour.

batch_file_idUUIDOptional

For standard mode only. ID of a previously uploaded batch manifest.

Get this by calling Upload Batch Manifest File first.

audio_sourceslist of objectsOptional

For express mode only. List of audio URLs to transcribe (max 10).

Each item needs an audio_url and can optionally include a title, client_item_id, num_speakers, or auto_detect_speakers.

Per-item num_speakers or auto_detect_speakers settings take precedence over the batch-level speaker settings in info.

If you provide client_item_id, it must be unique within the batch and can be used later to look up the resulting transcription.

batch_titlestringOptional

Default title prefix for transcriptions. Individual items can override this. Items without titles become “{batch_title} - Item 1”, “{batch_title} - Item 2”, etc.

Response

Returns the batch ID and individual transcription IDs. Use Get Batch Status to track progress.

batch_idUUID

Use this ID to check batch progress via Get Batch Status

transcription_idslist of UUIDs

IDs for each transcription in the batch. Use these to retrieve individual results via Get Transcription

total_countinteger

Number of transcriptions created

statusenum

Initial batch status (typically RECEIVED)

Errors

401

Authentication Error

429

Rate Limit Error

Create multiple transcriptions in a single batch request.

Choose a processing mode:

Express mode: Transcriptions start immediately. Max 10 files. Higher cost. Pass audio_sources directly in the request. Pricing for v1 is $1.20/hour.
Standard mode: Transcriptions processed within 24 hours. Max 500 files. Lower cost. First upload a manifest via Upload Batch Manifest File, then pass the batch_file_id here. Pricing for v1 batch standard is $0.90/hour. If you need higher limits, contact support@sofer.ai.

All files in the batch share the same transcription settings (model, language, etc.) defined in info.

1	from soferai import SoferAI
2	from soferai.transcribe import BatchTranscriptionRequestInfo, BatchAudioSource
3
4	client = SoferAI(
5	api_key="YOUR_TOKEN_HERE",
6	)
7
8	client.batch_transcribe.create_batch_transcription(
9	info=BatchTranscriptionRequestInfo(
10	model="v1",
11	primary_language="en",
12	hebrew_word_format=[
13	"en",
14	"he"
15	],
16	num_speakers=1,
17	),
18	processing_mode="express",
19	audio_sources=[
20	BatchAudioSource(
21	audio_url="https://example.com/audio1.mp3",
22	title="Rabbi Cohen\'s Shiur on Shabbos",
23	client_item_id="shiur_shabbos",
24	),
25	BatchAudioSource(
26	audio_url="https://example.com/audio2.mp3",
27	title="Rabbi Cohen\'s Shiur on Kashrus",
28	client_item_id="shiur_kashrus",
29	)
30	],
31	batch_title="Rabbi Cohen Shiurim",
32	)

1	{
2	"batch_id": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
3	"transcription_ids": [
4	"d5e9c84f-c2b2-4bf4-b4b0-7ffd7a9ffc32",
5	"e6f0d95a-d3c3-5ca5-c5c1-8aad8b0aad43"
6	],
7	"total_count": 2,
8	"status": "RECEIVED"
9	}