Batches

batches

Methods

Cancel Batch ->
post/batches/{batch_id}/cancel

Cancels an in-progress batch. The batch will be in status cancelling for up to 10 minutes, before changing to cancelled, where it will have partial results (if any) available in the output file.

Create Batch ->
post/batches

Creates and executes a batch from an uploaded file of requests

Security
Bearer Auth

Example: Authorization: Bearer My API Key

Body parameters
completion_window: "24h"

The time frame within which the batch should be processed. Currently only 24h is supported.

endpoint: "/v1/chat/completions" | "/v1/embeddings" | "/v1/completions"

The endpoint to be used for all requests in the batch. Currently /v1/chat/completions, /v1/embeddings, and /v1/completions are supported. Note that /v1/embeddings batches are also restricted to a maximum of 50,000 embedding inputs across all requests in the batch.

input_file_id: string

The ID of an uploaded file that contains requests for the new batch.

See upload file for how to upload a file.

Your input file must be formatted as a JSONL file, and must be uploaded with the purpose batch. The file can contain up to 50,000 requests, and can be up to 100 MB in size.

metadata?: Record<string, string>

Optional custom metadata for the batch.

Response fields
Request example
200Example
List Batch -> CursorPage<>
get/batches

List your organization's batches.

Retrieve Batch ->
get/batches/{batch_id}

Retrieves a batch.

Domain types

Batch = { id, completion_window, created_at, 17 more... }
BatchError = { code, line, message, 1 more... }
BatchRequestCounts = { completed, failed, total }