Set service_tier flex for Batch Prices on the Sync Endpoint

~50% on input + output tokens for latency-tolerant workloads; you trade variable latency and occasional free-to-retry 429s Intermediate 2 min read

Add a single parameter to your OpenAI Responses or Chat Completions calls to pay Batch-API rates without restructuring anything into async batch jobs. You keep a normal synchronous request/response flow and give up only guaranteed speed.

๐Ÿ”’ Pro tip ยท Intermediate

Unlock this tip โ€” and 50 more

This is one of 51 advanced, fact-checked tactics reserved for Pro. Get the full 73-tip library, a searchable archive, and a new tip every morning. Free for 7 days, then $9/mo.

Prefer to browse? The 22 Beginner tips are free forever.