Gemini model support for reasoning is coming soon.
Quick Start
- Chat Completions
- Responses API
Configuration
- Chat Completions
- Responses API
reasoning_effort
| Level | Description |
|---|---|
low | Light reasoning for simple tasks |
medium | Balanced reasoning |
high | Deep reasoning for complex problems |
For Anthropic models, the default is 4096 max completion tokens with 2048 budget reasoning tokens.
Handling Responses
Chat Completions
- Streaming
- Non-Streaming
When streaming, reasoning content arrives in chunks via the
reasoning delta field, followed by content, and finally reasoning_details with the finish reason:Responses API
- Streaming
- Non-Streaming (OpenAI)
- Non-Streaming (Anthropic)
Streaming events follow the Responses API format:
Anthropic models always return
encrypted_content (signatures) in reasoning items. These signatures validate the reasoning chain and are required for multi-turn conversations. Other providers like OpenAI can optionally return signatures when configured.Related
- Responses API - Alternative API format with reasoning support
- Context Editing - Manage context in long reasoning sessions