Reasoning Modes

Crucible exposes three reasoning modes that let you control the tradeoff between speed, depth, and cost per request.

Standard

The default mode. The model processes your input and returns a response without an explicit reasoning chain. Best for classification, labeling, summarization, and extraction tasks where the answer is relatively direct.

Deep

The model works through the problem step by step before producing a final output. The full reasoning chain is returned in the reasoning_trace field of the response. Deep mode is best for multi-step analysis, complex document review, and tasks where accuracy matters more than latency.

Fast

Optimized for latency-sensitive applications. The model produces a response with minimal processing overhead. Best for high-volume, lower-complexity tasks where throughput is the priority.

Choosing a mode

Task type	Recommended mode
Document extraction	Deep
Classification at volume	Fast
Research synthesis	Deep
Real-time query answering	Standard or Fast
Contract review	Deep
Structured data parsing	Standard

You can switch modes per request by passing the reasoning_mode parameter in your API call.

Get Template